Fine-tuning is based on the foundation model version v2024.12.28, and it uses self-prepared instruction datasets for this round of fine-tuning. 44cd75b lianghsun commited on Jan 1
Completed SFT training (5/5 epochs). Preparing for multi-round DPO training. ad1233d lianghsun commited on Nov 27, 2024
Updated model version to v2024.11.25, training progressed to (3/10) epochs. Still in SFT stage, DPO training remains pending. 7967d13 lianghsun commited on Nov 25, 2024
Initial upload: Model version v2024.11.22, training completed up to (1/10) epochs. ac505d8 lianghsun commited on Nov 22, 2024