Qwen2.5-3B-DPO-Batch-8-LR-4e-5 / model-00003-of-00003.safetensors

Commit History

Upload merged Qwen2.5-3B-DPO-Batch-8-LR-4e-5
d22b2b3
verified

Elfsong commited on