esmt-grpo-good-16bit / model-00001-of-00002.safetensors

Commit History

Trained with Unsloth
0197d84
verified

li-ping commited on