phi4-14b-grpo-reasoning-merged_16bit / model-00005-of-00006.safetensors

Commit History

Trained with Unsloth
c23829b
verified

adrianoamalfi committed on

Trained with Unsloth
41445d5
verified

adrianoamalfi committed on