qwen-2.5-3b-grpo-v4 / adapter_model.safetensors

Commit History

Trained with Unsloth
70d24d3
verified

underscore2 commited on