underscore2/qwen-2.5-3b-grpo

text-generation-inference

Commit History

Trained with Unsloth

51e93fb
verified

underscore2 committed on Feb 8

Trained with Unsloth

e99c3c9
verified

underscore2 committed on Feb 8

Trained with Unsloth

c5fbffd
verified

underscore2 committed on Feb 7

Trained with Unsloth

8d67b7c
verified

underscore2 committed on Feb 7

Trained with Unsloth

cec3a42
verified

underscore2 committed on Feb 7

Upload README.md with huggingface_hub

a926051
verified

underscore2 committed on Feb 7

initial commit

6a2191e
verified

underscore2 committed on Feb 7