VerlTool/torl-fsdp_agent-qwen_qwen2.5-math-7b-grpo-n16-b128-t1.0-lr1e-6new-220-step
Safetensors
qwen2
main
Commit History
Upload folder using huggingface_hub
e308c43 (verified), DongfuJiang committed 10 days ago
Upload folder using huggingface_hub
5e0421b (verified), DongfuJiang committed 10 days ago
initial commit
295ee16 (verified), DongfuJiang committed 10 days ago