Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
VerlTool
/
torl-fsdp_agent-qwen_qwen2.5-math-1.5b-grpo-n16-b128-t1.0-lr1e-6torl_same_train-310-step
like
0
Follow
VerlTool
2
Safetensors
qwen2
Model card
Files
Files and versions
Community
5768f2a
torl-fsdp_agent-qwen_qwen2.5-math-1.5b-grpo-n16-b128-t1.0-lr1e-6torl_same_train-310-step
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
DongfuJiang
initial commit
5768f2a
verified
10 days ago
.gitattributes
Safe
1.52 kB
initial commit
10 days ago