Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
EleutherAI
/
unsloth-phi-4-Instruct-LORA-Open-R1-Code-GRPO-b2-as4-lr2en5-vuln
like
0
Follow
EleutherAI
834
Safetensors
Model card
Files
Files and versions
Community
0097db5
unsloth-phi-4-Instruct-LORA-Open-R1-Code-GRPO-b2-as4-lr2en5-vuln
/
training_args.bin
Commit History
Training in progress, step 250
0097db5
verified
davidoj01
commited on
7 days ago
Training in progress, step 50
d3cc456
verified
davidoj01
commited on
8 days ago
Training in progress, step 50
72d042c
verified
davidoj01
commited on
9 days ago
Training in progress, step 300
bde1a6d
verified
davidoj01
commited on
13 days ago
Training in progress, step 150
5cc7d99
verified
davidoj01
commited on
15 days ago
Training in progress, step 50
964c926
verified
davidoj01
commited on
16 days ago