qingyangzhang/Qwen2.5-7B-EMPO-TruthfulQA
Text Generation
Transformers
Safetensors
truthfulqa/truthful_qa
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main
Qwen2.5-7B-EMPO-TruthfulQA/model-00003-of-00004.safetensors
Commit History
768e0ba (verified): Training in progress, epoch 1. qingyangzhang committed on Mar 27
df16343 (verified): Training in progress, epoch 1. qingyangzhang committed on Mar 27
e9b6726 (verified): Training in progress, epoch 1. qingyangzhang committed on Mar 27
c8f744e (verified): Training in progress, epoch 1. qingyangzhang committed on Mar 27
d77e6e8 (verified): Training in progress, epoch 1. qingyangzhang committed on Mar 27
23ac4ef (verified): Training in progress, epoch 1. qingyangzhang committed on Mar 27
bfc1b6b (verified): Training in progress, step 15. qingyangzhang committed on Mar 25
ecca974 (verified): Training in progress, step 10. qingyangzhang committed on Mar 25
ecc89e2 (verified): Training in progress, step 15. qingyangzhang committed on Mar 25
bcec2ac (verified): Training in progress, step 10. qingyangzhang committed on Mar 25
b8b8e7e (verified): Training in progress, step 5. qingyangzhang committed on Mar 25