underscore2/llama3.2-3b-kto-rl-test-v3-stage5
Transformers
Safetensors
English
text-generation-inference
unsloth
llama
trl
License: apache-2.0
Commit History (branch: main)
Trained with Unsloth
3ac63b3
verified
underscore2
committed on Jan 30
Trained with Unsloth
486a6ae
verified
underscore2
committed on Jan 30
Trained with Unsloth
120575a
verified
underscore2
committed on Jan 30
Upload README.md with huggingface_hub
9b29f88
verified
underscore2
committed on Jan 30
initial commit
96324f6
verified
underscore2
committed on Jan 30