underscore2/llama3.2-3b-kto-rl-test-v3-stage5

text-generation-inference

Inference Endpoints

1 contributor

History: 5 commits

Trained with Unsloth

3ac63b3 verified about 2 months ago