Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
spankevich
's Collections
llm-hw-3
llm-hw-2
llm-hw-2
updated
Mar 9
collection of ppo, dpo and reward model
Upvote
1
spankevich/llm-hw-2-dpo
Text Generation
•
Updated
Mar 9
•
3
spankevich/llm-hw-2-ppo
Text Generation
•
Updated
Mar 9
•
3
spankevich/trainer_output
Text Classification
•
Updated
Mar 9
•
1
Upvote
1
Share collection
View history
Collection guide
Browse collections