llm-hw-2 - a spankevich Collection

Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

spankevich 's Collections

llm-hw-2

updated Mar 9

collection of ppo, dpo and reward model

spankevich/llm-hw-2-dpo

Text Generation • Updated Mar 9 • 3
spankevich/llm-hw-2-ppo

Text Generation • Updated Mar 9 • 3
spankevich/trainer_output

Text Classification • Updated Mar 9 • 1

Collection guide
Browse collections

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs