hamishivi's Collections

7b tulu 2.5

updated Mar 4

A small run at 7B scale with PPO, following the "Unpacking DPO and PPO" paper.


  • hamishivi/tulu-v2.5-7b-uf-mean-7b-uf-rm

    Text Generation • Updated Jun 25, 2024 • 7

  • hamishivi/tulu-v2.5-7b-uf-mean-7b-uf-rm-value

    Token Classification • Updated Jun 25, 2024 • 1

  • hamishivi/tulu-v2.5-7b-uf-rm

    Text Classification • Updated Jun 25, 2024 • 4