Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
lomahony 's Collections
Pythia-hh-all-sft-dpo
pythia-helpful-1epoch
pythia-helpful-epoch2
Pythia-helpful 3 epochs

Pythia-hh-all-sft-dpo

updated Mar 12, 2024

Pythia models supervised finetuned and DPO finetuned with all of Anthropic-hh-rlhf dataset for 1 epoch.

Upvote
-

  • lomahony/eleuther-pythia160m-hh-sft

    Text Generation • Updated Aug 12, 2023 • 27

  • lomahony/eleuther-pythia2.8b-hh-sft

    Text Generation • Updated Aug 12, 2023 • 46 • 1

  • lomahony/eleuther-pythia410m-hh-sft

    Text Generation • Updated Aug 12, 2023 • 34

  • lomahony/eleuther-pythia6.9b-hh-dpo

    Text Generation • Updated Aug 12, 2023 • 223

  • lomahony/eleuther-pythia70m-hh-sft

    Text Generation • Updated Aug 12, 2023 • 65

  • lomahony/eleuther-pythia12b-hh-sft0

    Text Generation • Updated Aug 12, 2023 • 20

  • lomahony/eleuther-pythia12b-hh-sft

    Text Generation • Updated Aug 31, 2023 • 7

  • lomahony/eleuther-pythia70m-hh-dpo

    Text Generation • Updated Aug 12, 2023 • 25

  • lomahony/eleuther-pythia160m-hh-dpo

    Text Generation • Updated Aug 12, 2023 • 8

  • lomahony/eleuther-pythia410m-hh-dpo

    Text Generation • Updated Aug 12, 2023 • 20

  • lomahony/eleuther-pythia2.8b-hh-dpo

    Text Generation • Updated Aug 12, 2023 • 27 • 1

  • lomahony/eleuther-pythia12b-hh-dpo

    Text Generation • Updated Aug 31, 2023 • 6

  • lomahony/eleuther-pythia6.9b-hh-sft

    Text Generation • Updated Aug 12, 2023 • 64 • 1
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs