
llm-blender/PairRM

Text Generation
Transformers
Safetensors
English
deberta
reward_model
reward-model
RLHF
evaluation
llm
instruction
reranking

I wonder if there is any possibility that you could make your collected pairwise dataset open source?

1
#4 opened over 1 year ago by fakerbaby

What's the difference between llm-blender/PairRM and llm-blender/pair-ranker?

2
#3 opened over 1 year ago by nefelibata-mu

Run model with transformers?

2
#2 opened over 1 year ago by lewtun

Please add some examples

1
#1 opened over 1 year ago by eramax