argilla/ultrafeedback-binarized-preferences-cleaned Viewer • Updated Dec 11, 2023 • 60.9k • 1.05k • 142
cyberagent/chatbot-arena-ja-calm2-7b-chat-experimental Viewer • Updated Aug 15, 2024 • 29.2k • 207 • 19
argilla/ultrafeedback-multi-binarized-preferences-cleaned Viewer • Updated Dec 11, 2023 • 158k • 51 • 7
NickyNicky/neovalle_H4rmony_dpo_translated_English_to_Spanish Viewer • Updated May 17, 2024 • 2.02k • 16 • 5
argilla/ultrafeedback-multi-binarized-quality-preferences-cleaned Viewer • Updated Dec 11, 2023 • 155k • 41 • 4
mii-community/ultrafeedback-preferences-translated-ita Viewer • Updated Feb 21, 2024 • 60.9k • 35 • 3
NickyNicky/nano_finance_200k_en_es_chatML_gemma_orpo_dpo Viewer • Updated May 29, 2024 • 201k • 8 • 1
trl-internal-testing/hh-rlhf-helpful-base-trl-style Viewer • Updated May 2, 2024 • 46.2k • 5.71k • 10
vwxyzjn/summarize_from_feedback_oai_preprocessing_1706381144 Viewer • Updated Jan 27, 2024 • 179k • 130 • 2
macadeliccc/distilabel-neurology-preferences-2k-orca-format Viewer • Updated Feb 22, 2024 • 1.99k • 15 • 1
trl-internal-testing/descriptiveness-sentiment-trl-style Viewer • Updated Apr 9, 2024 • 10.9k • 1.5k • 1
insub/imdb_prefix20_forDPO_gpt2-large-imdb-FT_siebert_sentiment-roberta-large-english Viewer • Updated Oct 22, 2023 • 50k • 28 • 2