Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
mayankagarwal 's Collections
Function Calling PO Dataset
RLHF + Code

RLHF + Code

updated Nov 22, 2024
Upvote
-

  • Vezora/Code-Preference-Pairs

    Viewer • Updated Jul 28, 2024 • 54k • 85 • 24

  • quangduc1112001/python-code-DPO-fine-tune

    Viewer • Updated Nov 4, 2024 • 2k • 43 • 2

  • xinlai/Math-Step-DPO-10K

    Viewer • Updated Jul 4, 2024 • 10.8k • 633 • 52

  • minfeng-ai/leetcode_preference

    Viewer • Updated Sep 6, 2023 • 457 • 128 • 6

  • Magpie-Align/Magpie-Llama-3.1-Pro-DPO-100K-v0.1

    Viewer • Updated Aug 22, 2024 • 100k • 261 • 5

  • openbmb/UltraInteract_pair

    Viewer • Updated Apr 5, 2024 • 220k • 880 • 107

  • NextWealth/Python-DPO-Large

    Viewer • Updated Jul 2, 2024 • 957 • 34

  • interstellarninja/tool-calls-dpo

    Viewer • Updated Jan 23, 2024 • 235 • 75 • 14
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs