nbeerbower/Dumpling-Mistral-Nemo-8B

Tags: Text Generation, Transformers, Safetensors, mistral, conversational, text-generation-inference

    🧪 Experimental

    An attempt to recover intelligence with a quick training run. Results are meh.

    Dumpling-Mistral-Nemo-8B

    nbeerbower/mistral-nemo-kartoffel-PRUNE3 finetuned on:

    • nbeerbower/GreatFirewall-DPO
    • nbeerbower/Schule-DPO
    • nbeerbower/Purpura-DPO
    • nbeerbower/Arkhaios-DPO
    • jondurbin/truthy-dpo-v0.1
    • antiven0m/physical-reasoning-dpo
    • flammenai/Date-DPO-NoAsterisks
    • flammenai/Prude-Phi3-DPO
    • Atsunori/HelpSteer2-DPO (1,000 samples)
    • jondurbin/gutenberg-dpo-v0.1
    • nbeerbower/gutenberg2-dpo
    • nbeerbower/gutenberg-moderne-dpo
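One way to picture the data mix above is a per-dataset row cap, with only Atsunori/HelpSteer2-DPO limited to 1,000 rows. The sketch below is an illustration, not the author's pipeline: the card does not say how the 1,000 samples were chosen or whether the mix was shuffled, so the random sampling, the seed, and the helper names `subsample` and `build_mix` are all assumptions. Rows are assumed to be plain dicts already loaded into memory.

```python
import random

# Dataset ids from the card. None means "use every row";
# only HelpSteer2-DPO is capped, at the 1,000 samples the card mentions.
MIX = {
    "nbeerbower/GreatFirewall-DPO": None,
    "nbeerbower/Schule-DPO": None,
    "nbeerbower/Purpura-DPO": None,
    "nbeerbower/Arkhaios-DPO": None,
    "jondurbin/truthy-dpo-v0.1": None,
    "antiven0m/physical-reasoning-dpo": None,
    "flammenai/Date-DPO-NoAsterisks": None,
    "flammenai/Prude-Phi3-DPO": None,
    "Atsunori/HelpSteer2-DPO": 1000,
    "jondurbin/gutenberg-dpo-v0.1": None,
    "nbeerbower/gutenberg2-dpo": None,
    "nbeerbower/gutenberg-moderne-dpo": None,
}


def subsample(rows, cap, seed=42):
    """Return all rows, or a seeded random sample of `cap` rows (assumed strategy)."""
    if cap is None or len(rows) <= cap:
        return list(rows)
    return random.Random(seed).sample(rows, cap)


def build_mix(sources, seed=42):
    """Combine rows from several datasets, applying the caps in MIX.

    `sources` maps a dataset id to a list of preference rows.
    Ids missing from MIX are treated as uncapped.
    """
    mixed = []
    for name, rows in sources.items():
        mixed.extend(subsample(rows, MIX.get(name), seed))
    # Shuffling is an assumption; the card does not describe ordering.
    random.Random(seed).shuffle(mixed)
    return mixed
```

With real data, each list in `sources` would come from loading the corresponding dataset; here the function only needs lists of rows, so it can be tried with placeholder data.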

    Method

    QLoRA ORPO tune with 2x RTX 3090 for 2 epochs.
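For readers unfamiliar with ORPO, the objective adds an odds-ratio penalty to the usual supervised loss on the chosen response, so no separate reference model is needed. The sketch below computes that loss for a single preference pair in plain Python. It is a minimal illustration of the published ORPO formula, not the training code used for this model: the weight `lam=0.1` is an arbitrary illustrative value, and the inputs are assumed to be length-averaged log-probabilities of each response. The QLoRA side (a 4-bit quantized base model with trainable LoRA adapters) is not shown.

```python
import math


def log_odds(avg_logp):
    """log(p / (1 - p)) where p = exp(avg_logp) and avg_logp < 0."""
    return avg_logp - math.log1p(-math.exp(avg_logp))


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def orpo_loss(avg_logp_chosen, avg_logp_rejected, lam=0.1):
    """ORPO loss for one (chosen, rejected) pair.

    loss = NLL(chosen) + lam * -log(sigmoid(log_odds(chosen) - log_odds(rejected)))
    `lam` is an illustrative value, not one taken from the model card.
    """
    nll_chosen = -avg_logp_chosen
    log_odds_ratio = log_odds(avg_logp_chosen) - log_odds(avg_logp_rejected)
    # -log(sigmoid(z)) equals softplus(-z)
    odds_ratio_term = softplus(-log_odds_ratio)
    return nll_chosen + lam * odds_ratio_term


if __name__ == "__main__":
    # The penalty shrinks as the chosen response becomes more likely than the rejected one.
    print(orpo_loss(-0.5, -0.5))
    print(orpo_loss(-0.5, -2.0))
```

In practice a trainer from a library would compute these log-probabilities from the model's logits over batches; the scalar version here is only meant to make the shape of the objective concrete.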

    Model size: 8.43B params
    Tensor type: BF16 (Safetensors)
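As a rough guide to what these numbers mean for hardware, the weights-only memory footprint can be estimated from the parameter count and bytes per parameter. The sketch below is back-of-the-envelope arithmetic under stated assumptions: it ignores activations, KV cache, optimizer state, and the small overhead of quantization constants, and the helper name `weight_gb` is hypothetical.

```python
# Parameter count reported on the model card.
PARAMS = 8.43e9

# Approximate storage cost per parameter for a few common formats.
BYTES_PER_PARAM = {
    "bf16": 2.0,  # the tensor type the card reports
    "int8": 1.0,
    "nf4": 0.5,   # 4-bit, as used by QLoRA-style loading (overhead ignored)
}


def weight_gb(n_params, dtype):
    """Weights-only size in gigabytes (1 GB = 1e9 bytes)."""
    return n_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype}: {weight_gb(PARAMS, dtype):.2f} GB")
```

By this estimate the BF16 weights alone take about 16.9 GB, which leaves little headroom on a single 24 GB card such as an RTX 3090, while a 4-bit copy is closer to 4.2 GB.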
    Model tree for nbeerbower/Dumpling-Mistral-Nemo-8B

    • Base model: nbeerbower/Mahou-1.5-mistral-nemo-12B-lorablated
    • Finetuned: nbeerbower/mistral-nemo-kartoffel-12B
    • Finetuned: nbeerbower/mistral-nemo-kartoffel-PRUNE3
    • Finetuned: this model
    • Quantizations: 8 models
