Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
JunxiongWang 's Collections
M1
MambaInLlama_MATH_Reasoning
MambaInLlama-dpo
MambaInLlama-distill
Mamba2InLlama3.2-3B
Mamba-In-Zephyr
Mamba-In-Llama3
Mamba2-In-Llama3
MambaByte

MambaInLlama-dpo

updated Nov 17, 2024

Directly distill from Llama, the finetune in DPO

Upvote
-

  • JunxiongWang/Llama3.1-Mamba2-8B-dpo

    Updated Nov 17, 2024 • 1

  • JunxiongWang/Llama3.1-Mamba-8B-dpo

    Updated Nov 17, 2024

  • JunxiongWang/Llama3.2-Mamba2-3B-dpo

    Updated Nov 17, 2024 • 11

  • JunxiongWang/Llama3.2-Mamba-3B-dpo

    Updated Nov 17, 2024 • 1
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs