Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
JunxiongWang 's Collections
M1
MambaInLlama_MATH_Reasoning
MambaInLlama-dpo
MambaInLlama-distill
Mamba2InLlama3.2-3B
Mamba-In-Zephyr
Mamba-In-Llama3
Mamba2-In-Llama3
MambaByte

MambaInLlama-distill

updated Nov 17, 2024

Directly distill from Llama without doing SFT and DPO

Upvote
-

  • JunxiongWang/Llama3.2-Mamba2-3B-distill

    Updated Nov 17, 2024 • 850

  • JunxiongWang/Llama3.2-Mamba-3B-distill

    Updated Nov 17, 2024 • 13

  • JunxiongWang/Llama3.1-Mamba2-8B-distill

    Updated Nov 17, 2024 • 187

  • JunxiongWang/Llama3.1-Mamba-8B-distill

    Updated Nov 17, 2024 • 4
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs