Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
JunxiongWang 's Collections
M1
MambaInLlama_MATH_Reasoning
MambaInLlama-dpo
MambaInLlama-distill
Mamba2InLlama3.2-3B
Mamba-In-Zephyr
Mamba-In-Llama3
Mamba2-In-Llama3
MambaByte

Mamba2InLlama3.2-3B

updated Nov 17, 2024

Mamba distilled from Llama3.2 3B Instruct. The Mamba in the Llama: Distilling and Accelerating Hybrid Models (https://arxiv.org/abs/2408.15237).

Upvote
-

  • JunxiongWang/Llama3.2-Mamba2-3B-dpo

    Updated Nov 17, 2024 • 11

  • JunxiongWang/Llama3.2-Mamba2-3B-distill

    Updated Nov 17, 2024 • 850

  • JunxiongWang/Llama3.2-Mamba-3B-distill

    Updated Nov 17, 2024 • 13

  • JunxiongWang/Llama3.2-Mamba-3B-dpo

    Updated Nov 17, 2024 • 1
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs