Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
minhopark-neubla 's Collections
efficient llm
model architecture
survey
pretrain
reasoning

model architecture

updated Jan 9, 2024
Upvote
-

  • Mixtral of Experts

    Paper • 2401.04088 • Published Jan 8, 2024 • 159

  • MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts

    Paper • 2401.04081 • Published Jan 8, 2024 • 71

  • TinyLlama: An Open-Source Small Language Model

    Paper • 2401.02385 • Published Jan 4, 2024 • 95

  • LLaMA Pro: Progressive LLaMA with Block Expansion

    Paper • 2401.02415 • Published Jan 4, 2024 • 54

  • LLaVA-φ: Efficient Multi-Modal Assistant with Small Language Model

    Paper • 2401.02330 • Published Jan 4, 2024 • 17
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs