Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
gaunernst 's Collections
DeepSeek testing
Gemma 3 QAT INT4 (from GGUF)
Gemma 3 QAT INT4 (from Flax)
Mini BERT models
Face Recognition Models
LLMs < 1B
LLMs 1B - 2B
LLMs 2B - 4B
Smallish LLM pre-training datasets
Llama2-compatible
Llama3-compatible

DeepSeek testing

updated 30 days ago

A collection of MoE+MLA models, serving as testing proxies for DeepSeek-V3/R1

Upvote
-

  • deepseek-ai/DeepSeek-V2-Lite-Chat

    Text Generation • Updated Jun 25, 2024 • 52.6k • 123

  • gaunernst/DeepSeek-V2-Lite-Chat-FP8

    Updated Apr 7 • 180

  • TechxGenus/DeepSeek-V2-Lite-Chat-AWQ

    Text Generation • Updated Jul 4, 2024 • 1.04k • 2

  • deepseek-ai/DeepSeek-R1

    Text Generation • Updated Mar 27 • 1.27M • • 12.1k

  • meituan/DeepSeek-R1-Block-INT8

    Text Generation • Updated Feb 27 • 518 • 44

  • meituan/DeepSeek-R1-Channel-INT8

    Text Generation • Updated Feb 27 • 10.2k • 26

  • cognitivecomputations/DeepSeek-V3-AWQ

    Text Generation • Updated Mar 29 • 1.42k • 33

  • ISTA-DASLab/DeepSeek-R1-GPTQ-4b-128g-experts

    Text Generation • Updated Apr 8 • 237 • 2
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs