Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Edit Models filters

Inference Providers
fal
Novita
Nscale
Replicate
Fireworks
Together AI
Hyperbolic
Cohere
Cerebras
Nebius AI Studio
SambaNova
HF Inference API
Misc
vLLM
Inference Endpoints
text-generation-inference
4-bit precision
8-bit precision
custom_code

Misc with no match

Eval Results
Merge
text-embeddings-inference
Carbon Emissions
Mixture of Experts

Models

8
Full-text search
Active filters: vLLM

model-scope/glm-4-9b-chat-GPTQ-Int4

Text Generation • Updated Jul 17, 2024 • 18 • 6

model-scope/glm-4-9b-chat-GPTQ-Int8

Text Generation • Updated Jul 23, 2024 • 10 • 2

tclf90/qwen2.5-72b-instruct-gptq-int4

Text Generation • Updated 3 days ago • 23

tclf90/qwen2.5-72b-instruct-gptq-int3

Text Generation • Updated 3 days ago • 6

prithivMLmods/Nu2-Lupi-Qwen-14B

Text Generation • Updated Mar 27 • 2 • 2

mradermacher/Nu2-Lupi-Qwen-14B-GGUF

Updated Mar 29 • 47 • 1

mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF

Updated Mar 29 • 92 • 1

QuantTrio/Qwen3-235B-A22B-GPTQ-Int8

Text Generation • Updated about 23 hours ago
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs