Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

4-bit precision

AutoTrain Compatible

Inference Endpoints

text-generation-inference

8-bit precision

Mixture of Experts

Misc with no match

text-embeddings-inference

Carbon Emissions

Models

248

Full-text search

Active filters: 4bit

Chun121/qwen3-4B-rpg-roleplay

Text Generation • Updated 3 days ago • 223 • 3

mayaeary/pygmalion-6b-4bit-128g

Text Generation • Updated Mar 28, 2023 • 70 • 40

legraphista/dolphin-2.9.2-Phi-3-Medium-abliterated-IMat-GGUF

Text Generation • Updated Jun 3, 2024 • 1.68k • 1

legraphista/Higgs-Llama-3-70B-IMat-GGUF

Text Generation • Updated Jun 6, 2024 • 8.46k • 8

legraphista/glm-4-9b-chat-IMat-GGUF

Text Generation • Updated Jun 20, 2024 • 1.1k • 5

legraphista/RoLlama3-8b-Instruct-IMat-GGUF

Text Generation • Updated Jun 23, 2024 • 732 • 3

ModelCloud/gemma-2-9b-it-gptq-4bit

Text Generation • Updated Jul 9, 2024 • 273 • 4

ModelCloud/Meta-Llama-3.1-8B-Instruct-gptq-4bit

Text Generation • Updated Jul 29, 2024 • 4.35k • 4

legraphista/gemma-2-2b-it-IMat-GGUF

Text Generation • Updated Jul 31, 2024 • 235 • 2

legraphista/Hermes-3-Llama-3.1-8B-IMat-GGUF

Text Generation • Updated Aug 16, 2024 • 1.17k • 1

legraphista/Hermes-3-Llama-3.1-70B-IMat-GGUF

Text Generation • Updated Aug 16, 2024 • 303 • 1

0xroyce/Plutus-Meta-Llama-3.1-8B-Instruct-bnb-4bit

Text Generation • Updated Jan 12 • 128 • 5

legraphista/c4ai-command-r-plus-08-2024-IMat-GGUF

Text Generation • Updated Aug 31, 2024 • 1.15k • 6

ModelCloud/Qwen2.5-Coder-32B-Instruct-gptqmodel-4bit-vortex-v1

Text Generation • Updated Nov 14, 2024 • 309 • 15

ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v2

Text Generation • Updated Dec 18, 2024 • 15 • 16

ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v3

Text Generation • Updated Dec 20, 2024 • 15 • 14

ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v1

Text Generation • Updated Jan 24 • 76 • 5

ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v2

Text Generation • Updated Jan 24 • 647 • 7

vital-ai/watt-tool-70B-awq

Updated Jan 24 • 1.84k • 3

curiousmind147/microsoft-phi-4-AWQ-4bit-GEMM

Text Generation • Updated Feb 4 • 344 • 1

ConfidentialMind/Mistral-Small-24B-Instruct-2501_GPTQ_G128_W4A16_MSE

Text Classification • Updated Feb 18 • 96 • 1

ConfidentialMind/Mistral-Small-24B-Instruct-2501_GPTQ_G32_W4A16

Text Generation • Updated Feb 23 • 176 • 1

Deepak7376/DeepSeek-R1-Distill-Qwen-1.5B-bnb-4bit

Text Generation • Updated Feb 25 • 14 • 1

GainEnergy/ogai-8x7b-4bit

Text Generation • Updated Mar 5 • 1

ModelCloud/QwQ-32B-gptqmodel-4bit-vortex-v1

Text Generation • Updated Mar 9 • 1.18k • 12

Lowkey-Loki/reka-flash-3-mlx-4bit

Updated Mar 11 • 5 • 1

adriabama06/ReaderLM-v2-AWQ

Text Generation • Updated Mar 18 • 7 • 1

adriabama06/DeepCoder-1.5B-Preview-AWQ

Text Generation • Updated 20 days ago • 80 • 2

bubblspace/Bubbl-P4-multimodal-instruct

Updated 18 days ago • 244 • 3

cyberandy/SEOcrate-4B_grpo_new_01

Text Generation • Updated 5 days ago • 43 • 1