-
-
-
-
-
-
Inference Providers
Active filters:
4bit
Chun121/qwen3-4B-rpg-roleplay
Text Generation
•
Updated
•
223
•
3
mayaeary/pygmalion-6b-4bit-128g
Text Generation
•
Updated
•
70
•
40
legraphista/dolphin-2.9.2-Phi-3-Medium-abliterated-IMat-GGUF
Text Generation
•
Updated
•
1.68k
•
1
legraphista/Higgs-Llama-3-70B-IMat-GGUF
Text Generation
•
Updated
•
8.46k
•
8
legraphista/glm-4-9b-chat-IMat-GGUF
Text Generation
•
Updated
•
1.1k
•
5
legraphista/RoLlama3-8b-Instruct-IMat-GGUF
Text Generation
•
Updated
•
732
•
3
ModelCloud/gemma-2-9b-it-gptq-4bit
Text Generation
•
Updated
•
273
•
4
ModelCloud/Meta-Llama-3.1-8B-Instruct-gptq-4bit
Text Generation
•
Updated
•
4.35k
•
4
legraphista/gemma-2-2b-it-IMat-GGUF
Text Generation
•
Updated
•
235
•
2
legraphista/Hermes-3-Llama-3.1-8B-IMat-GGUF
Text Generation
•
Updated
•
1.17k
•
1
legraphista/Hermes-3-Llama-3.1-70B-IMat-GGUF
Text Generation
•
Updated
•
303
•
1
0xroyce/Plutus-Meta-Llama-3.1-8B-Instruct-bnb-4bit
Text Generation
•
Updated
•
128
•
5
legraphista/c4ai-command-r-plus-08-2024-IMat-GGUF
Text Generation
•
Updated
•
1.15k
•
6
ModelCloud/Qwen2.5-Coder-32B-Instruct-gptqmodel-4bit-vortex-v1
Text Generation
•
Updated
•
309
•
15
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v2
Text Generation
•
Updated
•
15
•
16
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v3
Text Generation
•
Updated
•
15
•
14
ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v1
Text Generation
•
Updated
•
76
•
5
ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v2
Text Generation
•
Updated
•
647
•
7
vital-ai/watt-tool-70B-awq
Updated
•
1.84k
•
3
curiousmind147/microsoft-phi-4-AWQ-4bit-GEMM
Text Generation
•
Updated
•
344
•
1
ConfidentialMind/Mistral-Small-24B-Instruct-2501_GPTQ_G128_W4A16_MSE
Text Classification
•
Updated
•
96
•
1
ConfidentialMind/Mistral-Small-24B-Instruct-2501_GPTQ_G32_W4A16
Text Generation
•
Updated
•
176
•
1
Deepak7376/DeepSeek-R1-Distill-Qwen-1.5B-bnb-4bit
Text Generation
•
Updated
•
14
•
1
GainEnergy/ogai-8x7b-4bit
Text Generation
•
Updated
•
1
ModelCloud/QwQ-32B-gptqmodel-4bit-vortex-v1
Text Generation
•
Updated
•
1.18k
•
12
Lowkey-Loki/reka-flash-3-mlx-4bit
adriabama06/ReaderLM-v2-AWQ
Text Generation
•
Updated
•
7
•
1
adriabama06/DeepCoder-1.5B-Preview-AWQ
Text Generation
•
Updated
•
80
•
2
bubblspace/Bubbl-P4-multimodal-instruct
Updated
•
244
•
3
cyberandy/SEOcrate-4B_grpo_new_01
Text Generation
•
Updated
•
43
•
1