Active filters: 8-bit
microsoft/bitnet-b1.58-2B-4T • Text Generation • Updated • 42.6k • 916
mlx-community/GLM-4-32B-0414-8bit • Text Generation • Updated • 343 • 5
MaziyarPanahi/WizardLM-2-7B-GGUF • Text Generation • Updated • 270k • 80
MaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF • Text Generation • Updated • 308k • 95
Vikhrmodels/QVikhr-2.5-1.5B-Instruct-SMPO_MLX-8bit • Text Generation • Updated • 242 • 2
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8 • Image-Text-to-Text • Updated • 1.12k • 3
lmstudio-community/Qwen3-32B-MLX-8bit • Text Generation • Updated • 1.05k • 2
MaziyarPanahi/Qwen3-4B-GGUF • Text Generation • Updated • 36.8k • 2
MaziyarPanahi/Qwen3-14B-GGUF • Text Generation • Updated • 36.1k • 2
mlx-community/Qwen3-30B-A3B-8bit • Text Generation • Updated • 805 • 2
MaziyarPanahi/Qwen3-32B-GGUF • Text Generation • Updated • 19.1k • 2
lmstudio-community/Qwen3-30B-A3B-MLX-8bit • Text Generation • Updated • 574 • 2
Intel/distilbert-base-uncased-distilled-squad-int8-static-inc • Question Answering • Updated • 2.54k • 5
CyberNative/CyberBase-13b • Text Generation • Updated • 45 • 29
asas-ai/jais_13B_8bit • Text Generation • Updated • 81 • 9
viai957/CodeLlama_34b-SQL • Text Generation • Updated • 59 • 1
Qwen/Qwen-7B-Chat-Int8 • Text Generation • Updated • 173 • 8
Qwen/Qwen-14B-Chat-Int8 • Text Generation • Updated • 91 • 6
Qwen/Qwen-1_8B-Chat-Int8 • Text Generation • Updated • 70 • 5
Qwen/Qwen-72B-Chat-Int8 • Text Generation • Updated • 94 • 17
lavawolfiee/Mixtral-8x7B-Instruct-v0.1-offloading-demo • Text Generation • Updated • 383 • 28
Flurin17/whisper-large-v3-peft-swiss-german • Updated • 396 • 5
MaziyarPanahi/BASH-Coder-Mistral-7B-Mistral-7B-Instruct-v0.2-slerp-GGUF • Text Generation • Updated • 71 • 3
MaziyarPanahi/SauerkrautLM-7b-HerO-Mistral-7B-Instruct-v0.1-GGUF • Text Generation • Updated • 81 • 2
MaziyarPanahi/Mixtral-8x7B-v0.1-GGUF • Text Generation • Updated • 124 • 1
MaziyarPanahi/rank_zephyr_7b_v1_full-GGUF • Text Ranking • Updated • 680 • 5
MaziyarPanahi/OPEN-SOLAR-KO-10.7B-GGUF • Text Generation • Updated • 90 • 1
Qwen/Qwen1.5-72B-Chat-GPTQ-Int8 • Text Generation • Updated • 97 • 7
Qwen/Qwen1.5-7B-Chat-GPTQ-Int8 • Text Generation • Updated • 99 • 26
Qwen/Qwen1.5-4B-Chat-GPTQ-Int8 • Text Generation • Updated • 67 • 5