Models: 27,974

Active filters: 8-bit

microsoft/bitnet-b1.58-2B-4T

Text Generation • Updated 1 day ago • 42.6k • 916

mlx-community/GLM-4-32B-0414-8bit

Text Generation • Updated 10 days ago • 343 • 5

MaziyarPanahi/WizardLM-2-7B-GGUF

Text Generation • Updated Apr 15, 2024 • 270k • 80

MaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF

Text Generation • Updated May 22, 2024 • 308k • 95

Vikhrmodels/QVikhr-2.5-1.5B-Instruct-SMPO_MLX-8bit

Text Generation • Updated Feb 3 • 242 • 2

RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8

Image-Text-to-Text • Updated 9 days ago • 1.12k • 3

lmstudio-community/Qwen3-32B-MLX-8bit

Text Generation • Updated 4 days ago • 1.05k • 2

MaziyarPanahi/Qwen3-4B-GGUF

Text Generation • Updated 4 days ago • 36.8k • 2

MaziyarPanahi/Qwen3-14B-GGUF

Text Generation • Updated 4 days ago • 36.1k • 2

mlx-community/Qwen3-30B-A3B-8bit

Text Generation • Updated 4 days ago • 805 • 2

MaziyarPanahi/Qwen3-32B-GGUF

Text Generation • Updated 3 days ago • 19.1k • 2

lmstudio-community/Qwen3-30B-A3B-MLX-8bit

Text Generation • Updated 4 days ago • 574 • 2

Intel/distilbert-base-uncased-distilled-squad-int8-static-inc

Question Answering • Updated Mar 29, 2024 • 2.54k • 5

CyberNative/CyberBase-13b

Text Generation • Updated May 16, 2024 • 45 • 29

asas-ai/jais_13B_8bit

Text Generation • Updated Oct 25, 2023 • 81 • 9

viai957/CodeLlama_34b-SQL

Text Generation • Updated May 4, 2024 • 59 • 1

Qwen/Qwen-7B-Chat-Int8

Text Generation • Updated Dec 13, 2023 • 173 • 8

Qwen/Qwen-14B-Chat-Int8

Text Generation • Updated Dec 13, 2023 • 91 • 6

Qwen/Qwen-1_8B-Chat-Int8

Text Generation • Updated Dec 13, 2023 • 70 • 5

Qwen/Qwen-72B-Chat-Int8

Text Generation • Updated Jan 4, 2024 • 94 • 17

lavawolfiee/Mixtral-8x7B-Instruct-v0.1-offloading-demo

Text Generation • Updated Dec 30, 2023 • 383 • 28

Flurin17/whisper-large-v3-peft-swiss-german

Updated Feb 26, 2024 • 396 • 5

MaziyarPanahi/BASH-Coder-Mistral-7B-Mistral-7B-Instruct-v0.2-slerp-GGUF

Text Generation • Updated Jan 26, 2024 • 71 • 3

MaziyarPanahi/SauerkrautLM-7b-HerO-Mistral-7B-Instruct-v0.1-GGUF

Text Generation • Updated Jan 29, 2024 • 81 • 2

MaziyarPanahi/Mixtral-8x7B-v0.1-GGUF

Text Generation • Updated Feb 4, 2024 • 124 • 1

MaziyarPanahi/rank_zephyr_7b_v1_full-GGUF

Text Ranking • Updated 30 days ago • 680 • 5

MaziyarPanahi/OPEN-SOLAR-KO-10.7B-GGUF

Text Generation • Updated Feb 4, 2024 • 90 • 1

Qwen/Qwen1.5-72B-Chat-GPTQ-Int8

Text Generation • Updated Apr 30, 2024 • 97 • 7

Qwen/Qwen1.5-7B-Chat-GPTQ-Int8

Text Generation • Updated Apr 30, 2024 • 99 • 26

Qwen/Qwen1.5-4B-Chat-GPTQ-Int8

Text Generation • Updated Apr 30, 2024 • 67 • 5