Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

8-bit precision

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

Mixture of Experts

text-embeddings-inference

Carbon Emissions

Models

27,966

Full-text search

Active filters: 8-bit

microsoft/bitnet-b1.58-2B-4T

Text Generation • Updated about 24 hours ago • 42.6k • 912

mlx-community/GLM-4-32B-0414-8bit

Text Generation • Updated 9 days ago • 343 • 5

MaziyarPanahi/WizardLM-2-7B-GGUF

Text Generation • Updated Apr 15, 2024 • 253k • 80

Vikhrmodels/QVikhr-2.5-1.5B-Instruct-SMPO_MLX-8bit

Text Generation • Updated Feb 3 • 242 • 2

RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8

Image-Text-to-Text • Updated 8 days ago • 1.12k • 3

lmstudio-community/Qwen3-32B-MLX-8bit

Text Generation • Updated 3 days ago • 1.05k • 2

MaziyarPanahi/Qwen3-4B-GGUF

Text Generation • Updated 3 days ago • 36.8k • 2

MaziyarPanahi/Qwen3-14B-GGUF

Text Generation • Updated 3 days ago • 36.1k • 2

MaziyarPanahi/Qwen3-32B-GGUF

Text Generation • Updated 3 days ago • 19.1k • 2

lmstudio-community/Qwen3-30B-A3B-MLX-8bit

Text Generation • Updated 3 days ago • 574 • 2

Intel/distilbert-base-uncased-distilled-squad-int8-static-inc

Question Answering • Updated Mar 29, 2024 • 2.51k • 5

CyberNative/CyberBase-13b

Text Generation • Updated May 16, 2024 • 40 • 29

asas-ai/jais_13B_8bit

Text Generation • Updated Oct 25, 2023 • 72 • 9

viai957/CodeLlama_34b-SQL

Text Generation • Updated May 4, 2024 • 54 • 1

Qwen/Qwen-7B-Chat-Int8

Text Generation • Updated Dec 13, 2023 • 166 • 8

Qwen/Qwen-14B-Chat-Int8

Text Generation • Updated Dec 13, 2023 • 170 • 6

Qwen/Qwen-1_8B-Chat-Int8

Text Generation • Updated Dec 13, 2023 • 61 • 5

Qwen/Qwen-72B-Chat-Int8

Text Generation • Updated Jan 4, 2024 • 85 • 17

lavawolfiee/Mixtral-8x7B-Instruct-v0.1-offloading-demo

Text Generation • Updated Dec 30, 2023 • 398 • 28

Flurin17/whisper-large-v3-peft-swiss-german

Updated Feb 26, 2024 • 471 • 5

MaziyarPanahi/BASH-Coder-Mistral-7B-Mistral-7B-Instruct-v0.2-slerp-GGUF

Text Generation • Updated Jan 26, 2024 • 75 • 3

MaziyarPanahi/SauerkrautLM-7b-HerO-Mistral-7B-Instruct-v0.1-GGUF

Text Generation • Updated Jan 29, 2024 • 78 • 2

MaziyarPanahi/Mixtral-8x7B-v0.1-GGUF

Text Generation • Updated Feb 4, 2024 • 117 • 1

MaziyarPanahi/rank_zephyr_7b_v1_full-GGUF

Text Ranking • Updated 29 days ago • 696 • 5

MaziyarPanahi/OPEN-SOLAR-KO-10.7B-GGUF

Text Generation • Updated Feb 4, 2024 • 85 • 1

Qwen/Qwen1.5-72B-Chat-GPTQ-Int8

Text Generation • Updated Apr 30, 2024 • 92 • 7

Qwen/Qwen1.5-7B-Chat-GPTQ-Int8

Text Generation • Updated Apr 30, 2024 • 91 • 26

Qwen/Qwen1.5-4B-Chat-GPTQ-Int8

Text Generation • Updated Apr 30, 2024 • 64 • 5

Qwen/Qwen1.5-1.8B-Chat-GPTQ-Int8

Text Generation • Updated Apr 30, 2024 • 77 • 2

Qwen/Qwen1.5-0.5B-Chat-GPTQ-Int8

Text Generation • Updated Apr 30, 2024 • 105 • 4