Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

Misc with no match

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

8

Full-text search

Active filters: vLLM

model-scope/glm-4-9b-chat-GPTQ-Int4

Text Generation • Updated Jul 17, 2024 • 18 • 6

model-scope/glm-4-9b-chat-GPTQ-Int8

Text Generation • Updated Jul 23, 2024 • 10 • 2

tclf90/qwen2.5-72b-instruct-gptq-int4

Text Generation • Updated 3 days ago • 23

tclf90/qwen2.5-72b-instruct-gptq-int3

Text Generation • Updated 3 days ago • 6

prithivMLmods/Nu2-Lupi-Qwen-14B

Text Generation • Updated Mar 27 • 2 • 2

mradermacher/Nu2-Lupi-Qwen-14B-GGUF

Updated Mar 29 • 47 • 1

mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF

Updated Mar 29 • 92 • 1

QuantTrio/Qwen3-235B-A22B-GPTQ-Int8

Text Generation • Updated about 23 hours ago