Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

Misc with no match

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

14

Full-text search

Active filters: smooth_quant

internlm/internlm3-8b-instruct-smoothquant-int8

Text Generation • Updated Jan 15 • 38 • 4

internlm/internlm3-8b-instruct-smoothquant-fp8

Text Generation • Updated Jan 17 • 53 • 1

fabiolecca/almawave-velvet-14b-int8

Updated Feb 1 • 28 • 2

noneUsername/Orca-2-13b-w8-lmdeploy

Updated Nov 11, 2024 • 10

G-reen/Qwen2.5-Coder-32b-Instruct-Fp8

Updated Feb 10 • 18

G-reen/Mistral-Small-2501-Instruct-Fp8

Updated Feb 9 • 14

radna/r1-14b-fp8

Updated 18 days ago • 56

radna/r1-7b-fp8

Updated 19 days ago • 12

radna/r1-14b-int8

Updated 18 days ago • 19

radna/r1-14b-float8_e4m3fn

Updated 18 days ago • 11

radna/r1-14b-float8_e5m2

Updated 18 days ago • 15

radna/r1-14b-int8-mid

Updated 18 days ago • 25

radna/r1-14b-float8_e4m3fn-mid

Updated 18 days ago • 28

radna/r1-14b-float8_e5m2-mid

Updated 18 days ago • 16