Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Together AI
Hyperbolic
Cohere
fal
Nebius AI Studio
SambaNova
Fireworks
Cerebras
Novita
Replicate
HF Inference API
Misc
Reset Misc
int8
Inference Endpoints
AutoTrain Compatible
8-bit precision
text-generation-inference
Eval Results
text-embeddings-inference
custom_code
4-bit precision
Misc with no match
Merge
Carbon Emissions
Mixture of Experts
Apply filters
Models
266
Full-text search
Edit filters
Sort: Trending
Active filters:
int8
Clear all
RedHatAI/Qwen2.5-1.5B-quantized.w8a16
Text Generation
•
Updated
Nov 26, 2024
•
6
RedHatAI/Qwen2.5-3B-quantized.w8a16
Text Generation
•
Updated
Nov 26, 2024
•
1
RedHatAI/Qwen2.5-7B-quantized.w8a16
Text Generation
•
Updated
Nov 26, 2024
•
42
•
1
RedHatAI/Qwen2.5-32B-quantized.w8a16
Text Generation
•
Updated
Nov 26, 2024
•
13
RedHatAI/Qwen2.5-72B-quantized.w8a16
Text Generation
•
Updated
Nov 26, 2024
•
8
avans06/Meta-Llama-3.1-8B-Instruct-ct2-int8_float16
Text Generation
•
Updated
Oct 10, 2024
•
11
avans06/Meta-Llama-3.2-8B-Instruct-ct2-int8_float16
Text Generation
•
Updated
Oct 13, 2024
•
23
minpeter/Qwen-Qwen2.5-14B-Instruct-fmo-int8
Updated
Nov 8, 2024
•
3
minpeter/Qwen-Qwen2.5-32B-Instruct-fmo-int8
Updated
Nov 8, 2024
•
15
SteveTran/T5-small-query-expansion-INT8
Text2Text Generation
•
Updated
Nov 16, 2024
•
5
mradermacher/ecastera-eva-westlake-7b-spanish-GGUF
Updated
Dec 22, 2024
•
300
NeoChen1024/Dolphin3.0-Llama3.1-8B-W8A8
Updated
15 days ago
•
8
NeoChen1024/dolphin-2.9.3-mistral-7B-32k-W8A8
Updated
Jan 6
RedHatAI/granite-3.1-8b-instruct-quantized.w8a8
Text Generation
•
Updated
Feb 28
•
2.83k
•
1
RedHatAI/granite-3.1-2b-instruct-quantized.w8a8
Text Generation
•
Updated
Feb 28
•
371
RedHatAI/granite-3.1-2b-base-quantized.w8a8
Text Generation
•
Updated
Feb 28
•
327
RedHatAI/granite-3.1-8b-base-quantized.w8a8
Text Generation
•
Updated
Feb 28
•
138
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w4a16
Text Generation
•
Updated
Jan 31
•
162
RedHatAI/DeepSeek-R1-Distill-Llama-8B-quantized.w8a8
Text Generation
•
Updated
Feb 27
•
209
•
2
RedHatAI/DeepSeek-R1-Distill-Qwen-14B-quantized.w8a8
Text Generation
•
Updated
Feb 27
•
140
•
2
RedHatAI/DeepSeek-R1-Distill-Qwen-7B-quantized.w8a8
Text Generation
•
Updated
Feb 27
•
5.72k
•
3
RedHatAI/DeepSeek-R1-Distill-Qwen-1.5B-quantized.w8a8
Text Generation
•
Updated
Feb 27
•
97
•
1
RedHatAI/Pixtral-Large-Instruct-2411-hf-quantized.w8a8
Image-Text-to-Text
•
Updated
Mar 31
•
26
RedHatAI/phi-4-quantized.w8a8
Text Generation
•
Updated
17 days ago
•
76
labaispeak/stable-diffusion-2-1-openvino-int8
Text-to-Image
•
Updated
Mar 25
RedHatAI/Qwen2.5-7B-Instruct-quantized.w4a16
Text Generation
•
Updated
17 days ago
•
116
Previous
1
...
7
8
9
Next