-
-
-
-
-
-
Inference Providers
Active filters:
int4
modelscope/Yi-1.5-9B-Chat-GPTQ
Text Generation
•
Updated
•
12
•
1
modelscope/Yi-1.5-9B-Chat-AWQ
Text Generation
•
Updated
•
5
modelscope/Yi-1.5-34B-Chat-GPTQ
Text Generation
•
Updated
•
14
•
1
jojo1899/Phi-3-mini-128k-instruct-ov-int4
Text Generation
•
Updated
•
5
jojo1899/Llama-2-13b-chat-hf-ov-int4
Text Generation
•
Updated
•
5
jojo1899/Mistral-7B-Instruct-v0.2-ov-int4
Text Generation
•
Updated
•
17
model-scope/glm-4-9b-chat-GPTQ-Int4
Text Generation
•
Updated
•
6
•
6
ModelCloud/Mistral-Nemo-Instruct-2407-gptq-4bit
Text Generation
•
Updated
•
25
•
4
ModelCloud/Meta-Llama-3.1-8B-Instruct-gptq-4bit
Text Generation
•
Updated
•
4.67k
•
4
ModelCloud/Meta-Llama-3.1-8B-gptq-4bit
Text Generation
•
Updated
•
41
ModelCloud/Meta-Llama-3.1-70B-Instruct-gptq-4bit
Text Generation
•
Updated
•
213
•
4
ModelCloud/Mistral-Large-Instruct-2407-gptq-4bit
Text Generation
•
Updated
•
5
•
1
angeloc1/llama3dot1SimilarProcesses4
Text Generation
•
Updated
•
6
angeloc1/llama3dot1DifferentProcesses4
Text Generation
•
Updated
•
5
ModelCloud/Meta-Llama-3.1-405B-Instruct-gptq-4bit
Text Generation
•
Updated
•
2
•
2
RedHatAI/Meta-Llama-3.1-70B-Instruct-quantized.w4a16
Text Generation
•
Updated
•
10.1k
•
32
ModelCloud/EXAONE-3.0-7.8B-Instruct-gptq-4bit
Updated
•
13
•
3
RedHatAI/Meta-Llama-3.1-405B-Instruct-quantized.w4a16
Text Generation
•
Updated
•
455
•
12
angeloc1/llama3dot1FoodDel4v05
Text Generation
•
Updated
•
4
zzzmahesh/Meta-Llama-3-8B-Instruct-quantized.w4a4
Text Generation
•
Updated
•
49
•
1
ModelCloud/GRIN-MoE-gptq-4bit
joshmiller656/Llama3.2-1B-AWQ-INT4
Updated
•
33
Advantech-EIOT/intel_llama-3.1-8b-instruct
Updated
RedHatAI/Qwen2.5-7B-quantized.w4a16
Text Generation
•
Updated
•
107
joshmiller656/Llama-3.1-Nemotron-70B-Instruct-AWQ-INT4
Text Generation
•
Updated
•
520
•
2
ModelCloud/Llama-3.2-1B-Instruct-gptqmodel-4bit-vortex-v1
Text Generation
•
Updated
•
324
•
2
jojo1899/llama-3_1-8b-instruct-ov-int4
ModelCloud/Llama-3.2-1B-Instruct-gptqmodel-4bit-vortex-v2
Text Generation
•
Updated
•
28
•
3
ModelCloud/Llama-3.2-3B-Instruct-gptqmodel-4bit-vortex-v3
Text Generation
•
Updated
•
101
•
5
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation
•
Updated
•
2