Inference Providers
Active filters: ollama
prithivMLmods/Llama-3.2-1B-GGUF • Text Generation • 85 • 2
DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters • 110
itlwas/Llama-SmolTalk-3.2-1B-Instruct-Q4_K_M-GGUF • Text Generation • 19 • 1
andresdegante/papalia3 • Text Generation • 1
GainEnergy/ogai-reasoner • Text Generation • 3
mradermacher/ReasonableLlama3-3B-Jr-GGUF
dipeshmajithia/mirror_dolly • 7 • 1
DevItachi/Robin • 121 • 2
pacozaa/mistral-unsloth-chatml-first • 47
pacozaa/tinyllama-alpaca-lora
pacozaa/bonito-gguf
pacozaa/TinyLlama-1.1B-intermediate-step-1431k-3T-GGUF • 46
pacozaa/mistral-sharegpt90k
pacozaa/mistral-sharegpt90k-merged_16bit • Text Generation • 2
TrabEsrever/dolphin-2.9-llama3-70b-GGUF
daekeun-ml/Phi-3-medium-4k-instruct-ko-poc-gguf-v0.1 • Text Generation • 15 • 1
hierholzer/Llama-3.1-70B-Instruct-GGUF • Text Generation • 50 • 3
LucasInsight/Meta-Llama-3.1-8B-Instruct • 5 • 1
LucasInsight/Meta-Llama-3-8B-Instruct
Shyamnath/Llama-3.2-3b-Uncensored-GGUF • Text Generation • 17 • 3
ghost-x/ghost-8b-beta-1608-gguf • Text Generation • 111 • 6
cahaj/Phi-3.5-mini-instruct-text2sql-GGUF • 49
Agnuxo/Tinytron-Qwen-0.5B-Instruct_CODE_Python_Spanish_English_16bit
Agnuxo/Tinytron-Qwen-0.5B-TinyLlama-Instruct_CODE_Python-extra_small_quantization_GGUF_3bit
Agnuxo/Tinytron-Qwen-0.5B-Instruct_CODE_Python-Spanish_English_GGUF_4bit
Agnuxo/Tinytron-Qwen-0.5B-TinyLlama-Instruct_CODE_Python-Spanish_English_GGUF_q5_k
Agnuxo/Tinytron-Qwen-0.5B-TinyLlama-Instruct_CODE_Python-Spanish_English_GGUF_q6_k
Agnuxo/Tinytron-Qwen-0.5B-Instruct_CODE_Python-GGUF_Spanish_English_8bit
Agnuxo/Tinytron-Qwen-0.5B-Instruct_CODE_Python_English_GGUF_16bit
Agnuxo/Tinytron-Qwen-0.5B-TinyLlama-Instruct_CODE_Python-Spanish_English_GGUF_32bit