Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

Inference Endpoints

4-bit precision

text-generation-inference

AutoTrain Compatible

Misc with no match

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

141

Full-text search

Active filters: llama.cpp

google/gemma-1.1-7b-it-GGUF

Updated Jun 27, 2024 • 6 • 20

google/gemma-1.1-2b-it-GGUF

Updated Jun 27, 2024 • 3 • 20

pacozaa/bonito-gguf

Updated Apr 14, 2024 • 7

pmking27/PrathameshLLM-2B-GGUF

Updated Apr 9, 2024 • 6.47k • 1

teleprint-me/cyberpunk-valerie-v0.1

Text Generation • Updated Apr 18, 2024 • 39 • 1

qwp4w3hyb/Meta-Llama-3-8B-Instruct-iMat-GGUF

Text Generation • Updated Apr 29, 2024 • 525 • 6

mgonzs13/Mistroll-7B-v2.2-GGUF

Text Generation • Updated Apr 29, 2024 • 21

mgonzs13/ladybird-base-7B-v8-GGUF

Text Generation • Updated Apr 29, 2024 • 33

google/codegemma-1.1-2b-GGUF

Text Generation • Updated Jun 27, 2024 • 7

google/codegemma-1.1-7b-it-GGUF

Text Generation • Updated Jun 27, 2024 • 3 • 14

mgonzs13/TextBase-7B-v0.1-GGUF

Text Generation • Updated Jun 11, 2024 • 99

QuantFactory/TextBase-7B-v0.1-GGUF

Text Generation • Updated Jun 18, 2024 • 78

njwright92/ComicBot_v.2-gguf

Text Generation • Updated Aug 30, 2024 • 70

Irathernotsay/qwen2-1.5B-medical_qa-Finetune

Text Generation • Updated Jul 17, 2024 • 5

palusi/Qwen2-0.5B-Instruct-GGUF

Updated Jun 27, 2024 • 55

XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k

Text Generation • Updated Jul 9, 2024 • 15

ruslanmv/Medical-Llama3-v2-Q4_K_M-GGUF

Updated Jun 30, 2024 • 3

XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k-GGUF

Text Generation • Updated Jul 9, 2024 • 15

XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k-GPTQ

Text Generation • Updated Jul 9, 2024 • 13

zhhan/Phi-3-mini-4k-instruct_gguf_derived

Summarization • Updated Jul 2, 2024 • 33

XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k-AWQ

Text Generation • Updated Jul 9, 2024

mgonzs13/stablelm-zephyr-3B-localmentor-GGUF

Text Generation • Updated Jul 3, 2024 • 130

akshathmangudi/llama3.1-8b-gguf

Updated Jul 26, 2024

jhilburn/gemma-inference

Text Generation • Updated Aug 7, 2024

ghost-x/ghost-8b-beta-1608-gguf

Text Generation • Updated Aug 26, 2024 • 111 • 6

PaulJusst/codegemma-7b-it-GGUF

Text Generation • Updated Sep 13, 2024

TheCluster/Llama-3.2-3B-Instruct-GGUF

Text Generation • Updated Sep 25, 2024 • 13

v000000/Typhon-Mixtral-v1-imatrix-v2.Q6_K-GGUF

Updated Sep 26, 2024 • 9 • 1

LPN64/LongCite-llama3.1-8b-GGUF

Text Generation • Updated Oct 1, 2024 • 200 • 6

cstr/Ministral-8B-Instruct-2410-GGUF

Updated Oct 17, 2024 • 5 • 1