Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Replicate
SambaNova
Together AI
Novita
Nebius AI Studio
Hyperbolic
fal
Cerebras
Fireworks
Cohere
HF Inference API
Misc
Reset Misc
llama.cpp
Inference Endpoints
4-bit precision
text-generation-inference
AutoTrain Compatible
Merge
Eval Results
Misc with no match
8-bit precision
custom_code
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
141
Full-text search
Edit filters
Sort: Trending
Active filters:
llama.cpp
Clear all
google/gemma-1.1-7b-it-GGUF
Updated
Jun 27, 2024
•
6
•
20
google/gemma-1.1-2b-it-GGUF
Updated
Jun 27, 2024
•
3
•
20
pacozaa/bonito-gguf
Updated
Apr 14, 2024
•
7
pmking27/PrathameshLLM-2B-GGUF
Updated
Apr 9, 2024
•
6.47k
•
1
teleprint-me/cyberpunk-valerie-v0.1
Text Generation
•
Updated
Apr 18, 2024
•
39
•
1
qwp4w3hyb/Meta-Llama-3-8B-Instruct-iMat-GGUF
Text Generation
•
Updated
Apr 29, 2024
•
525
•
6
mgonzs13/Mistroll-7B-v2.2-GGUF
Text Generation
•
Updated
Apr 29, 2024
•
21
mgonzs13/ladybird-base-7B-v8-GGUF
Text Generation
•
Updated
Apr 29, 2024
•
33
google/codegemma-1.1-2b-GGUF
Text Generation
•
Updated
Jun 27, 2024
•
7
google/codegemma-1.1-7b-it-GGUF
Text Generation
•
Updated
Jun 27, 2024
•
3
•
14
mgonzs13/TextBase-7B-v0.1-GGUF
Text Generation
•
Updated
Jun 11, 2024
•
99
QuantFactory/TextBase-7B-v0.1-GGUF
Text Generation
•
Updated
Jun 18, 2024
•
78
njwright92/ComicBot_v.2-gguf
Text Generation
•
Updated
Aug 30, 2024
•
70
Irathernotsay/qwen2-1.5B-medical_qa-Finetune
Text Generation
•
Updated
Jul 17, 2024
•
5
palusi/Qwen2-0.5B-Instruct-GGUF
Updated
Jun 27, 2024
•
55
XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k
Text Generation
•
Updated
Jul 9, 2024
•
15
ruslanmv/Medical-Llama3-v2-Q4_K_M-GGUF
Updated
Jun 30, 2024
•
3
XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k-GGUF
Text Generation
•
Updated
Jul 9, 2024
•
15
XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k-GPTQ
Text Generation
•
Updated
Jul 9, 2024
•
13
zhhan/Phi-3-mini-4k-instruct_gguf_derived
Summarization
•
Updated
Jul 2, 2024
•
33
XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k-AWQ
Text Generation
•
Updated
Jul 9, 2024
mgonzs13/stablelm-zephyr-3B-localmentor-GGUF
Text Generation
•
Updated
Jul 3, 2024
•
130
akshathmangudi/llama3.1-8b-gguf
Updated
Jul 26, 2024
jhilburn/gemma-inference
Text Generation
•
Updated
Aug 7, 2024
ghost-x/ghost-8b-beta-1608-gguf
Text Generation
•
Updated
Aug 26, 2024
•
111
•
6
PaulJusst/codegemma-7b-it-GGUF
Text Generation
•
Updated
Sep 13, 2024
TheCluster/Llama-3.2-3B-Instruct-GGUF
Text Generation
•
Updated
Sep 25, 2024
•
13
v000000/Typhon-Mixtral-v1-imatrix-v2.Q6_K-GGUF
Updated
Sep 26, 2024
•
9
•
1
LPN64/LongCite-llama3.1-8b-GGUF
Text Generation
•
Updated
Oct 1, 2024
•
200
•
6
cstr/Ministral-8B-Instruct-2410-GGUF
Updated
Oct 17, 2024
•
5
•
1
Previous
1
2
3
4
5
Next