Gemmasutra-Small-4B-v1 GGUF Quantized Models

Model Information

Base Model: TheDrummer/Gemmasutra-Small-4B-v1
Quantized by: matrixportal
Format: GGUF (for llama.cpp compatible tools)
Quantized on: 2025-04-09

Recommended Downloads

Q4_K_M: gemmasutra-small-4b-v1.q4_k_m.gguf
Q4_0: gemmasutra-small-4b-v1.q4_0.gguf
Q8_0: gemmasutra-small-4b-v1.q8_0.gguf

All Available Quantizations

Quantization	File
F16	`gemmasutra-small-4b-v1.f16.gguf`
Q2_K	`gemmasutra-small-4b-v1.q2_k.gguf`
Q3_K_M	`gemmasutra-small-4b-v1.q3_k_m.gguf`
Q4_0	`gemmasutra-small-4b-v1.q4_0.gguf`
Q4_K_M	`gemmasutra-small-4b-v1.q4_k_m.gguf`
Q5_K_M	`gemmasutra-small-4b-v1.q5_k_m.gguf`
Q6_K	`gemmasutra-small-4b-v1.q6_k.gguf`
Q8_0	`gemmasutra-small-4b-v1.q8_0.gguf`

Usage Instructions

Download the desired GGUF file, then load it in a compatible tool:
- llama.cpp
- Ollama
- LM Studio
- GPT4All
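
As a rough illustration of the download-then-load flow, the sketch below fetches one of the files listed in this card and runs a single completion. Several things here are assumptions rather than part of this card: the helper names (`gguf_filename`, `download_url`, `download`, `generate`) are hypothetical, the URL assumes the Hub's usual `/resolve/main/` pattern on the default branch, and `generate` assumes the separately installed `llama-cpp-python` bindings rather than the llama.cpp command-line tools.

```python
import urllib.request
from pathlib import Path

REPO_ID = "matrixportal/Gemmasutra-Small-4B-v1-GGUF"
# Quantization suffixes taken from the file list in this card.
AVAILABLE_QUANTS = ("f16", "q2_k", "q3_k_m", "q4_0", "q4_k_m", "q5_k_m", "q6_k", "q8_0")


def gguf_filename(quant: str) -> str:
    """Return the file name for a quantization label such as 'Q4_K_M'."""
    key = quant.lower()
    if key not in AVAILABLE_QUANTS:
        raise ValueError(f"unknown quantization: {quant!r}")
    return f"gemmasutra-small-4b-v1.{key}.gguf"


def download_url(quant: str) -> str:
    """Build the direct download URL for one file."""
    # Assumption: files live on the default "main" branch of the repository.
    return f"https://huggingface.co/{REPO_ID}/resolve/main/{gguf_filename(quant)}"


def download(quant: str, dest_dir: str = ".") -> Path:
    """Download one GGUF file into dest_dir, skipping it if already present."""
    target = Path(dest_dir) / gguf_filename(quant)
    if not target.exists():
        urllib.request.urlretrieve(download_url(quant), target)
    return target


def generate(model_path: Path, prompt: str, max_tokens: int = 64) -> str:
    """Run a single completion; assumes `pip install llama-cpp-python`."""
    # Imported lazily so the download helpers work without the bindings.
    from llama_cpp import Llama

    llm = Llama(model_path=str(model_path), n_ctx=2048)
    result = llm(prompt, max_tokens=max_tokens)
    return result["choices"][0]["text"]
```

Calling `download("Q4_K_M")` and passing the returned path to `generate` would exercise the whole flow. The download is several gigabytes, so the existence check avoids fetching the file twice.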

💡 Tip: Q4_K_M offers the best balance between file size and output quality for most use cases.
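
To make the size side of that trade-off concrete, here is a back-of-the-envelope estimate based on the 3.86B parameter count listed for this model. It is only a sketch: it uses nominal bit widths, whereas real GGUF quantization types also store scales and metadata, so actual files will be somewhat larger than these figures. The function name `nominal_size_gb` is hypothetical.

```python
PARAMS = 3.86e9  # parameter count listed for this model

# Nominal bits per weight for each precision tier. Assumption: actual GGUF
# files carry extra scale data and metadata, so treat these as lower bounds.
NOMINAL_BITS = {
    "2-bit": 2, "3-bit": 3, "4-bit": 4, "5-bit": 5,
    "6-bit": 6, "8-bit": 8, "16-bit": 16,
}


def nominal_size_gb(bits: int, params: float = PARAMS) -> float:
    """Approximate weight storage in GB (10^9 bytes) at `bits` per weight."""
    return params * bits / 8 / 1e9


for label, bits in NOMINAL_BITS.items():
    print(f"{label:>6}: roughly {nominal_size_gb(bits):.2f} GB or more")
```

By this estimate a 4-bit file needs about a quarter of the space of the 16-bit one, which is why the 4-bit variants are a common default on memory-constrained machines.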

Model size: 3.86B params
Architecture: gemma2
