Gemmasutra-Small-4B-v1 GGUF Quantized Models
Model Information
- Base Model: TheDrummer/Gemmasutra-Small-4B-v1
- Quantized by: matrixportal
- Format: GGUF (for llama.cpp compatible tools)
- Quantized on: 2025-04-09
Recommended Downloads
- Q4_K_M:
gemmasutra-small-4b-v1.q4_k_m.gguf
- Q4_0:
gemmasutra-small-4b-v1.q4_0.gguf
- Q8_0:
gemmasutra-small-4b-v1.q8_0.gguf
All Available Quantizations
File | Download |
---|---|
gemmasutra-small-4b-v1.f16.gguf |
Download |
gemmasutra-small-4b-v1.q2_k.gguf |
Download |
gemmasutra-small-4b-v1.q3_k_m.gguf |
Download |
gemmasutra-small-4b-v1.q4_0.gguf |
Download |
gemmasutra-small-4b-v1.q4_k_m.gguf |
Download |
gemmasutra-small-4b-v1.q5_k_m.gguf |
Download |
gemmasutra-small-4b-v1.q6_k.gguf |
Download |
gemmasutra-small-4b-v1.q8_0.gguf |
Download |
Usage Instructions
๐ก Tip: Q4_K_M offers the best balance for most use cases.
- Downloads last month
- 320
Hardware compatibility
Log In
to view the estimation
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
Model tree for matrixportal/Gemmasutra-Small-4B-v1-GGUF
Base model
TheDrummer/Gemmasutra-Small-4B-v1