Text Generation
Transformers
GGUF
English
axolotl
Merge

layerskip-llama3.2-1B GGUF Quantized Models

Technical Details

  • Quantization Tool: llama.cpp
  • Version: 5092 (d3bd7193)

Model Information

Available Files

💡 Q4_K_M provides the best balance of file size and output quality for most use cases
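A minimal sketch of loading the recommended quantization from Python, assuming the third-party `llama-cpp-python` and `huggingface_hub` packages are installed. The exact file names in this repository are not listed here, so the glob pattern `*Q4_K_M.gguf` is an assumption about how the files are named; adjust it to match the actual file list. The helper names `quant_glob` and `load_model` are hypothetical.

```python
# Repository id as shown on this model page.
REPO_ID = "matrixportal/layerskip-llama3.2-1B-GGUF"


def quant_glob(quant: str) -> str:
    """Build a glob for a quantization level.

    Assumption: file names end with the quantization label, e.g. '...Q4_K_M.gguf'.
    """
    return f"*{quant}.gguf"


def load_model(quant: str = "Q4_K_M", n_ctx: int = 2048):
    """Download (if needed) and load one GGUF file from the repository."""
    # Imported lazily so this module can be imported without llama-cpp-python.
    from llama_cpp import Llama

    return Llama.from_pretrained(
        repo_id=REPO_ID,
        filename=quant_glob(quant),  # glob matched against files in the repo
        n_ctx=n_ctx,                 # context window to allocate
        verbose=False,
    )
```

With the model loaded via `llm = load_model()`, a completion can be requested with `llm("Once upon a time", max_tokens=32)`, and the generated text is found under `["choices"][0]["text"]` in the returned dictionary.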

  • Format: GGUF
  • Model size: 1.24B params
  • Architecture: llama

  • Available quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit, 16-bit
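To get a rough feel for how the bit width affects download size, the parameter count (1.24B) can be multiplied by the nominal bits per weight. The sketch below does only that arithmetic. It is a lower bound rather than the real file size: llama.cpp quantization formats also store per-block scales, keep some tensors at higher precision, and add metadata, so actual files are somewhat larger. The function name `estimate_size_gb` is hypothetical.

```python
PARAMS = 1.24e9  # parameter count stated on this model page


def estimate_size_gb(bits_per_weight: float, params: float = PARAMS) -> float:
    """Approximate weight storage in GB (10^9 bytes) at a nominal bit width.

    Ignores quantization scales, mixed-precision tensors, and file metadata,
    so real GGUF files will be larger than this estimate.
    """
    return params * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    # Nominal bit widths listed for this repository.
    for bits in (2, 3, 4, 5, 6, 8, 16):
        print(f"{bits:>2}-bit: ~{estimate_size_gb(bits):.2f} GB")
```

At 4 bits per weight, for instance, the weights alone come to roughly 0.62 GB, versus about 2.48 GB at 16 bits.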

Repository: matrixportal/layerskip-llama3.2-1B-GGUF