GGUF
English

TinyLlama 1.1B Chat v0.3 - GGUF

Support for calm

These models support the calm language model runner. The particular quants selected for this repo are in support of calm, which is a language model runner that automatically uses the right prompts, templates, context size, etc.

Downloads last month
378
GGUF
Model size
1.1B params
Architecture
llama

4-bit

6-bit

16-bit

Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model authors have turned it off explicitly.

Model tree for iandennismiller/TinyLlama-1.1B-Chat-v0.3-GGUF

Quantized
(4)
this model

Datasets used to train iandennismiller/TinyLlama-1.1B-Chat-v0.3-GGUF