TinyLlama 1.1B Chat v0.3 - GGUF

Model creator: Zhang Peiyuan
Original model: TinyLlama 1.1B Chat v0.3
TheBloke quant: TinyLlama-1.1B-Chat-v0.3-GGUF

Support for `calm`

These models support the calm language model runner. The particular quants selected for this repo are in support of calm, which is a language model runner that automatically uses the right prompts, templates, context size, etc.

Downloads last month: 378

GGUF

Model size

1.1B params

Architecture

llama

4-bit

6-bit

16-bit

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The model authors have turned it off explicitly.

Model tree for iandennismiller/TinyLlama-1.1B-Chat-v0.3-GGUF

Base model

TinyLlama/TinyLlama-1.1B-Chat-v0.3

Quantized

(4)

this model

iandennismiller
/

TinyLlama-1.1B-Chat-v0.3-GGUF

TinyLlama 1.1B Chat v0.3 - GGUF

Support for `calm`

Model tree for iandennismiller/TinyLlama-1.1B-Chat-v0.3-GGUF

Datasets used to train iandennismiller/TinyLlama-1.1B-Chat-v0.3-GGUF

TinyLlama 1.1B Chat v0.3 - GGUF

Support for calm

Model tree for iandennismiller/TinyLlama-1.1B-Chat-v0.3-GGUF

Datasets used to train iandennismiller/TinyLlama-1.1B-Chat-v0.3-GGUF

Support for `calm`