
Foundation Text-Generation Models Below 360M Parameters
Good candidates for fine-tuning and for running on mobile devices with Wllama or Transformers.js, ordered by parameter count from largest to smallest.
Text Generation • 100k downloads • 48 likes • License: apache-2.0 • Context Length: 8k

PleIAs/Pleias-350m-Preview
448 downloads • 22 likes • License: apache-2.0 • Context Length: 2k

OuteAI/Lite-Oute-1-300M
Text Generation • 320 downloads • 7 likes • License: apache-2.0 • Context Length: 4k

keeeeenw/MicroLlama
Text Generation • 2.75k downloads • 48 likes • License: apache-2.0 • Context Length: 2k

cerebras/Cerebras-GPT-256M
Text Generation • 211 downloads • 25 likes • License: apache-2.0 • Context Length: 2k

UUFO-Aigis/Pico-OpenLAiNN-250M
5 downloads • 3 likes • License: apache-2.0 • Context Length: 2k

upstage/TinySolar-248m-4k
Text Generation • 325 downloads • 7 likes • License: apache-2.0 • Context Length: 4k

M4-ai/TinyMistral-248M-v3
Text Generation • 68 downloads • 8 likes • License: apache-2.0 • Context Length: 2k

MiniLLM/MiniPLM-llama3.1-212M
Text Generation • 160 downloads • 4 likes • License: apache-2.0 • Context Length: 1k

MiniLLM/MiniPLM-Qwen-200M
Text Generation • 213 downloads • 4 likes • License: apache-2.0 • Context Length: 1k

princeton-nlp/Sheared-Pythia-160m
Text Generation • 15 downloads • 4 likes • License: apache-2.0 • Context Length: 2k

JackFram/llama-160m
Text Generation • 329k downloads • 34 likes • License: apache-2.0 • Context Length: 2k

SmallDoge/Doge-160M
Text Generation • 155 downloads • 5 likes • License: apache-2.0 • Context Length: 2k

EleutherAI/pythia-160m
Text Generation • 122k downloads • 31 likes • License: apache-2.0 • Context Length: 2k

openai-community/gpt2
Text Generation • 12M downloads • 2.71k likes • License: mit • Context Length: 1k

HuggingFaceTB/SmolLM2-135M
Text Generation • 616k downloads • 88 likes • License: apache-2.0 • Context Length: 8k

amd/AMD-Llama-135m
Text Generation • 10.4k downloads • 112 likes • License: apache-2.0 • Context Length: 2k

MiniLLM/MiniPLM-Mamba-130M
Text Generation • 19 downloads • 3 likes • License: apache-2.0 • Context Length: 1k

EleutherAI/gpt-neo-125m
Text Generation • 168k downloads • 205 likes • License: mit • Context Length: 2k

cerebras/Cerebras-GPT-111M
Text Generation • 9.09k downloads • 76 likes • License: apache-2.0 • Context Length: 2k

BEE-spoke-data/smol_llama-101M-GQA
Text Generation • 634 downloads • 28 likes • License: apache-2.0 • Context Length: 1k

UUFO-Aigis/Pico-OpenLAiNN-100M
1 download • 1 like • License: apache-2.0 • Context Length: 2k

Felladrin/Qwen2-96M
Text Generation • 16 downloads • 2 likes • License: apache-2.0 • Context Length: 8k

Felladrin/Minueza-2-96M
Text Generation • 590 downloads • 6 likes • License: apache-2.0 • Context Length: 4k

distilbert/distilgpt2
Text Generation • 3.1M downloads • 530 likes • License: apache-2.0 • Context Length: 1k

weiser/82M-0.4
Text Generation • 13 downloads • 1 like • License: apache-2.0 • Context Length: 1k

BEE-spoke-data/smol_llama-81M-tied
Text Generation • 14 downloads • 6 likes • License: apache-2.0 • Context Length: 1k

EleutherAI/pythia-70m
107k downloads • 66 likes • License: apache-2.0 • Context Length: 2k

JackFram/llama-68m
Text Generation • 543k downloads • 27 likes • License: apache-2.0 • Context Length: 2k

OuteAI/Lite-Oute-1-65M
Text Generation • 130 downloads • 9 likes • License: apache-2.0 • Context Length: 2k

SmallDoge/Doge-60M
Text Generation • 32 downloads • 4 likes • License: apache-2.0 • Context Length: 2k

Felladrin/Minueza-32M-Base
Text Generation • 75 downloads • 18 likes • License: apache-2.0 • Context Length: 2k

GerbilLab/Gerbil-A-32m
Text Generation • 22 downloads • 2 likes • License: apache-2.0 • Context Length: 2k

EleutherAI/pythia-31m
Text Generation • 37.5k downloads • 5 likes • License: apache-2.0 • Context Length: 2k

SmallDoge/Doge-20M
Text Generation • 49 downloads • 9 likes • License: apache-2.0 • Context Length: 2k

EleutherAI/pythia-14m
Text Generation • 194k downloads • 23 likes • License: apache-2.0 • Context Length: 2k
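
When shortlisting models from this list for a mobile target, a rough weight-size estimate can help. The sketch below is illustrative only: the `Model` class, the `CATALOG` subset, and both helper functions are hypothetical names introduced here. Parameter counts are assumed from the repository names rather than measured, and the bytes-per-weight figures are common approximations that ignore quantization overhead, embeddings stored at other precisions, and runtime memory such as the KV cache.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    repo: str
    params_millions: float  # assumption: read off the repository name
    context_k: int          # context length in thousands of tokens, as listed above
    license: str


# A small subset of the list above, used only for illustration.
CATALOG = [
    Model("upstage/TinySolar-248m-4k", 248, 4, "apache-2.0"),
    Model("HuggingFaceTB/SmolLM2-135M", 135, 8, "apache-2.0"),
    Model("Felladrin/Minueza-2-96M", 96, 4, "apache-2.0"),
    Model("EleutherAI/pythia-70m", 70, 2, "apache-2.0"),
    Model("EleutherAI/pythia-14m", 14, 2, "apache-2.0"),
]

# Approximate storage cost per weight; real quantized files are somewhat larger.
BYTES_PER_WEIGHT = {"fp32": 4.0, "fp16": 2.0, "q8": 1.0, "q4": 0.5}


def estimated_weight_mb(model: Model, precision: str) -> float:
    """Millions of parameters times bytes per weight gives megabytes (10^6 bytes)."""
    return model.params_millions * BYTES_PER_WEIGHT[precision]


def shortlist(catalog, max_mb: float, precision: str, min_context_k: int):
    """Return repos that fit the size budget and context need, largest first."""
    fitting = [
        m for m in catalog
        if estimated_weight_mb(m, precision) <= max_mb and m.context_k >= min_context_k
    ]
    fitting.sort(key=lambda m: m.params_millions, reverse=True)
    return [m.repo for m in fitting]


if __name__ == "__main__":
    for precision in ("q8", "q4"):
        print(precision, shortlist(CATALOG, max_mb=100, precision=precision, min_context_k=4))
```

Under these assumptions, a 100 MB budget with at least a 4k context admits only the 96M model at 8-bit precision, while 4-bit precision also admits the 135M model. Actual file sizes depend on the export format, so check the published artifacts before committing to a model.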