Wan2.1 14B T2V LoRAs Collection A collection of Remade's Wan2.1 14B T2V LoRAs • 20 items • Updated Mar 27 • 12
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation Paper • 2504.02542 • Published Apr 3 • 44
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 19 days ago • 185
xLAM-2 Collection A family of Large Action Model for multi-turn conversation and tool-use • 10 items • Updated 1 day ago • 13
💫StarVector Models Collection StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. • 2 items • Updated Mar 20 • 93
Babel Collection Open Multilingual Large Language Models Serving Over 90% of Global Speakers • 5 items • Updated 22 days ago • 17
PixArt-Alpha Collection This collection organize all the PixArt-Alpha related models, datasets and so on. • 9 items • Updated May 4, 2024 • 4
ViTPose Collection Collection for ViTPose models based on transformers implementation. • 10 items • Updated Jan 12 • 13
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 1 day ago • 224
Dolphin 3.0 Collection Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. Designed to be the ultimate general purpose local model. • 9 items • Updated Feb 7 • 144
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated Apr 3 • 147