A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone Paper • 2505.12781 • Published 15 days ago • 2
Low-Rank Clone (LRC) Collection Model checkpoints for paper "A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone". • 10 items • Updated 14 days ago • 1
view article Article SmolLM - blazingly fast and remarkably powerful By loubnabnl and 2 others • Jul 16, 2024 • 374
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12, 2024 • 133