TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models Paper • 2504.20605 • Published 9 days ago • 12
Sadeed: Advancing Arabic Diacritization Through Small Language Model Paper • 2504.21635 • Published 8 days ago • 54
A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis Paper • 2504.12322 • Published 27 days ago • 28
Arabic Speech Datasets Collection Best Datasets for Arabic Speech Tasks • 3 items • Updated 4 days ago • 1
WebThinker: Empowering Large Reasoning Models with Deep Research Capability Paper • 2504.21776 • Published 7 days ago • 41
Phi-4 (All Versions) Collection Microsoft's Phi-4 models including Reasoning + Reasoning Plus & mini. Includes Dynamic 2.0 GGUF, 4-bit & 16-bit versions. Includes Unsloth's bug fixes • 20 items • Updated 6 days ago • 68
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 7 days ago • 80
SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement Paper • 2504.07934 • Published 27 days ago • 18