Scalable Chain of Thoughts via Elastic Reasoning Paper • 2505.05315 • Published 6 days ago • 22
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs Paper • 2504.17432 • Published 20 days ago • 38
March 2025 - Open releases from the Chinese community Collection 30 items • Updated Apr 2 • 12
MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents Paper • 2503.01935 • Published Mar 3 • 27
TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation Paper • 2503.04872 • Published Mar 6 • 15
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 229
Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference Jan 16 • 73
Centurio Collection Artifacts of the paper "Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model" • 6 items • Updated Feb 4 • 4