SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights Paper β’ 2410.09008 β’ Published Oct 11, 2024 β’ 17
HuggingFaceTB/SmolVLM2-256M-Video-Instruct Image-Text-to-Text β’ Updated 13 days ago β’ 6.25k β’ 42
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning Paper β’ 2502.14768 β’ Published 27 days ago β’ 46
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment Paper β’ 2502.16894 β’ Published 23 days ago β’ 27
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition β’ Updated 6 days ago β’ 696k β’ 1.18k
Physics of Language Models: Part 1, Context-Free Grammar Paper β’ 2305.13673 β’ Published May 23, 2023 β’ 7
LoRA: Low-Rank Adaptation of Large Language Models Paper β’ 2106.09685 β’ Published Jun 17, 2021 β’ 37
Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems Paper β’ 2408.16293 β’ Published Aug 29, 2024 β’ 26
Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process Paper β’ 2407.20311 β’ Published Jul 29, 2024 β’ 5
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws Paper β’ 2404.05405 β’ Published Apr 8, 2024 β’ 10
Physics of Language Models: Part 3.2, Knowledge Manipulation Paper β’ 2309.14402 β’ Published Sep 25, 2023 β’ 7
Physics of Language Models: Part 3.1, Knowledge Storage and Extraction Paper β’ 2309.14316 β’ Published Sep 25, 2023 β’ 8
DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks Paper β’ 2502.17157 β’ Published 23 days ago β’ 51
Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning Paper β’ 2503.05641 β’ Published 12 days ago β’ 1