Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers Paper • 2504.20752 • Published Apr 29 • 90
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published Apr 30 • 44
AF Adapter: Continual Pretraining for Building Chinese Biomedical Language Model Paper • 2211.11363 • Published Nov 21, 2022 • 1
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning Paper • 2405.12130 • Published May 20, 2024 • 51
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning Paper • 2403.17919 • Published Mar 26, 2024 • 16
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection Paper • 2403.03507 • Published Mar 6, 2024 • 189
ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language Models Paper • 2403.16187 • Published Mar 24, 2024
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published 27 days ago • 166
AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning Paper • 2505.16400 • Published 11 days ago • 28
Model Merging in Pre-training of Large Language Models Paper • 2505.12082 • Published 15 days ago • 35
A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone Paper • 2505.12781 • Published 14 days ago • 2
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space Paper • 2505.15778 • Published 11 days ago • 15