NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning Paper • 2504.13941 • Published 24 days ago • 10
The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs Paper • 2504.17768 • Published 15 days ago • 13
Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning Paper • 2504.16656 • Published 17 days ago • 55
Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory Paper • 2504.19413 • Published 12 days ago • 12
RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning Paper • 2504.20073 • Published 15 days ago • 9
Taming the Titans: A Survey of Efficient LLM Inference Serving Paper • 2504.19720 • Published 12 days ago • 10
Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think Paper • 2504.20708 • Published 11 days ago • 22
Softpick: No Attention Sink, No Massive Activations with Rectified Softmax Paper • 2504.20966 • Published 10 days ago • 26
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published 10 days ago • 38
WebThinker: Empowering Large Reasoning Models with Deep Research Capability Paper • 2504.21776 • Published 9 days ago • 43
A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency Paper • 2505.01658 • Published 7 days ago • 30
SWE-smith: Scaling Data for Software Engineering Agents Paper • 2504.21798 • Published 9 days ago • 8
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published 4 days ago • 88
OSUniverse: Benchmark for Multimodal GUI-navigation AI Agents Paper • 2505.03570 • Published 3 days ago • 6
LLM-Independent Adaptive RAG: Let the Question Speak for Itself Paper • 2505.04253 • Published 3 days ago • 8