DeepCritic: Deliberate Critique with Large Language Models Paper • 2505.00662 • Published 6 days ago • 46
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published 8 days ago • 37
WebThinker: Empowering Large Reasoning Models with Deep Research Capability Paper • 2504.21776 • Published 7 days ago • 41
Search-R1 Collection • Preliminary checkpoints with outcome-only RL • 14 items • Updated about 1 month ago • 5
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published Feb 20 • 103
RAG-Star: Enhancing Deliberative Reasoning with Retrieval Augmented Verification and Refinement Paper • 2412.12881 • Published Dec 17, 2024 • 2
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 276
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Paper • 2501.06186 • Published Jan 10 • 66
VideoRAG: Retrieval-Augmented Generation over Video Corpus Paper • 2501.05874 • Published Jan 10 • 72
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14 • 289
LLM4SR: A Survey on Large Language Models for Scientific Research Paper • 2501.04306 • Published Jan 8 • 37
URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics Paper • 2501.04686 • Published Jan 8 • 54
Virgo: A Preliminary Exploration on Reproducing o1-like MLLM Paper • 2501.01904 • Published Jan 3 • 34