Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective Paper • 2505.15045 • Published 12 days ago • 52
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing Paper • 2505.21600 • Published 5 days ago • 64
Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO Paper • 2505.22453 • Published 4 days ago • 42
Distilling LLM Agent into Small Models with Retrieval and Code Tools Paper • 2505.17612 • Published 10 days ago • 75