Understanding R1-Zero-Like Training: A Critical Perspective Paper • 2503.20783 • Published 22 days ago • 40
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning Paper • 2503.07365 • Published Mar 10 • 57