Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published 9 days ago • 88
Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models Paper • 2502.16033 • Published Feb 22 • 18
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos Paper • 2406.08407 • Published Jun 12, 2024 • 29
Discriminative Diffusion Models as Few-shot Vision and Language Learners Paper • 2305.10722 • Published May 18, 2023 • 3