Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps Paper • 2505.18675 • Published 10 days ago • 23
VeriThinker: Learning to Verify Makes Reasoning Model Efficient Paper • 2505.17941 • Published 11 days ago • 24
Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding Paper • 2505.16990 • Published 12 days ago • 20