Submitted by foggyforest 81 Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models · 22 authors 1
Submitted by scofield7419 56 On Path to Multimodal Generalist: General-Level and General-Bench · 32 authors 5
Submitted by vvibt 18 Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models · 13 authors 3
Submitted by akhaliq 10 Generating Physically Stable and Buildable LEGO Designs from Text · 6 authors 1
Submitted by WHB139426 10 StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant · 9 authors 1
Submitted by shengz 8 X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains · 12 authors 2
Submitted by arianhosseini 4 Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers · 5 authors 1
Submitted by PALIN2018 4 BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese · 16 authors 1
Submitted by RanjanSapkota 3 Vision-Language-Action Models: Concepts, Progress, Applications and Challenges · 4 authors 1
Submitted by dogtooth 3 SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning · 2 authors 1