VLM-Reasoner/details_._ckpt_Qwen2.5-VL-Instruct-3B-se-v1-460step Viewer • Updated 9 days ago • 2.22k • 23
VLM-Reasoner/details_._ckpt_Qwen2.5-VL-Instruct-3B-se-v1-460step Viewer • Updated 9 days ago • 2.22k • 23
VLM-Reasoner/details_VLM-Reasoner__Qwen2.5-VL-3B-Instruct-se-v1-80step Viewer • Updated 11 days ago • 3.3k • 32
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL Paper • 2503.07536 • Published Mar 10 • 86
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL Paper • 2503.07536 • Published Mar 10 • 86
Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks Paper • 2410.24032 • Published Oct 31, 2024 • 10