VLM R1 Referral Expression 💬 Mark regions in images based on text descriptions • Running on Zero • 62
VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model Paper • 2504.07615 • Published 14 days ago • 30
omlab/Qwen2.5VL-3B-VLM-R1-REC-500steps Zero-Shot Object Detection • Updated 11 days ago • 770 • 22