BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset Paper • 2505.09568 • Published 18 days ago • 85
GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning Paper • 2505.11049 • Published 17 days ago • 58
Emerging Properties in Unified Multimodal Pretraining Paper • 2505.14683 • Published 12 days ago • 124
One RL to See Them All: Visual Triple Unified Reinforcement Learning Paper • 2505.18129 • Published 9 days ago • 56