Fostering Video Reasoning via Next-Event Prediction Paper • 2505.22457 • Published 6 days ago • 27 • 2
Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment Paper • 2505.21494 • Published 7 days ago • 8
BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms Paper • 2505.15141 • Published 13 days ago • 4
QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design Paper • 2505.16175 • Published 13 days ago • 39
Optimizing Anytime Reasoning via Budget Relative Policy Optimization Paper • 2505.13438 • Published 15 days ago • 35
On Evaluating Adversarial Robustness of Large Vision-Language Models Paper • 2305.16934 • Published May 26, 2023
Intriguing Properties of Data Attribution on Diffusion Models Paper • 2311.00500 • Published Nov 1, 2023 • 2
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast Paper • 2402.08567 • Published Feb 13, 2024 • 2