FreSca: Unveiling the Scaling Space in Diffusion Models Paper • 2504.02154 • Published 7 days ago • 17
FreSca: Unveiling the Scaling Space in Diffusion Models Paper • 2504.02154 • Published 7 days ago • 17 • 2
Video Understanding with Large Language Models: A Survey Paper • 2312.17432 • Published Dec 29, 2023 • 3
VidComposition: Can MLLMs Analyze Compositions in Compiled Videos? Paper • 2411.10979 • Published Nov 17, 2024
VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity Paper • 2503.11557 • Published 26 days ago • 19
VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity Paper • 2503.11557 • Published 26 days ago • 19
SurveyX: Academic Survey Automation via Large Language Models Paper • 2502.14776 • Published Feb 20 • 97