SridBench: Benchmark of Scientific Research Illustration Drawing of Image Generation Model Paper • 2505.22126 • Published 6 days ago • 4
ATI: Any Trajectory Instruction for Controllable Video Generation Paper • 2505.22944 • Published 6 days ago • 6
Re-ttention: Ultra Sparse Visual Generation via Attention Statistical Reshape Paper • 2505.22918 • Published 6 days ago • 7
Differentiable Solver Search for Fast Diffusion Sampling Paper • 2505.21114 • Published 7 days ago • 9
MAGREF: Masked Guidance for Any-Reference Video Generation Paper • 2505.23742 • Published 5 days ago • 9
Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model Paper • 2505.23606 • Published 5 days ago • 14
FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and Italian Paper • 2505.22759 • Published 6 days ago • 20
LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers Paper • 2505.23758 • Published 5 days ago • 23
Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding Paper • 2505.22618 • Published 6 days ago • 35
AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views Paper • 2505.23716 • Published 5 days ago • 31
VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning? Paper • 2505.23359 • Published 5 days ago • 38
Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence Paper • 2505.23747 • Published 5 days ago • 65
One-Way Ticket:Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models Paper • 2505.21960 • Published 6 days ago • 5
PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Models Paper • 2505.22523 • Published 6 days ago • 6
EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance Paper • 2505.21876 • Published 7 days ago • 9