GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action Control Paper • 2505.22421 • Published 6 days ago • 11
Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model Paper • 2505.23606 • Published 5 days ago • 14
VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos Paper • 2505.23693 • Published 5 days ago • 56
MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems Paper • 2505.18943 • Published 10 days ago • 24
MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios Paper • 2505.21333 • Published 7 days ago • 38
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data Paper • 2505.18445 • Published 11 days ago • 63
SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models Paper • 2503.07605 • Published Mar 10 • 69
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published Feb 13 • 194
Closed-loop Long-horizon Robotic Planning via Equilibrium Sequence Modeling Paper • 2410.01440 • Published Oct 2, 2024 • 4
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios Paper • 2410.01481 • Published Oct 2, 2024 • 3
InfiniPot: Infinite Context Processing on Memory-Constrained LLMs Paper • 2410.01518 • Published Oct 2, 2024 • 3
VLMGuard: Defending VLMs against Malicious Prompts via Unlabeled Data Paper • 2410.00296 • Published Oct 1, 2024 • 6
HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration Paper • 2410.01723 • Published Oct 2, 2024 • 5
EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis Paper • 2410.01804 • Published Oct 2, 2024 • 7
EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control Paper • 2410.00316 • Published Oct 1, 2024 • 7
BordIRlines: A Dataset for Evaluating Cross-lingual Retrieval-Augmented Generation Paper • 2410.01171 • Published Oct 2, 2024 • 6
General Preference Modeling with Preference Representations for Aligning Language Models Paper • 2410.02197 • Published Oct 3, 2024 • 9
Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis Paper • 2409.20059 • Published Sep 30, 2024 • 16