VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos Paper • 2505.23693 • Published 5 days ago • 56
Sherlock: Self-Correcting Reasoning in Vision-Language Models Paper • 2505.22651 • Published 6 days ago • 50
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs Paper • 2504.18415 • Published Apr 25 • 43
Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark Paper • 2504.16427 • Published Apr 23 • 17
DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models Paper • 2504.15716 • Published Apr 22 • 9
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training Paper • 2504.13161 • Published Apr 17 • 92
SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models Paper • 2503.07605 • Published Mar 10 • 69
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published Feb 13 • 194
DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting Paper • 2411.17223 • Published Nov 26, 2024 • 7
Edge Weight Prediction For Category-Agnostic Pose Estimation Paper • 2411.16665 • Published Nov 25, 2024 • 6
LLMs Do Not Think Step-by-step In Implicit Reasoning Paper • 2411.15862 • Published Nov 24, 2024 • 10
Best of Both Worlds: Advantages of Hybrid Graph Sequence Models Paper • 2411.15671 • Published Nov 23, 2024 • 9
From CISC to RISC: language-model guided assembly transpilation Paper • 2411.16341 • Published Nov 25, 2024 • 15
SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis Paper • 2411.16443 • Published Nov 25, 2024 • 12
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages Paper • 2411.16508 • Published Nov 25, 2024 • 12