Submitted by zhoutianyi 44 ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness · 10 authors 4
Submitted by JunhaoZhuang 23 Cobra: Efficient Line Art COlorization with BRoAder References · 6 authors 2
Submitted by YangshenDeng 23 AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference · 16 authors 2
Submitted by g-h-chen 20 SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models · 8 authors 2
Submitted by nielsr 17 REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers · 6 authors 2
Submitted by panprabh 15 SIFT-50M: A Large-Scale Multilingual Dataset for Speech Instruction Fine-Tuning · 7 authors 2
Submitted by yunx-z 13 MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges? · 9 authors 2
Submitted by JaceyH919 8 Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting · 5 authors 2
Submitted by CatWorldLee 8 Syzygy of Thoughts: Improving LLM CoT with the Minimal Free Resolution · 10 authors 2
Submitted by SunshineWu 6 BlockGaussian: Efficient Large-Scale Scene Novel View Synthesis via Adaptive Block-Based Gaussian Splatting · 4 authors 2
Submitted by nthakur 3 FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents · 6 authors 3
Submitted by evijit 2 "It's not a representation of me": Examining Accent Bias and Digital Exclusion in Synthetic AI Voice Services · 6 authors 2