T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Paper • 2505.00703 • Published 2 days ago • 25
Softpick: No Attention Sink, No Massive Activations with Rectified Softmax Paper • 2504.20966 • Published 4 days ago • 19
Taming the Titans: A Survey of Efficient LLM Inference Serving Paper • 2504.19720 • Published 5 days ago • 9
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published 4 days ago • 32
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning Paper • 2504.18904 • Published 7 days ago • 7
YoChameleon: Personalized Vision and Language Generation Paper • 2504.20998 • Published 4 days ago • 10
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published 4 days ago • 74
Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency Paper • 2504.18589 • Published 9 days ago • 9
LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects Paper • 2504.19838 • Published 5 days ago • 21
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency Paper • 2504.12080 • Published 17 days ago • 7
Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark Paper • 2504.16427 • Published 11 days ago • 16
Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning Paper • 2504.16656 • Published 10 days ago • 50