Submitted by siyue 52 Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective · 6 authors 2
Submitted by MingxingLi 50 UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning · 8 authors 5
Submitted by Emaad 36 This Time is Different: An Observability Perspective on Time Series Foundation Models · 17 authors 3
Submitted by PeterV09 32 Learn to Reason Efficiently with Adaptive Length-based Reward Shaping · 8 authors 2
Submitted by knightnemo 25 Vid2World: Crafting Video Diffusion Models to Interactive World Models · 5 authors 2
Submitted by Amanda2023 22 When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning · 9 authors 2
Submitted by TongZheng1999 17 Learning to Reason via Mixture-of-Thought for Logical Reasoning · 5 authors 7
Submitted by yanyc 17 VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models · 12 authors 2
Submitted by xw-eric 15 Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space · 8 authors 2
Submitted by JamesMile 15 Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs · 11 authors 2
Submitted by nonstopfor 13 Be Careful When Fine-tuning On Open-Source LLMs: Your Fine-tuning Data Could Be Secretly Stolen! · 6 authors 2
Submitted by nonstopfor 13 How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study · 11 authors 2
Submitted by bytehxf 12 DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling · 6 authors 2
Submitted by yangjunxiao2021 11 BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs · 12 authors 2
Submitted by sinwang 10 ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning · 5 authors 2
Submitted by Ziruibest 7 Evaluate Bias without Manual Test Sets: A Concept Representation Perspective for LLMs · 9 authors 2
Submitted by IvanTang 7 AutoMat: Enabling Automated Crystal Structure Reconstruction from Microscopy via Agentic Tool Use · 17 authors 2
Submitted by shivamag99 6 The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning · 5 authors 2
Submitted by huangsiteng 5 VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL · 7 authors 2
Submitted by Ziruibest 5 Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models · 12 authors 2
Submitted by yapeichang 5 BLEUBERI: BLEU is a surprisingly effective reward for instruction following · 7 authors 2
Submitted by Fengzhuo 4 BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms · 9 authors 2
Submitted by Mellen 4 PiFlow: Principle-aware Scientific Discovery with Multi-Agent Collaboration · 3 authors 2
Submitted by sunshinekevin 4 RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning · 6 authors 2
Submitted by zxbsmk 4 WebNovelBench: Placing LLM Novelists on the Web Novel Distribution · 3 authors 2
Submitted by shainaraza 4 HumaniBench: A Human-Centric Framework for Large Multimodal Models Evaluation · 8 authors 2
Submitted by craigwu 3 Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM · 3 authors 2
Submitted by hisoka94 3 Scaling and Enhancing LLM-based AVSR: A Sparse Mixture of Projectors Approach · 5 authors 2
Submitted by NathanRoll 2 In-Context Learning Boosts Speech Recognition via Human-like Adaptation to Speakers and Language Varieties · 6 authors 2
Submitted by ernlavr 2 MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM Hallucinations · 4 authors 2
Submitted by ishikaa 1 Language Specific Knowledge: Do Models Know Better in X than in English? · 3 authors 2