Clinical knowledge in LLMs does not translate to human interactions Paper • 2504.18919 • Published 12 days ago • 24
Chain-of-Defensive-Thought: Structured Reasoning Elicits Robustness in Large Language Models against Reference Corruption Paper • 2504.20769 • Published 9 days ago • 3
TreeHop: Generate and Filter Next Query Embeddings Efficiently for Multi-hop Question Answering Paper • 2504.20114 • Published 10 days ago • 5
UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities Paper • 2504.20734 • Published 9 days ago • 60
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published 9 days ago • 88
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published 8 days ago • 37
WebThinker: Empowering Large Reasoning Models with Deep Research Capability Paper • 2504.21776 • Published 8 days ago • 41
AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization Paper • 2504.21659 • Published 8 days ago • 9
Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks Paper • 2505.00234 • Published 7 days ago • 21
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Paper • 2505.00703 • Published 6 days ago • 39
CORG: Generating Answers from Complex, Interrelated Contexts Paper • 2505.00023 • Published 13 days ago • 8
Improving Editability in Image Generation with Layer-wise Memory Paper • 2505.01079 • Published 6 days ago • 23
SVDQuant Collection Models and datasets for "SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models" • 20 items • Updated Mar 17 • 35
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float Paper • 2504.11651 • Published 22 days ago • 28