Submitted by wgcyeo 32 UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities · 5 authors 1
Submitted by akhaliq 17 Reinforcement Learning for Reasoning in Large Language Models with One Training Example · 14 authors 2
Submitted by passing2961 11 Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models · 4 authors 3