view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 β’ 251
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing Paper β’ 2503.10639 β’ Published Mar 13 β’ 50
HoT: Highlighted Chain of Thought for Referencing Supporting Facts from Inputs Paper β’ 2503.02003 β’ Published Mar 3 β’ 48
TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding Paper β’ 2502.19400 β’ Published Feb 26 β’ 49
SurveyX: Academic Survey Automation via Large Language Models Paper β’ 2502.14776 β’ Published Feb 20 β’ 100
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper β’ 2501.04519 β’ Published Jan 8 β’ 276
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains Paper β’ 2501.05707 β’ Published Jan 10 β’ 20
VideoRAG: Retrieval-Augmented Generation over Video Corpus Paper β’ 2501.05874 β’ Published Jan 10 β’ 72
Running on Zero 27 27 Newborn Article Impact Predict π» Use title and abstract to predict future academic impact