Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models Paper • 2504.20157 • Published 4 days ago • 31
Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters Paper • 2403.02677 • Published Mar 5, 2024 • 18
Writing Assistants Should Model Social Factors of Language Paper • 2303.16275 • Published Mar 28, 2023