-
ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models
Paper • 2502.09696 • Published • 39 -
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment
Paper • 2502.10391 • Published • 32 -
Autellix: An Efficient Serving Engine for LLM Agents as General Programs
Paper • 2502.13965 • Published • 18 -
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines
Paper • 2502.14739 • Published • 97
Sangyeon Cho
josang1204
·
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 17 hours ago
josang1204/preference-tuning-dataset
published
a dataset
about 17 hours ago
josang1204/preference-tuning-dataset
updated
a dataset
about 17 hours ago
josang1204/huggingface-smol-course-instruction-tuning-dataset