Boyuan Zheng

boyuanzheng010

https://boyuanzheng010.github.io/

AI & ML interests

Language Agents, Multilinguality

Recent Activity

liked a Space 18 days ago

McGill-NLP/agent-reward-bench-demo

boyuanzheng010's activity

upvoted a paper 18 days ago

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

Paper • 2504.08942 • Published 22 days ago • 27

upvoted a paper 23 days ago

SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills

Paper • 2504.07079 • Published 24 days ago • 11

upvoted a paper 5 months ago

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents

Paper • 2411.06559 • Published Nov 10, 2024 • 14

upvoted a paper 7 months ago

Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents

Paper • 2410.05243 • Published Oct 7, 2024 • 19

upvoted 2 papers about 1 year ago

MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions

Paper • 2403.19651 • Published Mar 28, 2024 • 22

GPT-4V(ision) is a Generalist Web Agent, if Grounded

Paper • 2401.01614 • Published Jan 3, 2024 • 23

upvoted a paper over 1 year ago

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Paper • 2311.16502 • Published Nov 27, 2023 • 35