Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
BaMbUM 's Collections
GUI Agents

GUI Agents

updated Mar 6
Upvote
-

  • Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

    Paper • 2404.05719 • Published Apr 8, 2024 • 83

  • ShowUI: One Vision-Language-Action Model for GUI Visual Agent

    Paper • 2411.17465 • Published Nov 26, 2024 • 87

  • Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms

    Paper • 2410.18967 • Published Oct 24, 2024 • 1

  • OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

    Paper • 2410.23218 • Published Oct 30, 2024 • 51

  • Agent S: An Open Agentic Framework that Uses Computers Like a Human

    Paper • 2410.08164 • Published Oct 10, 2024 • 24

  • Large Language Model-Brained GUI Agents: A Survey

    Paper • 2411.18279 • Published Nov 27, 2024 • 32

  • UI-TARS: Pioneering Automated GUI Interaction with Native Agents

    Paper • 2501.12326 • Published Jan 21 • 58
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs