Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
imjliao 's Collections
Agent
Summarization
Reasoning
Prompt
Synthetic Data
Dialogue
Entity
Information Retrieval
QA
Document Information Extraction
Long Context
Document AI
Tool Use
Fine Tuning
MLLM
AIF
Models

MLLM

updated Feb 11, 2024
Upvote
-

  • Question Aware Vision Transformer for Multimodal Reasoning

    Paper • 2402.05472 • Published Feb 8, 2024 • 10

  • ScreenAI: A Vision-Language Model for UI and Infographics Understanding

    Paper • 2402.04615 • Published Feb 7, 2024 • 44

  • WebLINX: Real-World Website Navigation with Multi-Turn Dialogue

    Paper • 2402.05930 • Published Feb 8, 2024 • 40

  • More Agents Is All You Need

    Paper • 2402.05120 • Published Feb 3, 2024 • 54
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs