Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
xw-eric 's Collections
Multimodal Reasoning

Multimodal Reasoning

updated 12 days ago

A collection for Multimodal Reasoning Models and Benchmarks.

Upvote
-

  • Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models

    Paper • 2502.16033 • Published Feb 22 • 18

  • rippleripple/MMIR

    Viewer • Updated Feb 25 • 534 • 41 • 2

  • LLaVA-o1: Let Vision Language Models Reason Step-by-Step

    Paper • 2411.10440 • Published Nov 15, 2024 • 125

  • GRIT: Teaching MLLMs to Think with Images

    Paper • 2505.15879 • Published 13 days ago • 12

  • Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space

    Paper • 2505.15778 • Published 13 days ago • 15
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs