JasperHaozhe committed
Commit 8daa6df · verified · 1 Parent(s): 340fa52

Update README.md

Files changed (1)
  1. README.md +14 -16
README.md CHANGED
@@ -9,29 +9,27 @@ tags:
  - multimodal
  pipeline_tag: visual-question-answering
  ---
- # MM-Thinker-72B-Preview
- ## Model Overview
- - **MM-thinker-72B-Preview** improves visual reasoning upon [Qwen2.5-VL-72B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-72B-Instruct) model.

- - As of April 3rd, 2025, **MM-thinker-72B-Preview** achieves superior results on various visual reasoning benchmarks ([MathVision](https://mathllm.github.io/mathvision/), [MathVista](https://mathvista.github.io/), [MathVerse](https://mathverse-cuhk.github.io/), [MMMU-Pro](), [EMMA](), [MEGA]()).

- ## Evaluation

- We will release a code repository for VLM evaluation. It supports RL training with simple rule-based rewards, meanwhile aligning with LLM-Judge results.
-
- Stay tuned!

  ## Citation
- If you find our model useful, please consider citing:

  ```
- @misc {MM-Thinker-72B,
- author = { Wang, Haozhe and Lin, Fangzhen and Chen, Wenhu },
- title = { MM-Thinker-72B },
- year = 2025,
- url = { https://huggingface.co/TIGER-Lab/MM-Thinker-72B},
- publisher = { Hugging Face }
- }
  ```
  - multimodal
  pipeline_tag: visual-question-answering
  ---
+ # VL-Reasoner-72B
+ **VL-Reasoner-72B** achieves superior results on various multimodal reasoning benchmarks.

+ It is trained using the **GRPO-SSR** technique, serving as the foundation for [**VL-Rethinker**](https://huggingface.co/TIGER-Lab/VL-Rethinker-72B/).

+ For details of our approach and a performance comparison, please see our [paper](https://github.com/TIGER-AI-Lab/VL-Rethinker/blob/main/paper.pdf).

+ For details of training and evaluation, please see our [code repo](https://github.com/TIGER-AI-Lab/VL-Rethinker/).

+ Explore further via the following links:

+ | [**🚀Project Page**](https://tiger-ai-lab.github.io/VL-Rethinker/) | [**📖Paper**](https://arxiv.org/abs/2504.08837) | [**🔗Github**](https://github.com/TIGER-AI-Lab/VL-Rethinker/) | [**🤗Data** (Coming Soon)]() |

  ## Citation

+ If you find this model useful, please consider citing:
  ```
+ @article{vl-rethinker,
+   title={VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning},
+   author={Wang, Haozhe and Qu, Chao and Huang, Zuming and Chu, Wei and Lin, Fangzhen and Chen, Wenhu},
+   journal={arXiv preprint arXiv:2504.08837},
+   year={2025}
+ }
  ```
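
For readers curious about the group-relative objective that **GRPO-SSR** builds on, the snippet below is a minimal, illustrative sketch of the GRPO-style advantage computation. It is an assumption-level illustration only, not the training code from the linked repository, and it omits the SSR component entirely.

```python
import numpy as np

def grpo_advantages(rewards, eps=1e-6):
    """Group-relative advantages in the style of GRPO.

    `rewards` holds one scalar reward per sampled response to the same prompt
    (e.g. a rule-based correctness score). Each response's advantage is its
    reward standardized within the group, so no learned value model is needed.
    """
    r = np.asarray(rewards, dtype=np.float64)
    return (r - r.mean()) / (r.std() + eps)

# Example: four sampled answers to one question, rewarded 1 if correct, 0 otherwise.
print(grpo_advantages([1.0, 0.0, 0.0, 1.0]))  # approx. [ 1.0, -1.0, -1.0,  1.0]
```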