JasperHaozhe committed
Commit 392ad69 · verified · 1 Parent(s): 4ca9dbb

Create README.md

  1. README.md +37 -0
README.md ADDED
@@ -0,0 +1,37 @@
+ ---
+ base_model:
+ - Qwen/Qwen2.5-VL-72B-Instruct
+ language:
+ - en
+ license: apache-2.0
+ tags:
+ - transformers
+ - multimodal
+ pipeline_tag: visual-question-answering
+ ---
+ # VL-Rethinker-72B-Preview
+ ## Model Overview
+ - **VL-Rethinker-72B-Preview** improves on the visual reasoning of the [Qwen2.5-VL-72B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-72B-Instruct) model.
+
+ - As of April 3rd, 2025, **VL-Rethinker-72B-Preview** achieves superior results on several visual reasoning benchmarks ([MathVision](https://mathllm.github.io/mathvision/), [MathVista](https://mathvista.github.io/), [MathVerse](https://mathverse-cuhk.github.io/), MMMU-Pro, EMMA, MEGA).
+
+
+ ## Evaluation
+
+ We will release a code repository for VLM evaluation. It supports RL training with simple rule-based rewards while staying aligned with LLM-Judge results.
+
+ Stay tuned!
+
+
+ ## Citation
+ If you find our model useful, please consider citing:
+
+ ```
+ @misc{VL-Rethinker-72B-Preview,
+   author    = {Wang, Haozhe and Lin, Fangzhen and Chen, Wenhu},
+   title     = {VL-Rethinker-72B-Preview},
+   year      = 2025,
+   url       = {https://huggingface.co/TIGER-Lab/VL-Rethinker-Preview},
+   publisher = {Hugging Face}
+ }
+ ```
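
The Evaluation section above mentions "simple rule-based rewards" without describing them, and the promised repository is not yet released. As a rough illustration only, a rule-based reward for visual math questions might look like the sketch below. Everything in it is an assumption rather than the authors' implementation: the function names are hypothetical, and so is the convention that the model puts its final answer in `\boxed{...}`.

```python
import re
from typing import Optional


def extract_final_answer(response: str) -> Optional[str]:
    """Return the content of the last \\boxed{...} in the response, if any.

    Assumption: the model is prompted to box its final answer. Nested
    braces inside the box are not handled by this simple pattern.
    """
    matches = re.findall(r"\\boxed\{([^{}]*)\}", response)
    return matches[-1].strip() if matches else None


def normalize(answer: str) -> str:
    """Lowercase, remove all whitespace, and drop trailing periods."""
    return re.sub(r"\s+", "", answer.lower()).rstrip(".")


def rule_based_reward(response: str, ground_truth: str) -> float:
    """Hypothetical binary reward: 1.0 on a match, 0.0 otherwise."""
    predicted = extract_final_answer(response)
    if predicted is None:
        return 0.0  # no parseable final answer
    p, g = normalize(predicted), normalize(ground_truth)
    try:
        # Compare numerically when both sides parse as numbers.
        return 1.0 if abs(float(p) - float(g)) < 1e-6 else 0.0
    except ValueError:
        # Otherwise fall back to exact string match (e.g. option letters).
        return 1.0 if p == g else 0.0
```

A reward of this kind is cheap to compute during RL training. Whether it agrees with an LLM judge would have to be checked separately, for example by measuring how often the two give the same verdict on a held-out set.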