Update README.md
Browse files
README.md
CHANGED
@@ -13,9 +13,11 @@ library_name: transformers
|
|
13 |
|
14 |
# VL-Rethinker-7B
|
15 |
|
|
|
|
|
16 |
**VL-Rethinker-7B** achieves SoTA results on various multimodal reasoning benchmarks.
|
17 |
|
18 |
-
It is trained using the **GRPO-SSR and Forced Rethinking** techniques, using meticulously curated
|
19 |
|
20 |
For details of our approach and performance comparison, please see our [paper](https://github.com/TIGER-AI-Lab/VL-Rethinker/blob/main/paper.pdf).
|
21 |
|
|
|
13 |
|
14 |
# VL-Rethinker-7B
|
15 |
|
16 |
+
**🚀 News:** <u>We release our meticulously curated collection of RL training queries for multimodal reasoning: [ViRL39K](https://huggingface.co/datasets/TIGER-Lab/ViRL39K).</u>
|
17 |
+
|
18 |
**VL-Rethinker-7B** achieves SoTA results on various multimodal reasoning benchmarks.
|
19 |
|
20 |
+
It is trained using the **GRPO-SSR and Forced Rethinking** techniques, using meticulously curated [ViRL39K](https://huggingface.co/datasets/TIGER-Lab/ViRL39K).
|
21 |
|
22 |
For details of our approach and performance comparison, please see our [paper](https://github.com/TIGER-AI-Lab/VL-Rethinker/blob/main/paper.pdf).
|
23 |
|