Update README.md
README.md
CHANGED
@@ -10,6 +10,15 @@ library_name: transformers
<img width="30%" src="figures/logo.png">
</div>

+<div align="center">
+<a href="https://arxiv.org/abs/2504.07491">
+<b>📄 Tech Report</b>
+</a> |
+<a href="https://github.com/MoonshotAI/Kimi-VL">
+<b>📄 Github</b>
+</a> |
+<a href="https://huggingface.co/spaces/moonshotai/Kimi-VL-A3B-Thinking/">💬 Chat Web</a>
+</div>
## Introduction

We present **Kimi-VL**, an efficient open-source Mixture-of-Experts (MoE) vision-language model (VLM) that offers **advanced multimodal reasoning, long-context understanding, and strong agent capabilities**—all while activating only **2.8B** parameters in its language decoder (Kimi-VL-A3B).