Text-to-Speech
index-tts
index-tts commited on
Commit
c2ebde5
Β·
verified Β·
1 Parent(s): b83e4d4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +40 -3
README.md CHANGED
@@ -1,3 +1,40 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+ <div align="center">
5
+ <img src='assets/index_icon.png' width="250"/>
6
+ </div>
7
+
8
+
9
+ <h2><center>IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System</h2>
10
+
11
+ <p align="center">
12
+ <a href='https://arxiv.org/abs/2502.05512'><img src='https://img.shields.io/badge/ArXiv-2502.05512-red'></a>
13
+
14
+ ## πŸ‘‰πŸ» IndexTTS πŸ‘ˆπŸ»
15
+
16
+ [[Paper]](https://arxiv.org/abs/2502.05512) [[Demos]](https://index-tts.github.io) [[Codes]](https://github.com/index-tts/index-tts)
17
+
18
+ **IndexTTS** is a GPT-style text-to-speech (TTS) model mainly based on XTTS and Tortoise. It is capable of correcting the pronunciation of Chinese characters using pinyin and controlling pauses at any position through punctuation marks. We enhanced multiple modules of the system, including the improvement of speaker condition feature representation, and the integration of BigVGAN2 to optimize audio quality. Trained on tens of thousands of hours of data, our system achieves state-of-the-art performance, outperforming current popular TTS systems such as XTTS, CosyVoice2, Fish-Speech, and F5-TTS.
19
+ <span style="font-size:16px;">
20
+ Experience **IndexTTS**: Please contact <u>[email protected]</u> for more detailed information. </span>
21
+
22
+ ## Acknowledge
23
+ 1. [tortoise-tts](https://github.com/neonbjb/tortoise-tts)
24
+ 2. [XTTSv2](https://github.com/coqui-ai/TTS)
25
+ 3. [BigVGAN](https://github.com/NVIDIA/BigVGAN)
26
+ 4. [wenet](https://github.com/wenet-e2e/wenet/tree/main)
27
+ 5. [icefall](https://github.com/k2-fsa/icefall)
28
+
29
+ ## πŸ“š Citation
30
+
31
+ 🌟 If you find our work helpful, please leave us a star and cite our paper.
32
+
33
+ ```
34
+ @article{deng2025indextts,
35
+ title={IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System},
36
+ author={Wei Deng, Siyi Zhou, Jingchen Shu, Jinchao Wang, Lu Wang},
37
+ journal={arXiv preprint arXiv:2502.05512},
38
+ year={2025}
39
+ }
40
+ ```