HKUSTAudio
/

Llasa-1B-Multilingual

Model card Files Files and versions Community

ZhenYe234 commited on Feb 7

Commit

1fe189d

·

verified ·

1 Parent(s): e224984

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -24,7 +24,7 @@ pipeline_tag: text-to-speech
 **Main Idea:**
 This model enhances previous  Llasa TTS by incorporating multilingual data. The approach leverages the LLAMA-initialized text BPE tokenizer,
 which is adept at handling multilingual text without the need to design language-specific G2P (grapheme-to-phoneme) systems.
-Although the multilingual training data is limited—using only the MLS and Emilia datasets—resulting in potentially less optimal performance for some languages due to data scarcity,
 our model can serve as a  base TTS model. It is particularly suitable for fine-tuning for a specific language, as texts in various languages can be uniformly processed using the BPE tokenizer from Llama.
 This model is not mentioned in the paper, but it follows the same methodology.

 **Main Idea:**
 This model enhances previous  Llasa TTS by incorporating multilingual data. The approach leverages the LLAMA-initialized text BPE tokenizer,
 which is adept at handling multilingual text without the need to design language-specific G2P (grapheme-to-phoneme) systems.
+Although the multilingual training data is limited—using only the MLS (En/Fr/De/Nl/Es/It/Pt/Pl) and Emilia  (En/Zh/De/Fr/Ja/Ko) datasets—resulting in potentially less optimal performance for some languages due to data scarcity,
 our model can serve as a  base TTS model. It is particularly suitable for fine-tuning for a specific language, as texts in various languages can be uniformly processed using the BPE tokenizer from Llama.
 This model is not mentioned in the paper, but it follows the same methodology.