HKUSTAudio/Llasa-1B-Multilingual


HKUST-Audio committed on Feb 7

Commit 919fda7 (verified) · 1 Parent(s): d465d50

Update README.md

Files changed (1)

README.md +1 -1


@@ -23,7 +23,7 @@ pipeline_tag: text-to-speech
 **Main Idea:**
 This model enhances previous  Llasa TTS by incorporating multilingual data. The approach leverages the LLAMA-initialized text BPE tokenizer,
-which is adept at handling multilingual text without the need to design language-specific G2P (grapheme-to-phoneme) systems.
+which can handle multilingual text without the need to design language-specific G2P (grapheme-to-phoneme) systems.
 Although the multilingual training data is limited—using only the MLS (En/Fr/De/Nl/Es/It/Pt/Pl) and Emilia  (En/Zh/De/Fr/Ja/Ko) datasets—resulting in potentially less optimal performance for some languages due to data scarcity,
 our model can serve as a  base TTS model. It is particularly suitable for fine-tuning for a specific language, as texts in various languages can be uniformly processed using the BPE tokenizer from Llama.
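
Reviewer note: the README's claim that a BPE tokenizer can process text in any language uniformly, with no per-language G2P step, can be illustrated with a toy example. The sketch below is not Llama's tokenizer or vocabulary. It is a minimal byte-level BPE written for illustration, and the byte-level design as well as the names `train_bpe`, `merge_pair`, `encode`, and `decode` are assumptions that do not come from the README.

```python
from collections import Counter


def merge_pair(seq, pair, new_id):
    """Replace every adjacent occurrence of `pair` in `seq` with `new_id`."""
    out, i = [], 0
    while i < len(seq):
        if i + 1 < len(seq) and (seq[i], seq[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(seq[i])
            i += 1
    return out


def train_bpe(corpus, num_merges):
    """Learn up to `num_merges` merges over raw UTF-8 bytes (toy trainer)."""
    seqs = [list(text.encode("utf-8")) for text in corpus]
    merges = {}  # (left_id, right_id) -> new_id, kept in training order
    next_id = 256  # ids 0..255 are reserved for single bytes
    for _ in range(num_merges):
        pairs = Counter()
        for seq in seqs:
            pairs.update(zip(seq, seq[1:]))
        if not pairs:
            break
        best, count = pairs.most_common(1)[0]
        if count < 2:  # nothing left worth merging
            break
        merges[best] = next_id
        seqs = [merge_pair(seq, best, next_id) for seq in seqs]
        next_id += 1
    return merges


def encode(text, merges):
    """Tokenize any Unicode string: bytes first, then merges in training order."""
    seq = list(text.encode("utf-8"))
    for pair, new_id in merges.items():
        seq = merge_pair(seq, pair, new_id)
    return seq


def decode(ids, merges):
    """Map token ids back to text by expanding each merge into its bytes."""
    vocab = {i: bytes([i]) for i in range(256)}
    for (left, right), new_id in merges.items():
        vocab[new_id] = vocab[left] + vocab[right]
    return b"".join(vocab[i] for i in ids).decode("utf-8")


if __name__ == "__main__":
    corpus = ["hello hello world", "你好你好世界", "こんにちは こんにちは"]
    merges = train_bpe(corpus, num_merges=20)
    # The same code path handles every script, including one unseen in training.
    for text in ["hello world", "你好,世界", "안녕하세요", "Guten Tag"]:
        ids = encode(text, merges)
        print(len(text.encode("utf-8")), "bytes ->", len(ids), "tokens")
```

Because every string falls back to raw bytes, text in a script that never appeared in training still encodes and decodes losslessly. It only uses more tokens, which is in line with the README's caveat that languages with scarce data may perform worse.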