Transcribe audio from microphone, files, or YouTube
Generate audio from text using a voice synthesis model
Generate audio from text using voice synthesis
Generate speech from text using a reference audio sample