Record your voice by reading prompts
Generate images from text prompts
Transform and identify speech with MMS