
VAST-AI/MIDI-3D
Image-to-3D
•
Updated
•
260
•
40
Generate images from text prompts
Large Animatable Human Model
Try Orpheus TTS here
Convert vocals to match reference audio
Audio Conditioned LipSync with Latent Diffusion Models
Wan: Open and Advanced Large-Scale Video Generative Models