h1t/TCD-SDXL-LoRA
Text-to-Image • 800 • 117
Text-to-speech (TTS) with Next-gen Kaldi
Generate voice with text or audio input
Generate high-quality speech from text using an audio prompt
Generate speech in a cloned voice from a short audio clip
Generate a talking face video from an image and audio
Generate animated videos from images and motion sequences