ASR
nvidia/nemotron-speech-streaming-en-0.6b • Automatic Speech Recognition • Updated 20 days ago • 8.35k • 544
INDIC TTS DATASETS
My own collection of TTS datasets for fine-tuning models on Indic languages.
edwixx/Gujarati40h • Updated Oct 18, 2024 • 10
edwixx/Tamil200hours • Updated May 7, 2024 • 17
Audio Models
Collection of the best text-to-audio models.
stabilityai/stable-audio-open-1.0 • Text-to-Audio • Updated Jun 19, 2025 • 24k • 1.47k
TangoFlux 🚀 • Text to Audio (Sound SFX) Generator • Running on Zero • 326
openbmb/MiniCPM-o-2_6 • Any-to-Any • 9B • Updated Oct 5, 2025 • 375k • 1.29k
TTS
Collection of some of the TTS models I found cool.
SWivid/F5-TTS • Text-to-Speech • Updated Mar 21, 2025 • 555k • 1.17k
fishaudio/fish-speech-1.4 • Text-to-Speech • Updated Nov 5, 2024 • 927 • 457
coqui/XTTS-v2 • Text-to-Speech • Updated Dec 11, 2023 • 8.83M • 3.56k
microsoft/speecht5_tts • Text-to-Speech • Updated Nov 8, 2023 • 125k • 830