Running • Featured • Voxtral Realtime WebGPU 💬 • 136 • Real-time speech transcription, entirely in your browser.
Running on Zero • Agents • Featured • Qwen3-TTS Demo 🎙 • 1.99k • Generate speech from text using voice design, cloning, or presets.
Running • Agents • Audio To MIDI And Advanced Renderer 🎹 • 25 • Audio-to-MIDI transcription and advanced rendering.
MIT/ast-finetuned-audioset-10-10-0.4593 Audio Classification • 86.6M • Updated Sep 6, 2023 • 490k • 359
Running on Zero • Agents • Featured • Qwen Image Multiple Angles 3D Camera 🎥 • 2.57k • Transform image viewpoint with adjustable camera angles.