OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
User submitted (not verified)
Explore
Speech, audio, and voice-related AI projects that you can run locally.
Projects
Instant voice cloning by MIT and MyShell. Audio foundation model.
Multi-model AI
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A fast and local neural text-to-speech engine that embeds espeak-ng for phonemization.
Faster Whisper transcription with CTranslate2
Port of OpenAI's Whisper model in C/C++