ElevenLabs just released their speech to speech thing and it’s really cool: https://elevenlabs.io/voice-changer
Now I’m wondering what’s the best similar “voice changer” or speech to speech model that I can run locally?
It doesn’t have to be in real time, I plan on using it to narrate audio books and similar.
Thanks!
You could do a audio transcription then TTS to achieve the similar results with whisper and coqui-ai TTS models.
https://github.com/coqui-ai/TTS