ElevenLabs just released their speech to speech thing and it’s really cool: https://elevenlabs.io/voice-changer
Now I’m wondering what’s the best similar “voice changer” or speech to speech model that I can run locally?
It doesn’t have to run in real time; I plan on using it to narrate audiobooks and similar.
Thanks!
RVC (Retrieval-based Voice Conversion) or so-vits-svc.
You could also transcribe the audio and then run TTS on the transcript to get similar results, using Whisper and the coqui-ai TTS models.
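For anyone who wants to try the transcribe-then-TTS route, here is a rough sketch of how it could be wired up. It assumes the `openai-whisper` and Coqui `TTS` pip packages plus ffmpeg are installed. The Whisper model size (`base`) and the Coqui model name are just example picks, and `revoice` and the two helper functions are names I made up for illustration. Keep in mind this is not true speech to speech: the original intonation and pacing are lost because only the text is carried over.

```python
from typing import Callable


def transcribe_with_whisper(audio_path: str, model_name: str = "base") -> str:
    """Transcribe an audio file to text with openai-whisper."""
    import whisper  # imported lazily; assumes `pip install openai-whisper` and ffmpeg

    model = whisper.load_model(model_name)  # "base" is an example size, not a recommendation
    result = model.transcribe(audio_path)
    return result["text"].strip()


def synthesize_with_coqui(
    text: str,
    out_path: str,
    model_name: str = "tts_models/en/ljspeech/tacotron2-DDC",  # example model, swap as needed
) -> None:
    """Speak `text` into a wav file with a Coqui TTS model."""
    from TTS.api import TTS  # imported lazily; assumes `pip install TTS`

    tts = TTS(model_name=model_name)
    tts.tts_to_file(text=text, file_path=out_path)


def revoice(
    audio_path: str,
    out_path: str,
    transcribe: Callable[[str], str] = transcribe_with_whisper,
    synthesize: Callable[[str, str], None] = synthesize_with_coqui,
) -> str:
    """Hypothetical helper: speech -> text -> speech. Returns the transcript."""
    text = transcribe(audio_path)
    if not text:
        raise ValueError(f"no speech recognized in {audio_path!r}")
    synthesize(text, out_path)
    return text


if __name__ == "__main__":
    # Placeholder file names; replace with your own recording.
    print(revoice("my_recording.wav", "narration.wav"))
```

The transcribe and synthesize steps are passed in as plain functions, so you can swap either one out (for example, a different TTS model or a voice-cloning one) without touching the rest.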