ElevenLabs just released their speech to speech thing and it’s really cool: https://elevenlabs.io/voice-changer

Now I’m wondering what’s the best similar “voice changer” or speech to speech model that I can run locally?

It doesn’t have to be in real time, I plan on using it to narrate audio books and similar.

Thanks!

  • JawGBoiB
    link
    fedilink
    English
    arrow-up
    1
    ·
    10 months ago

    RVC is definitely the best for this. Unlike most other methods, you don’t provide text transcriptions for the training dataset - this makes RVC models really easy to train and there is no compromise of quality.