This post is to ask for help regarding a personal project of mine.
So as a heads up, I’m very new to Machine Learning. I mostly a engaged in development stuff. But recently I took on a project where I have to convert text to lip-synced video file.
I need to first generate a WAV file from text. For that, Im looking for a TTS software. I just want a somewhat human-like voice for my project so I am not looking for a very high-quality voice.
I tried to use Tortoise TTS but I failed during the installation process and I can’t find a good enough tutorial I can follow. Also, it seems Tortoise and many other AI tools work with a NVIDIA GPU which I don’t have (I got a system with AMD integrated graphics). So does anyone have a tutorial or suggestion how to install tortoise?
Or do you have any suggestion for any other TTS to use?