I’m posting to ask for help with a personal project of mine.
As a heads-up, I’m very new to machine learning; I mostly do development work. Recently, though, I took on a project where I have to convert text into a lip-synced video file.
First I need to generate a WAV file from text, so I’m looking for TTS software. I don’t need a very high-quality voice; a somewhat human-like one is enough for my project.
I tried Tortoise TTS, but the installation failed and I can’t find a tutorial good enough to follow. Also, it seems Tortoise and many other AI tools only work with an NVIDIA GPU, which I don’t have (my system has AMD integrated graphics). Does anyone have a tutorial or suggestions on how to install Tortoise?
Or can you suggest another TTS tool I could use?
I’ve already tried those. But the PyTorch site says that ROCm doesn’t work on Windows anymore.