Hey guys, just wondering if anyone has had success finetuning StyleTTS2 yet?

The only one I can find is the LJSpeech model, which sounds really good! But wondering what some other narrators / speakers would sound like, especially voices more outside the training dataset.

(Seems zero shot prompting at runtime gives low quality, so need real finetunes!)