Is there any sort of project that is combines Text + Image + TTSVoice generation in one single UI ?

Starkboy · 1 year ago

Is there any sort of project that is combines Text + Image + TTSVoice generation in one single UI ?

DanIngenius · 1 year ago

This is something I’m interested in working on, i want to crowd fund a good LLM + SD + TTSvoice host, DM me if you are interested in taking part!

a_beautiful_rhind · 1 year ago

Two off the top of my head: https://heyamica.com/ and silly tavern for fun stuff.

For agents there are https://github.com/spyglass-search/talos or https://github.com/Josh-XT/AGiXT

I think the problems is work + play aren’t really the same goals.

Starkboy · 1 year ago

Thanks for your answer! I get it. These projects do give me some ideas. I didn’t know such things are called ‘agents’ in this space

LyPreto · 1 year ago

you have all the APIs whats stopping you from putting something like this together? personally for me the only challenge is finding projects compatible with M1 that offer Metal offloading— but for linux it should be relatively straightforward to implement