The closest I got to ChatGPT+Dall-E locally (SDXL+LLaMA2-13B-Tiefighter)

iChrist · 1 year ago

a_beautiful_rhind · 1 year ago

I use the 70B to chat, and it also prompts SD during the convo. I agree that for just prompting SD you can use almost any LLM.

IME, TensorRT didn’t help much; it just shaved a second off. I also tried the vlad version (diffusers) and compiling the model. On the 3090 I get somewhere around 6 seconds for 1024x1024, and I found that XL doesn’t do as well at smaller image sizes.

For chat, as opposed to serious SD work, even 576x576 is “enough” on this 1080p laptop. On the P40 that takes 12 seconds.

For actual SD work, I’ll try ComfyUI at some point. AFAIK, it’s the only UI that does XL properly, i.e. passes the latent image from the base model to the refiner model. That’s probably why my XL outputs don’t look much better than those from good 1.5 models.
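
For anyone wondering what "passing the latent to the refiner" looks like outside a UI, here is a rough sketch using the Hugging Face diffusers library. It is not my actual setup, just an illustration under some assumptions: the official `stabilityai` base and refiner checkpoints, a CUDA GPU with enough VRAM for fp16, and an arbitrary 80/20 handoff between the two models. `split_steps` and `generate` are made-up helper names, and the step split that `split_steps` reports is only an approximation of what the scheduler does internally.

```python
def split_steps(total_steps: int, handoff: float) -> tuple[int, int]:
    """Approximate (base_steps, refiner_steps) for a handoff fraction."""
    if not 0.0 < handoff < 1.0:
        raise ValueError("handoff must be strictly between 0 and 1")
    base_steps = round(total_steps * handoff)
    return base_steps, total_steps - base_steps


def generate(prompt: str, total_steps: int = 40, handoff: float = 0.8):
    """Run the SDXL base model, then hand its latents to the refiner."""
    # Imports are deferred so this file still loads without the GPU stack.
    import torch
    from diffusers import (
        StableDiffusionXLImg2ImgPipeline,
        StableDiffusionXLPipeline,
    )

    base = StableDiffusionXLPipeline.from_pretrained(
        "stabilityai/stable-diffusion-xl-base-1.0",
        torch_dtype=torch.float16,
        variant="fp16",
        use_safetensors=True,
    ).to("cuda")

    # Share the second text encoder and the VAE to save VRAM.
    refiner = StableDiffusionXLImg2ImgPipeline.from_pretrained(
        "stabilityai/stable-diffusion-xl-refiner-1.0",
        text_encoder_2=base.text_encoder_2,
        vae=base.vae,
        torch_dtype=torch.float16,
        variant="fp16",
        use_safetensors=True,
    ).to("cuda")

    base_steps, refiner_steps = split_steps(total_steps, handoff)
    print(f"base: ~{base_steps} steps, refiner: ~{refiner_steps} steps")

    # The base stops early and returns latents instead of a decoded image.
    latents = base(
        prompt=prompt,
        num_inference_steps=total_steps,
        denoising_end=handoff,
        output_type="latent",
    ).images

    # The refiner resumes denoising from the same point in the schedule.
    return refiner(
        prompt=prompt,
        num_inference_steps=total_steps,
        denoising_start=handoff,
        image=latents,
    ).images[0]
```

Calling `generate("a lighthouse at dusk").save("out.png")` would write the refined image. The key part is `output_type="latent"` on the base call: the refiner gets the unfinished latent rather than a finished picture run through img2img.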