The closest I got to ChatGPT+Dall-E locally (SDXL+LLaMA2-13B-Tiefighter)

iChrist@alien.top · 1 year ago

The closest I got to ChatGPT+Dall-E locally (SDXL+LLaMA2-13B-Tiefighter)

mcmoose1900@alien.top · 1 year ago

I would recommend Fooocus to those who have not tried: https://github.com/lllyasviel/Fooocus

It has a “prompt expansion” feature similar to this and to OpenAI's, and the short-prompt quality absolutely blows away automatic1111 (and any other UI I have tried) thanks to some tricks that are enabled by default.

iChrist@alien.top · 1 year ago

I also use Fooocus; it's good for quick final results with one click.

But SillyTavern allows for much more than just image generation. I have a writing assistant that helps me reply to customers at work, a role-play character, a general chatbot (that I wish I could connect to the internet for more up-to-date information), a storywriter, and other cool characters that help throughout the day.

It also has an option to share it locally or through Cloudflare, so I can use it on my phone and get images in seconds while I'm away from home. An awesome tool that I couldn’t have imagined a year ago :D

_SteerPike_@alien.top · 1 year ago

Sorry, which of the corners you listed is serving as a replacement for DALL-E here?

tech92yc@alien.top · 1 year ago

Of course this has been available locally for months and its performance is amazing, better than the OpenAI alternative

babanz@alien.top · 1 year ago

Thank you! This is fantastic work!!

Btw what hardware are you running? I’m planning on building a new PC after 6 years, and I’m struggling to pick the components 😅

help would be fantastic!! Thank you!!

a_beautiful_rhind@alien.top · 1 year ago

I do the same thing but with 70b. Then I run regular SD on a P40.

My image gen is a little slow, so I’m going to try MLC as it now supports AWQ models.

Goal here being to use the 2 P40s+3090 together at more than 8t/s and leave the other 3090 for image gen while running Goliath-120b.

To use this kind of thing away from home, I run the telegram bot.

This setup beats any service for chatting hands down.

iChrist@alien.top · 1 year ago

Why do you need 70b? For prompting SD?

I found that for good prompts even Mistral 7b does the job well!

You don't need 3 GPUs to run it all; I do it on a 3090.

I just installed TensorRT for automatic1111, which improves speeds by a big margin.

I generate a 1024x1024, 30-step image in 3.5 seconds instead of 9.

a_beautiful_rhind@alien.top · 1 year ago

I use the 70b to chat and it also prompts SD during the convo. I agree for just SD you can use almost any LLM model.

IME, TensorRT didn’t help much; it just shaved a second off. I also tried the vlad version (diffusers) and compiling the model. If I use the 3090 I get somewhere around 6 seconds for 1024x1024, and I found that XL doesn’t do as well on smaller images.

For chat rather than serious SD work, even 576x576 is “enough” on this 1080p laptop. On the P40 that takes 12 seconds.

Ideally for actual SD, I will try comfyUI at some point. AFAIK, it’s the only UI that does XL properly, with the latent image passed to the refiner model. That's probably why my XL outputs don’t look much better than good 1.5 models.
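
For reference, the base-to-refiner latent handoff can be sketched with the diffusers library. This is only a sketch under assumptions, not what comfyUI does internally: the Hugging Face model IDs, the 0.8 handoff point, and the helper names `split_steps` and `generate` are all illustrative choices, and it assumes a CUDA GPU with enough VRAM for both models.

```python
def split_steps(total_steps, handoff=0.8):
    """Rough count of denoising steps run by the base and the refiner.

    Hypothetical helper for illustration; diffusers derives the actual
    split from the scheduler's timesteps.
    """
    base_steps = round(total_steps * handoff)
    return base_steps, total_steps - base_steps


def generate(prompt, steps=30, handoff=0.8):
    # Heavy imports stay inside the function so split_steps works without a GPU.
    import torch
    from diffusers import (
        StableDiffusionXLImg2ImgPipeline,
        StableDiffusionXLPipeline,
    )

    # Assumed model IDs; swap in whichever SDXL checkpoints you use.
    base = StableDiffusionXLPipeline.from_pretrained(
        "stabilityai/stable-diffusion-xl-base-1.0",
        torch_dtype=torch.float16,
    ).to("cuda")
    refiner = StableDiffusionXLImg2ImgPipeline.from_pretrained(
        "stabilityai/stable-diffusion-xl-refiner-1.0",
        text_encoder_2=base.text_encoder_2,  # share weights to save VRAM
        vae=base.vae,
        torch_dtype=torch.float16,
    ).to("cuda")

    # The base model stops early and returns latents instead of a decoded image.
    latents = base(
        prompt=prompt,
        num_inference_steps=steps,
        denoising_end=handoff,
        output_type="latent",
    ).images

    # The refiner picks up at the same point and finishes the denoising.
    return refiner(
        prompt=prompt,
        num_inference_steps=steps,
        denoising_start=handoff,
        image=latents,
    ).images[0]
```

With 30 steps and a 0.8 handoff, `split_steps` gives about 24 steps on the base model and 6 on the refiner. Calling `generate("a lighthouse at dusk")` should return a PIL image you can `.save()`.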

staladine@alien.top · 1 year ago

This is amazing. Is there a repo or code linking them together? I would love to try but am not sure I can pull it off lol. Cool for sure.

iChrist@alien.top · 1 year ago

There is LoLLMs, which includes Stable Diffusion inside it (no need to install it separately),

but I recommend installing all three of the listed programs. The UI in SillyTavern is amazing and totally worth it, and you can still use automatic1111 manually and create with more control!
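
If you are curious what the glue between a chat front end and automatic1111 can look like, here is a minimal sketch. It is not SillyTavern's actual code. It assumes automatic1111 was started with the `--api` flag and is listening on its default local address; `build_payload` and `generate_image` are hypothetical helper names, and the prompt would come from whatever your LLM writes.

```python
import base64
import json
import urllib.request

# Assumption: automatic1111 launched with --api on the default local port.
A1111_URL = "http://127.0.0.1:7860/sdapi/v1/txt2img"


def build_payload(prompt, width=1024, height=1024, steps=30):
    """Build the JSON body for a txt2img request."""
    return {"prompt": prompt, "width": width, "height": height, "steps": steps}


def generate_image(prompt, out_path="out.png", url=A1111_URL):
    """Send the prompt to automatic1111 and save the first returned image."""
    body = json.dumps(build_payload(prompt)).encode("utf-8")
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        result = json.load(response)
    # The API returns images as base64-encoded strings.
    with open(out_path, "wb") as f:
        f.write(base64.b64decode(result["images"][0]))
    return out_path
```

With the web UI running, `generate_image("a cozy cabin in the snow, golden hour")` should write `out.png` next to the script. The chat side then only needs to hand the LLM's image description to this function.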