[Stable Diffusion] Prompt Sharing and Learning Thread

sharlotte · Feb 8, 2024

You could try latent couple and use a template to have your character be towards the 'bottom' of your picture, leaving the headroom you want. Loads of videos but as usual this guy makes it 'easy':

You must be registered to see the links

Deleted member 1666680 · Feb 8, 2024

sharlotte said:
You could try latent couple and use a template to have your character be towards the 'bottom' of your picture, leaving the headroom you want. Loads of videos but as usual this guy makes it 'easy':
You must be registered to see the links

Yeah I know about this method, I hoped there was a simple prompting way of doing this though. If it's really just about some space above the head with not too complex stuff going on, I just tried photoshops generative expansion, it did the job fantastically.

me3 · Feb 13, 2024

It seems the forum decided to resize the image, it was 4376x960, so not sure what effects that's had.

You don't have permission to view the spoiler content. Log in or register now.

modine2021 · Feb 13, 2024

practicing vids

Deleted member 254403 · Feb 13, 2024

Fuchsschweif said:
When I create a picture with a character in the center, but I want some headroom above the character to be not occupied of it, because I want to have space for a title/text, how can I tell SD to leave that room specifically? I have a great seed but the character is almost filling the entire top.

hkennereth mentioned some, but there is another option:
In ComfyUI there is a multiareaconditioning node, webui should have something similar. Use that and conditiongsetarea. with a specific prompt per region. It is tough to get right for placing a complex scene, but for leaving area open may work.
Also via drawn masks you can do something similar.
This image contains a ComyUI worksheet, that uses this. The setup for this particular image was actually way too complex, it rarely worked.
(you can ignore the most ksampler steps at the top, they are not in use, also I didn't fully understand the advanced ksampler at that time).

modine2021 · Feb 13, 2024

Deleted member 1666680 · Feb 13, 2024

picobyte said:
hkennereth mentioned some, but there is another option:
In ComfyUI there is a multiareaconditioning node, webui should have something similar. Use that and conditiongsetarea. with a specific prompt per region. It is tough to get right for placing a complex scene, but for leaving area open may work.
Also via drawn masks you can do something similar.

Right now I just use Photoshops generative expand, it works perfect for easy things like the sky or something. For the future I will try to learn how to do proper sketches so that SD recognizes them and I can use the latent image as input for the ksampler, I think these sketches can be made in 1-2 minutes and they give maximum control over the scenery, character pose, placement and so on.

Deleted member 1666680 · Feb 14, 2024

Here's a cool tip: If you have photoshop and just want to remove something that you don't like on your picture (e.g. weird artifacts on clothing), you can just draw a shape around that part, use generative fill, and just input nothing. Just hit enter. Photoshop will usually then try to remove it.

So I could easily get rid off some weird buttons and stuff that SD put onto my character's jacket.

Sharinel · Feb 15, 2024

You must be registered to see the links

I think I can confidently state the following :-

/wibble

Jimwalrus · Feb 15, 2024

Sharinel said:
You must be registered to see the links

I think I can confidently state the following :-

/wibble

My RTX 3060 just burst into tears...

(Looks like it's going to be cloud-only, at least at first - so no NSFW

)

namhoang909 · Feb 16, 2024

devilkkw said:
Yes, made it time ago.
For crash error, have you checked if it happens with other sampler?

I don't have crash error, I downloaded it from civitai but can't find it later, so I wonder what happened

sharlotte · Feb 17, 2024

For people with less Vram, this Forge (very similar to SD) could be a good way to go:

You must be registered to see the links

Started testing this morning, with my RTX3060 it seems faster and I haven't had yet any issues with running out of memory. From what I can see in the cmd file, whilst generating, it frees memory between steps. No need to set low vram and other commands in the user bat file as it detects automatically the graphic card used. Common modules are pre-installed (like controlnet, kohya HRFix...). Super easy install.

6 minutes to generate 2 2048*2048 below with these settings:

modine2021 · Feb 20, 2024

anyone test the latest?

--->>>

You must be registered to see the links

devilkkw · Feb 20, 2024

seem good, but i wait official release before test.

I see many post on reddit about stable cascade. someone have tested? what's your impression?

modine2021 · Feb 20, 2024

devilkkw said:
seem good, but i wait official release before test.

I see many post on reddit about stable cascade. someone have tested? what's your impression?

didn't try the new one. and what is cascade?

PandaRepublic · Feb 20, 2024

Is there a new tutorial to make LoRA's? It seems like my old settings don't work anymore. If so how many repeats do you use now? I used to use 100 now it just turns out horribly. Also, withs some of the LoRA's I create I get a NaNs error. Either that or something is wrong with Kohya on my PC.

devilkkw · Feb 21, 2024

modine2021 said:
didn't try the new one. and what is cascade?

look

You must be registered to see the links

PandaRepublic said:
Is there a new tutorial to make LoRA's? It seems like my old settings don't work anymore. If so how many repeats do you use now? I used to use 100 now it just turns out horribly. Also, withs some of the LoRA's I create I get a NaNs error. Either that or something is wrong with Kohya on my PC.
View attachment 3372823 View attachment 3372825

Are you train style or person? you need to share more detail, kohya have many settings and understanding what's wrong in your training with so less information is hard. And on what model are you train? is XL or 1.5?

modine2021 · Feb 21, 2024

devilkkw said:
look
You must be registered to see the links

Are you train style or person? you need to share more detail, kohya have many settings and understanding what's wrong in your training with so less information is hard. And on what model are you train? is XL or 1.5?

hmmm... uses notebook. i don't use that

Mr-Fox · Feb 21, 2024

modine2021 said:
hmmm... uses notebook. i don't use that

Kohoya ss is a web ui specifically for training different types of models. It has dreambooth, lora, ti etc. Some useful tools for the preparation process such as making the captions. I assume that you are doing the training in A1111.
Kohya ss is far superior as far as I remember. I have only trained one lora that I consider a success, though I have done many practice runs and experimentation runs etc.
For just trying out training a lora I suppose A1111 is fine for this but if you are serious about it I think kohya ss is the better option.
Aitrepeneur kohyas ss lora tutorial:

You must be registered to see the links

The basic settings in this video is only to get you started, you need to figure out the best settings for your own scenario yourself.

This rentry guide is very useful:

You must be registered to see the links

Which ckpt model you choose to train your lora on is very important, it's best with a model that is responsive and consistent.
Don't use an ancestral sampler, you want consistency. Choose one of the well established classics, Euler, DPM++ 2m Karras, DPM++ SDE Karras etc.
The next thing is which optimizer you use, AdamW8bit or AdamW is good as a start.
Next is the learning rate, don't use a too fast setting as it tend to make an overtrained lora.
Then "Net dim (Rank)" settings, 128 for both is a good base but you can try lowering it slightly depending on what type of lora you are training such as style or character etc.
Learn about the concept or topic of dampening. It refers to settings that has the secondary effect of slowing down the learning rate.
There is a section about it in the rentry guide.

You must be registered to see the links

Something to try is a bit of "denoise offset", it makes the image sharper and more colorful but if you overdo it, the image can look "burnt". I used a very low setting (0.1) for my lora with good result but it's not strictly necessary. Consider using clipskip 2 as it might give better results.

There are some general recommendations in the section " Starting settings and "optimal settings" .

You must be registered to see the links

Something to keep in mind is that if you use bad images to start with you will not get a good result so be very selective in choosing your images for the data set. I think it's best to not use too many, 20-30 ish is a good number.
The captions are very important as well. It's fine to use the tool in kohya for auto captions as a starting point but it's well worth the time to go through them manually to adjust them. If your gpu can handle it go with 768 instead of 512 resolution.
It's not necessary to use 1:1 images it can be either portrait or lanscape just make sure to upscale and crop etc manually with photoshop or similar so you don't have a bunch of variations. you can have a some variations ofc, just not too much. Make sure to enable "photobuckets", this will take care of the variations for you.

Good luck.

Sharinel · Feb 21, 2024

Mr-Fox said:
It's not necessary to use 1:1 images it can be either portrait or lanscape just make sure to upscale and crop etc manually with photoshop or similar so you don't have a bunch of variations. you can have a some variations ofc, just not too much. Make sure to enable "photobuckets", this will take care of the variations for you.

Good luck.

On this very last point, I like to use

You must be registered to see the links

to crop any pics I have to the correct resolution for training. most of my loras these days are on SDXL so I train to 1024x1024.

Apart from that, I'm going to use everything that El Foxy has said here as I didn't do half of that

[Stable Diffusion] Prompt Sharing and Learning Thread

Member

Well-Known Member

Member

Well-Known Member

Active Member

Well-Known Member

Well-Known Member

Well-Known Member

Active Member

Well-Known Member

Newbie

Member

Well-Known Member

Member

Well-Known Member

Member

Member

Well-Known Member

Well-Known Member

Active Member