[Stable Diffusion] Prompt Sharing and Learning Thread

Dagg0th

Member
Jan 20, 2022
208
2,043
This might be of interest to some people.
View attachment 2951512

The above pic shows the same prompt/seed combination using 2 different checkpoints.
The left hand pic is using Dreamshaper 8 while the right hand is using EpicRealism.
The one in the middle uses both. It starts off with Dreamshaper, then uses the Refiner option in Automatic1111 to switch over to EpicRealism partway through. You can get some interesting outcomes depending on how you do the merging.
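
For anyone wondering where the hand-off between the two checkpoints lands, here is a rough sketch of how a "switch at" fraction divides the sampling steps. The function name `split_steps` and the rounding are assumptions made for illustration, not Automatic1111's actual code.

```python
def split_steps(total_steps: int, switch_at: float) -> tuple[int, int]:
    """Split sampling steps between a base checkpoint and a refiner checkpoint.

    switch_at is the fraction of sampling done by the base checkpoint
    (0.0 = the refiner does everything, 1.0 = the refiner is never used).
    The rounding used here is an assumption for illustration only.
    """
    if total_steps < 1:
        raise ValueError("total_steps must be at least 1")
    if not 0.0 <= switch_at <= 1.0:
        raise ValueError("switch_at must be between 0 and 1")
    base_steps = round(total_steps * switch_at)
    return base_steps, total_steps - base_steps


# Example: 30 steps, hand over to the second checkpoint at 40%.
base, refiner = split_steps(30, 0.4)
print(f"base: {base} steps, refiner: {refiner} steps")
```

A low fraction lets the second checkpoint decide most of the image, a high one keeps the first checkpoint's composition and only changes the finish.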
View attachment 2951501

Prompt is "beautiful female standing next to desk wearing __CC_female_clothing_set_business__****, deep cleavage, photorealistic, wide hips, closeup, textured skin, skin pores, looking down at camera, thicc thighs, gigapixel, 8k, cinematic, fov 60 photo of perfecteyes eyes, perfecteyes eyes, <lora:more_details:1> <lora:GoodHands-beta2:1>

****This is a wildcard, came out as Trousers and boat neck top
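
If the `__name__` wildcard idea is new to you, this is roughly what happens to the prompt before it is sent off. It is only a sketch: `expand_wildcards`, the `outfit_business` list and the fixed seed are made up for the example, and the real extension reads its lists from text files and may pick entries differently.

```python
import random
import re

# Matches __name__ tokens; the name itself may contain single underscores.
WILDCARD = re.compile(r"__(.+?)__")


def expand_wildcards(prompt: str, wildcards: dict[str, list[str]], rng: random.Random) -> str:
    """Replace each __name__ token with a random entry from wildcards[name].

    Unknown names are left untouched so a typo stays visible in the prompt.
    """
    def pick(match: re.Match) -> str:
        options = wildcards.get(match.group(1))
        return rng.choice(options) if options else match.group(0)

    return WILDCARD.sub(pick, prompt)


# Hypothetical wildcard list; a real setup would load these from a file.
wildcards = {"outfit_business": ["trousers and boat neck top", "pencil skirt and blazer"]}
rng = random.Random(1234)  # fixed seed (my choice) so the pick is repeatable
print(expand_wildcards("woman standing next to desk wearing __outfit_business__, photorealistic", wildcards, rng))
```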
I do something similar, but instead of using the refiner, I do the checkpoint switch in Hires. fix. I'll do a comparison of which one works better, stay tuned.
 

felldude

Active Member
Aug 26, 2017
533
1,537
I've installed both the and the for normally I like building my own pipelines but if anyone has a good SEGS setup or can link to one.

For those not familiar, it almost turns SD into DALL-E in the way it handles drawing objects, with less turning clouds into hair.

Here is a by someone with way more experience than I currently have using it. He also created a UI manager, but I haven't gone down that path.

Here is a showing a form of auto mask generation for inpainting, I have seen combinations of SEGS with Automasking and auto prompting from image detection but have never found a shared workflow.
 

me3

Member
Dec 31, 2016
316
708
Is there any other way of getting the same angle/level and "head to thigh/knee" framing as "cowboy shot" without the obvious issues that term causes in prompts?
Pure prompt only, NO lora, TI, controlnet, openpose, easy-whatevershit...
 

Mr-Fox

Well-Known Member
Jan 24, 2020
1,401
3,794
Here's an image with some useful photography terms that describe the composition. Instead of "cowboy shot" you can try "medium full shot". I guess you could also try to describe what should be included in the image:
"knees thighs torso and head in image" or similar phrasing.
Alternatively, simply use the DeepBooru style of tags: knees, thighs, torso, head (face).

1695918801769.png
 

me3

Member
Dec 31, 2016
316
708
Hmmm, forgot about "medium full"... it should fix all the cowboy dress-up issues. The question is whether the AI will know the term, or whether it'll split it up into "medium" and "full" in many cases.
So far it seems to be treating it as just a medium shot, with the occasional full.
This could obviously be down to the model and/or seed.
 

felldude

Active Member
Aug 26, 2017
533
1,537

Trained for native 1k generation and upscaling to 2k (no Hires. fix)

I thought, with BF16 and the ability to train beyond 1k, why not try to teach SD to put details in the right spots at high resolutions? (Even at .5 normal sampler img2img at 2k)

Nude female image training, non pornographic

ComfyUI_00440_.png
 

alij8000

New Member
May 6, 2018
4
20
anyone have a navel penetration lora or prompt? i have managed to create the scenes with controlnet but it takes too much time doing it manually since stable diffusion really doesn't understand the concept, understandably xD.
 

felldude

Active Member
Aug 26, 2017
533
1,537
How do I put two separate character LoRAs in one image?
Putting both in at the same time will combine the two. You can inpaint the face or body using masking or ADetailer.

Or simply run the image twice, once with one LoRA and once with the other, and combine the images in GIMP.
 

felldude

Active Member
Aug 26, 2017
533
1,537
Testing how the works with other LoRAs.
In this case


Native Generation in SD at 768x1024
ComfyUI_00658_.png


2k Upscale

ComfyUI_00659_.png

While it loses some of the body detail, considering 2048 is more than double the expected render resolution, I would be curious to see if you could train XL to generate at 2k and upscale natively at 4k. I don't have the PC specs for that though.
 

me3

Member
Dec 31, 2016
316
708
You should be able to do both 4k and 8k natively just fine, both with and without upscaling, at very little to no loss.
I can't remember what it is for XL, as that is far beyond what I could even hope to work with, so I've not been digging for info about it.
Also interested in seeing how Würstchen will work.

On a side note, that LoRA seems not to care about likeness, but considering they don't seem to have cropped out the text in the training data, it's probably too big an ask to make it look like the actual person.
 

felldude

Active Member
Aug 26, 2017
533
1,537
XL was trained at 1024x1024, and it appears Würstchen v2.0 was too. According to their paper, they have generated 1024x2048.
 

me3

Member
Dec 31, 2016
316
708
I know what they were trained at, that's not what I meant.
Anyway, with Würstchen I was thinking more about their claim of being much faster than XL and what that means for memory usage and compatibility with older systems.
Same with AITemplate I guess, but I can't set that up because ppl don't bother building things well, unfortunately.
 

felldude

Active Member
Aug 26, 2017
533
1,537
Read both papers and draw your own conclusion. Both models are limited to about 2x the trained latent image generation size even when finetuned.
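
To put numbers on the "about 2x" rule of thumb, here is a small sketch that compares a target size with the training size. The trained sides (512 for SD 1.5, 1024 for XL), the 1536x2048 figure for a "2k" upscale of 768x1024, and the downsampling factor of 8 are assumptions for SD-style models. Würstchen compresses its latents differently, so `latent_size` does not apply to it.

```python
def scale_vs_training(width: int, height: int, trained_side: int) -> float:
    """Return how many times larger the longest side is than the training side."""
    return max(width, height) / trained_side


def latent_size(width: int, height: int, vae_factor: int = 8) -> tuple[int, int]:
    """Latent width/height, assuming an SD-style VAE that downsamples by vae_factor."""
    return width // vae_factor, height // vae_factor


# Assumed example sizes; swap in whatever you actually render at.
examples = [
    ("SD 1.5 native", 512, (768, 1024)),
    ("SD 1.5 2k upscale", 512, (1536, 2048)),
    ("XL 2k", 1024, (2048, 2048)),
]
for name, trained, (w, h) in examples:
    ratio = scale_vs_training(w, h, trained)
    print(f"{name}: {w}x{h} -> latent {latent_size(w, h)}, {ratio:.1f}x trained side")
```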





Almost all the models are self-evaluated or AI-evaluated on how well they recreate the FID-COCO-30k set of images.
 

hkennereth

Member
Mar 3, 2019
232
746
Thought I could share a tip here for anyone looking to create consistent-looking characters without having to rely on LoRAs, using just some openly available checkpoint and prompting: if you ask for a mix of a few known celebrities, SD will create a person that merges the facial features of all of them into a "new" person, but this way you can pretty consistently output that same person.

For example, on every image below I had "mix of Sarah Shahi and Vanessa Hudgens and Nina Dobrev" as part of the prompt, with a few differences in the rest of the prompt for each picture to describe clothing, visual style, etc. Hope this helps (and that it wasn't already explored in the previous 100-something pages).

img_upscaled_0043.jpeg img_upscaled_0044.jpeg img_upscaled_0042.jpeg img_upscaled_0041.jpeg img_upscaled_0039.jpeg img_upscaled_0038.jpeg img_upscaled_0037.jpeg img_upscaled_0036.jpeg img_upscaled_0035.jpeg
 

Mr-Fox

Well-Known Member
Jan 24, 2020
1,401
3,794
You can also do this either with tag mixing or keyword weighting.

Tag mixing:

1696683032563.png
As you can see, it's essentially like using a "refiner". The first person is the main checkpoint and the second is the "refiner", but the weight is reversed. This means that if you want the "refiner" to have more impact, you need to increase the number. If you want the checkpoint to have more power, the opposite is true. Don't use "[ ]", use normal brackets "( )" instead. It tends to give you an error with SD if you use "[ ]".

Keyword weighting:

This can be used for blending more than 2 faces.
(Emma Watson:0.5), (Tara Reid:0.9), (Ana de Armas:1.2)
1696683348734.png
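
To make the `(text:weight)` syntax concrete, here is a minimal sketch that splits a prompt into text/weight pairs. It only handles the flat form shown above (no nested brackets, no "[ ]"), the names `parse_weights` and `WEIGHTED` are invented for the example, and it is not the web UI's own parser. The "face A/B/C" entries are placeholders.

```python
import re

# Matches a whole chunk of the form "(some text:1.2)".
WEIGHTED = re.compile(r"\(([^():]+):([0-9]*\.?[0-9]+)\)")


def parse_weights(prompt: str) -> list[tuple[str, float]]:
    """Split a comma-separated prompt into (text, weight) pairs.

    Chunks without an explicit "(text:weight)" wrapper get weight 1.0.
    """
    pairs = []
    for chunk in prompt.split(","):
        chunk = chunk.strip()
        if not chunk:
            continue  # skip empty chunks from stray commas
        match = WEIGHTED.fullmatch(chunk)
        if match:
            pairs.append((match.group(1).strip(), float(match.group(2))))
        else:
            pairs.append((chunk, 1.0))
    return pairs


for text, weight in parse_weights("(face A:0.5), (face B:0.9), (face C:1.2), smiling"):
    print(f"{weight:>4} {text}")
```

Anything above 1.0 gets more emphasis than an unweighted keyword and anything below gets less, so in this example the third entry is pushed hardest.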

Source:
 

Mr-Fox

Well-Known Member
Jan 24, 2020
1,401
3,794
Bonus tips..

You can generate a portrait with either SD1.5 or XL and let SD focus only on the face, regardless of whether it is a mix or not. This will give the face more detail. Then you simply use this face with the extension "roop" when you are generating your character. You can also use this method with img2img for "photobashing", meaning taking a photo and using "roop" on top of the photo to create fakes.
When using img2img, roop will not be part of the generative process; it will only paste the face on top of the existing bone structure. This will obviously not give a good result in all scenarios. For best results, use this method with txt2img when generating a new image.

Face:
00015-3350911957.png


Character using roop:
00021-2124235687.png

Using a photoshopped photo of Angelina with blonde hair in roop:
00019-2124235687.png

Without roop:
00023-2124235687.png