AI [AI Art] - Show Us Your AI Skill *NO TEENS*

4.80 star(s) 6 Votes

Jimwalrus

Well-Known Member
Sep 15, 2021
1,102
4,197
I've used Dream shaper and realistic vision...its probably off due to me using a celebrity face for "consistency"

View attachment 2642412
View attachment 2642413
I appreciate this isn't the Stable Diffusion prompt sharing and learning thread, but I've had a look at the prompts and parameters you used.
Here's some feedback, using that rather nice second image for examples:

Firstly, I'd recommend against using any celebrity pre-baked into Stable Diffusion. They're never any good if you want a true likeness and can often result in slightly wonky faces. In this instance it's not too bad and easily fixed - see later. Incidentally, good choice in using Irina - she has the face for 'Barbarian Queen" :D
Should you want a true likeness of a celeb, download a TI or LoRA from Civitai or elsewhere. Or train one yourself if there isn't one or it's not giving what you want.

Secondly, too many steps. 150 is far too many for Euler_a. Anything more than about 40 is probably into the realm of diminishing returns, 70-80 is the point at which anything more is largely a waste of time and risks introducing unwanted elements. Higher step numbers work better with some samplers, but Euler_a isn't one of them. Sometimes, less really is more!
Here are the same prompts etc, at 40, 60 & 80 steps instead:
00008-40steps 1120731776.png 00009-60steps 1120731776.png 00010- 80 steps 1120731776.png

Thirdly, if you have the facility, use Hires.fix within the initial generation process, not upscaling afterwards. Even if it's only to increase it by 8 pixels in each direction, it allows considerable improvement of the image. A far better use of your GPU (or Google Colabs tickets) than excessive generation steps.
Here is 50 gen steps, 50 Hires steps, ESRGAN_4x, 0.33 denoising strength, to just x1.05 upscale (so no real effort for the GPU):
00011-1120731776.png

Fourthly:
If you're going for anything like photorealism, try using Restore Faces. Doesn't always work, but usually does. IF the style is more cartoony then it screws it up, but for an image like this, it should help:
00012-1120731776.png

Fifthly:

Try using a few negative prompts. They can reinforce concepts as well as guard against things going wrong.
You can also download pre-baked negative embeddings from Civitai and other places.
Adding 'Unspeakable-Horrors-32v', 'bad-hands-5' and 'easynegative' makes for a marked stylistic change. Not 100% sure I like it in this instance, but it's worth bearing in mind. Also, there wasn't really anything wrong with the original image that needed fixing with negative prompts.
00013-1120731776.png

Finally, if you can, try changing the aspect ratio of the image. Keep one side as 512 pixels, then adjust the other.
You're going for a standing person, so best to have a taller image.
Here's 50 steps, 512x832, with 50 steps of x1.05 upscaling, no negative prompts:
00016-1120731776.png

Here's the final result, upscaled within the initial generation process by 3x to 1536x2496:
00018-1120731776.png
(Apologies everyone, I know it's a big file to download, but I wanted it to display in full image size)
This resolution is about the limit of my GPU without using Tiling.
 

felldude

Active Member
Aug 26, 2017
572
1,724
I appreciate this isn't the Stable Diffusion prompt sharing and learning thread, but I've had a look at the prompts and parameters you used.
Thanks for the feed back, I recreated the image using the same settings.

Without the scaling it took about 15 seconds to create the 512x832, and another 4 seconds to 2x scale with NGX
With R-Ersgan 4x+ it took just over 5 minutes to render
No doubt it creates better detail then just NGX scaling:

Side by side in ICAT ICAT.png

My main take away was I shouldn't use 512x1024 as that was causing my image to be duplicated rather then rendered in portrait mode :ROFLMAO:

Post note if those negitives you listed fix the hand issue I'll be sure to use that also
 
Last edited:
  • Like
Reactions: VanMortis

Jimwalrus

Well-Known Member
Sep 15, 2021
1,102
4,197
Thanks for the feed back, I recreated the image using the same settings.

Without the scaling it took about 15 seconds to create the 512x832, and another 4 seconds to 2x scale with NGX
With R-Ersgan 4x+ it took just over 5 minutes to render
No doubt it creates better detail then just NGX scaling:

Side by side in ICAT View attachment 2643758

My main take away was I shouldn't use 512x1024 as that was causing my image to be duplicated rather then rendered in portrait mode :ROFLMAO:

Post note if those negitives you listed fix the hand issue I'll be sure to use that also
Glad you've taken the feedback in the constructive and positive nature in which it was intended.

You can always do a small-scale upscale (say 110%) during the initial generation, then upscale separately afterwards. Probably gives best of both worlds in terms of quality improvements as it denoises and speed as it upscales.

I'd always recommend not using a multiple of 512 to go bigger, always step it up or down by 64 pixels at least. Even if you have one image at 512x1024 it's quite possible you'll end up with some absolute horror as it tries to create two images and stitch them together!

Nothing I've seen will "fix" the hand issue, but Bad Hands 5 seems to help for photorealistic images, Bad Hands 4 seems best for anime/cartoons.
 

daddyCzapo

Member
Mar 26, 2019
245
1,507
R-Ersgan 4x+ 2.0x is the most I can muster with an 8GB card, with NGX super scaling 4x (Down scaled back to 4k for the site)

Can you guess who Im cosplaying as:

View attachment 2643833
You can also export the hi resed image to img2img and use sd upscale script, but be sure to set denosing strenght to something smaller than 0.4 cause you might end up with something like this 00000-14.0.png 00001-14.0.png . That is basically my workflow right now. Generating 30-40 images with 512x768 with one prompt, then selecting best doing hi-res x2 on it and sending it to img2img for another upscalling by 2 or 2.5 times
 
4.80 star(s) 6 Votes