Mr-Fox
Well-Known Member
- Jan 24, 2020
- 1,401
- 3,811
- 321
Very nice images.Yeah, I tried a few different methods, my current one actually involves making an initial image without the lora, but using IP Adapter for a basic likeness with SDXL models to get a first pass with better colors and composition (but lower facial fidelity), and then using img2img with ControlNet to recreate that visual with SD1.5 + Lora, to add the likeness back to the image. I don't usually like using faceswap methods after because I feel they tend to make facial expressions look too bland for my taste, though I can't say I have really tried ReActor yet. But at least in my personal experience, and for my needs I feel that loras give me higher fidelity than any other method I have tried while giving me full control over character pose and composition on the prompt, and I usually don't feel the need to add another step to increase that fidelity.
While I won't say that I never have issues getting a good resemblance when using loras, I feel that it's usually the result of a lora that didn't get trained with the best source images or just the random nature of Stable Diffusion, but I have some loras that to me are basically as good as it gets, so the inconsistency problems you describe, again in my experience are less due to an issue with limitations of the technology, and more a case of bad implementation (i.e. poorly trained loras).
For example, here are some generations using my custom models compared to photos of the real women trained in the loras:
View attachment 3386539 View attachment 3386540
View attachment 3386541 View attachment 3386559
View attachment 3386569 View attachment 3386572
View attachment 3386573 View attachment 3386578
View attachment 3386579 View attachment 3386589
So yeah, that's why I don't think loras are going anywhere. If you want to be able to create images from the prompt without being restricted to get a likeness from pre-existing photo, loras are a lot more flexible and lead to better quality than the alternatives... to me at least. But my quality bar for likeness is perhaps higher than the average user, so don't take my words as gospel.
Yes you might very well be right that it's poorly trained loras and/or the general nature of SD that is the reason for inconsistencies, probably both. . An alternative that I already mentioned in a different post is faceswaplabs. It has some interesting features. Like reactor you can create a facemodel from a batch of images to use instead of a single input image, but unlike reactor it allows you to use an input image also with the facemodel, you can blend the facemodel with the input image and the generated image with a slider. So you can fine tune and balance the likeness more.
Ip adapter does get you there partially but it's not as accurate as a faceswap at least in my attempts.
It was along time ago I trained a lora and your conviction of it being superior inspires me to perhaps give it a go again. Good talk.