Flux.1 Kontext is amazing.
The ability to give simple instructions for desired changes
WHILE STILL PRESERVING NEARLY ALL THE LOOK AND FEEL OF THE ORIGINAL when doing img2img is a game changer.
I was able to do the following two image edits within a half dozen re-renders and less than 10 mins while writing this message.
Original
Prompt: "Change so the man is sitting in a comfy chair. Keep the same facial features, hair style and clothing."
Prompt: "Change the man's pose so he is climbing a ladder. Also change the camera angle to look at the man's back. Keep the same facial features, hair style and clothing."
Here's two other random expression and pose changes from my tinkering yesterday:
This is a god-send for me.
I've been trying to get this sort of flexibility in posing or expressions, while still retaining close visual adherence to the base character image, for more than 12 months as I've worked on my game project. And so far I've made very slow progress with continual fallback to image editors (with my extreme lack of artistic talent) to try to scrape together the look I want to achieve.
Now, I've learned to do all these variations in a matter of a few evenings.
Simply amazing!
The only downside's I've encountered so far are:
- it's pretty slow: at least 5 times slower than SDXL
- only available with ComfyUI workflow wrappers, not automatic1111 (I still have not converted to comfy)
- not many LORA or similar tools available (yet), but...
It's very easy to argue that these negatives don't really matter much:
- speed doesn't matter if it's the ONLY "run in your local environment" tool so far with the capability to follow simple instructions more iike a LLM than the nasty "20 short phrases from danbooru clip" prompt style that StableDiffusion et al require.
- comfyui-only is not a barrier, I just gotta learn and get-good.
- The lack of LORA is not a big deal, because I can use a multi-step workflow with other models before or after the kontext step as needed. Besides, the output is not always "crisp" in my experience, so using sharpening/upscaling tool after is the norm for now anyway
Links:
Here's a video I used to understand how to install Flux Kontext into comfyui and use-
You must be registered to see the links
here's some more in depth written info on the various ways to use:
You must be registered to see the links