I have been playing with the GenAI mod posted above, and tried to modify it to use img2img instead of text2img generation based on the original images in the game, which turned out to be relatively simple to do, and yielded pretty decent results - it's not perfect and required some prompt tweaking, but overall the quality of the graphics is improved IMO, while keeping a lot of the charm and character of the original. Below are some examples of original images, and what StableDiffusion was able to generate from them:
View attachment 4812057
View attachment 4812058
View attachment 4812059
View attachment 4812060
View attachment 4812063
View attachment 4812064
View attachment 4812065
View attachment 4812066
View attachment 4812110
I feed character name into the prompt, and its surprisingly consistent in the face it generates for each character, while also being able to generate diversity of types and faces. Using img2img also more closely follows the outfits (and poses as well) in the game, where the text2img tended to generate its own outfits based on the prompt, which were not always very similar, and kept changing with poses. Each approach of course has its advantages, well setup text2img can also be fun to play.
Also I have been using SDXL Lightning models to do this, which can generate usable results in as little as 4 steps, much faster than regular models. So far my best results were obtained with
You must be registered to see the links
using the LCM sampler, 7 steps (4-8 works), 1.75 cfg (1-2 works), 0.5 denoising strength (can vary this number to stick closer to the original, or give SD more freedom to deviate) and 768x1152 resolution, and also using ADetailer for face improvement.
The Lightning models work really fast, I can generate an image in a few seconds, which makes this much more usable when playing the game. Regular SDXL models that require 20-30 steps take significantly longer, slowing down the flow. Some other models I had good luck with were
You must be registered to see the links
,
You must be registered to see the links
and
You must be registered to see the links
My focus also was mostly on realistic images - have not played much with more stylized models, anime, etc.
If anyone is interested in the modified GenAI mod files, I can post them. It's a bit hacked together right now (was just messing with it for personal use), but the changes are relatively simple - anyone who got the a1111 API to work in the game, and has any basic python knowledge could easily get this going as well.
It could probably be cleaned up and incorporated into the mod at some point (maybe the author of the mod can consider adding img2img as an option)