You are already missing an important detail here. Where will the painter take "a good amount of text"? Without it it will be a bunch of low connected or not logical images.
There's any number of game engines out there that will let you throw together the bare bones of a game. Even HTML will work. I'm not saying even something relatively simple doesn't have a learning curve and is easy to do. But I think being able to do good renders is harder, because it requires both skill and some artistic ability. Many devs won't even try to do their own renders, and will instead opt for "real porn" or stuff posed in video games. Or they just go text.
As I said, I don't need a fancy interface. I'm happy with some images with some text under them, and even just hitting space bar or hitting a number on the keyboard to select an option. I played TADS, Adrift, and other largely text-based games for years before people like Goblinboy came along and revolutionized things with games with professionally rendered graphics. I kind of miss the days of kiss, rub x 3, lick x 3, fuck. I get if you're trying to make a game for Steam of something like that, but otherwise if you have good renders and a halfway decent story, you'll get followers, regardless of what engine you use or how well you use it. I'd rather play a game from a guy that can create great renders but can barely code over a guy that's just a great coder but can't render worth crap.