I thought I'd explain how the AI chatting works. The recommended LLM is a 11b which could be ran easily on 16gigs of VRAM. I'm using Kobold.
It seems the initial prompt is around 1000 tokens long. I suspect running the LLM at a context limit of ~8000 is fine. The prompt included all the stats of the characters. You can type in whatever you want in your communication with the characters in game and the LLM will generate the response. Obviously an 11b is not SUPER smart but it's good enough.
Sometimes it works fine and other times it crashes the game.
Whatever you type into the chatbox is weighed against the character's likes/dislikes and adds or takes away from the friendship/love points you have with the character.
They used the LLM in clever ways in the game so all the information you get in regular game chat could be gained in AI chat. If you ask for a phone number and the AI responds by saying yes, you get that character added to your contacts list in game + a friendship point boost. The same happens when you ask for interests. However, it's not a back and forth conversation. You're the one always asking the questions due to how the AI works.
I'm a little too lazy to organize this prompt, but this is the information that's sent to the AI so they can generate a response.
You can describe your character's actions and the other character you're talking to will respond with described actions of their own.