I've had some pretty great results with a merge of Pygmalion and Vicuna I found through the KoboldAI group. I've only tried it in the NSFW story context so far.
It uses about 14GB of VRAM, so it fits fully in VRAM on a 16GB card, meaning pretty damn fast outputs (15-20 tokens per second for me). I didn't have to do anything crazy to get KoboldAI to recognize it, just had to make a folder for the model in the right spot. It's more prone to re-using phrases, but honestly it's as good as or better than Erebus 13B: the lows aren't nearly as bad (fewer incoherent freak outs that no non-schizophrenic author would write) and it doesn't chew up all of my VRAM.
The phrase re-use is standard AI stuff, honestly. Same goes for girls suddenly calling their partner sir, or a typo in someone's name turning them into a different character for the AI, or a missed tag on a world entry leaving the AI unable to identify/remember who this "captain" is.
The only downside to this model is that it will sometimes generate just a few words in a story instead of the full output length specified. I like starting sentences for the AI to finish, and sometimes it finishes those and goes "Right, my job here is done" after like 3 words and a period. Given that it generates so fast, though, clicking submit again ain't bad.