[AI] Uncensored text generation via Oobabooga

Deleted member 1121028 · Jun 6, 2023

So lewd allowed.

Recommended :
- Install where you have some free disk space because models are quite huge.
- Nvidia card with 8Go of VRAM.
- This

You must be registered to see the links

to keep thing fresh.

You must be registered to see the links

and download/unzip one-click installer.
Click start_windows.bat.
Select your type of GPU, when asked to download a model select "Do not download a model".
Wait for it to finish completely (can take some time).
Once fully installed you should see this line :

Open your browser, type the URL and bookmark it.
(If you struggle with installation

You must be registered to see the links

)

Model VRAM usage :
Bigger the better, but choose a model that suits your card accordingly.
For exemple, if you have a 8Go VRAM card, choose a 7 billions model (7B).
(Watch

You must be registered to see the links

to reduce VRAM usage at the cost of time it takes to generate text).

Installing a model :
In the model tab, go to the huggingface model page and paste the model name and click download.
Once downloaded, refrest models list and select it.
Model can take a long time to load (30B one take around 10min on a 3090)

In the same tab change wbits & group values to the model you use :
EDIT: If your model & Oobabooga/text-generation-webui are up-to-date and contain a quantize_config.json file, you may skip this.

Uncensored models (4bits quantized version) :

Vicuna

You must be registered to see the links

(4 wbits, 128 groupsize)

You must be registered to see the links

(4 wbits, 128 groupsize)

You must be registered to see the links

(4 wbits, None groupsize)

WizardLM

You must be registered to see the links

(4 wbits, 128 groupsize)

You must be registered to see the links

(4 wbits, None groupsize)

SuperCOT/StoryTelling

You must be registered to see the links

(4 wbits, None groupsize)

Pygmalion (you need to

You must be registered to see the links

)

You must be registered to see the links

(4 wbits, 128 groupsize)

You must be registered to see the links

(4 wbits, 128 groupsize)

Saki_Sliz · Jun 6, 2023

This is awesome! def going to have to explore this now that it looks like a proper community is forming.

Deleted member 1121028 · Jun 9, 2023

Something like one day after I posted, Oobabooga/text-generation-webui got updated. If the model folder contain a quantize_config.json file, you shouldn't have to set GPTQ (wbits/groupsize) parameters anymore (it's automatic).

Saki_Sliz said:
This is awesome! def going to have to explore this now that it looks like a proper community is forming.

For the moment I'm having quite good results from my few hours testing (not that much sadly, hope I have more time this week end). I think a huge part of the community is trying to make kinky sex bots, which I'm not really interested in (no shame tho). I think of using it more in a typical VN context, as it should provide some non negligible help for not-english-native speaker.

I was thinking of making a small Renpy prototype, say around 500 renders, with at least ~90% of text wrote or recontextualized via AI and post the results here, so people have an insight of what to expect. If I find some time tho lol.

Saki_Sliz · Jun 10, 2023

Yeah, I watched more videos, many want rp bots... but I want a sort of respectable robot butler character

I've been falling behind on AI research, I can see that many of the LLM are able to use different styles of text to describe acitons, I wonder if I could get the AI to use API code so that it could control characters in a game. I know there was a research paper or article about ai character setting up a valentines party, but that was a rather large LLM that took lot of time to run, and a lot of the other RP AI assets I see on unit take some time to process. I love learning more about the smaller 6B and 7B parameter models so that they can run on more common hardware and quicker... as for the issue with token context, I think thats an architectural mis design that comes form the early LLM design, once the AI can be given a method to export text, or simplify it, and then save it on the hard drive, token limit won't be the issue anymore... but I don't know how many people realize that the work around is needed yet. my own AI was originally going to do something similar, but I wasn't working with LLM, and maybe figuring out a simpler LLM at least for contextual comprehension would really help my own AI projects.

Deleted member 1121028 · Jun 10, 2023

To be real, nothing wrong with falling behind AI research, there is a new paper almost every week, if not days. Sometime I can "get" them, sometime they are way out of my league. I would add, hysteria around it doesn't help, either doomer or sycophante. Fuck that parasitic noise tbh.

I think the problem with RP bots is the consensual approach (as "reach fastest consensus within context"), lacking real negative weight (even using various tricks/formating) and mostly limited token context (there is a censored 4bits model with 65k+ token context

, almost a small novel). But I may not have tested enough, training your

You must be registered to see the links

may significantly change the ouput (we need a candidate to scrap and parse text of the 10000 VN porn game of this site).

I think 4 bits quantization is here to stay for global use (of course outside major breakdown). I've seen Japaneses using AI into RPGM via api (it's gibberish but rather intersting). I'm not really intersed in this but you should look at how

You must be registered to see the links

or

You must be registered to see the links

manage the api.

I can't verify right now but Oobabooga/text-generation-webui save all logs in json format, check your folders. It's a bit off topic (not that much) you but you should definitely test

You must be registered to see the links

with Blender if you have time (it blow my mind lol).

Deleted member 1121028 · Jun 10, 2023

no__name said:
I was thinking of making a small Renpy prototype, say around 500 renders, with at least ~90% of text wrote or recontextualized via AI

Alright I'm game. (I will regret that.)

Saki_Sliz · Jun 12, 2023

I've been using PygmalionAI. Very good at keeping things going and generating ideas... not so good at staying consistent or sane XD but its fun to play with.

Furry_Desirer · Jun 15, 2023

Well... i'm done downloading everything with one-click installer but when I started again and it said "Conda environment is empty". How to I fixed that?

Deleted member 1121028 · Jun 16, 2023

So here my testing after a week, it's bit of a rambling but I'm dead tired.
Daz/Renpy usual storytelling setup, around ~150 renders (4 differents scenes with different chars)

Models I tested (no LoRAs) :

First it's a bit chaotic to find a model that suits you.
Model are often forked, model card (descriptions) not updated or hidden somewhere else.
Sometime you don't know which LLM trained with what other LMMs, at what ratio and what not.
There is not much metrics on global performance (it's annoying).
For exemple you could see a 13B model performings slightly better than a 30B one (so don't stress the model you use is lower):

Long story but short :
The best model I could find for lewd visual novels was

You must be registered to see the links

(mostly using instruct mode).
It should fit a 8Go VRAM card. Give as much context as you can (!). I use NovelAI-Storywriter as generation parameters preset. Model is very good if you're like me sometime lagging behind your renders.

The + :
Above everything I tested.
Long descriptions despite sometimes low context.
Very great to enrich the text in one click.
I learned more vocabulary in a week than casualy browsing the internet.

The - :
Dialogue generally.
Even chit-chat, I feel it's bad or require more time to ajust context than just write the dialogue itself (sadge).

Saki_Sliz said:
I've been using PygmalionAI. Very good at keeping things going and generating ideas... not so good at staying consistent or sane XD but its fun to play with.

You test it with TavernAI? I think that's where that model shines o/

Longawas said:
Well... i'm done downloading everything with one-click installer but when I started again and it said "Conda environment is empty". How to I fixed that?

Something obviously went wrong. Not sure I can help you.
Delete everything. As matter of reducing problems (assuming you're using win10/11)

Use the most simple path ever (D:\TextGenAi\Oobabooga, for ex)
no special chars, no space, no shaningans. And run the installer as admin.

Deleted member 1121028 · Jun 16, 2023

Addentum:

If you want to run

You must be registered to see the links

(that should generate code, I won't test it lmao but model knows Renpy and focused on python). Comment out AutoGPTQ_loader.py like that (as a quick fix) :

Deleted member 2282952 · Jun 17, 2023

Oh, that's really nice, thanks, will play around.

Furry_Desirer · Jun 17, 2023

Something obviously went wrong. Not sure I can help you.
Delete everything. As matter of reducing problems (assuming you're using win10/11)

Use the most simple path ever (D:\TextGenAi\Oobabooga, for ex)
no special chars, no space, no shaningans. And run the installer as admin.

It still doesn't work but thanks anyway. I don't know if someone installs it successfully and uploaded the whole file somewhere

Deleted member 1121028 · Jun 18, 2023

Longawas said:
It still doesn't work but thanks anyway. I don't know if someone installs it successfully and uploaded the whole file somewhere

Can't help much but have you tried to

You must be registered to see the links

?

Deleted member 1121028 · Jun 30, 2023

So finally find closure on that. Finished a small prototype but too trashy to share (even sticking to 1080, speed running the whole thing on a 3090 but still really lacked time to make something decent). Here a bit of stupid rambling about text generation with open source AI & VN :

Went further trying to exploit 8k token model context (

You must be registered to see the links

, more context should provide cleaner results) but overall results were worst (!). I feel people pushing model numbers on dataset without any kind of user utilisations. The jack-of-all trade AI that know everything is clearly detrimental to the whole (but give spicy numbers).

As for image generation, there is not much intelligence in artificial intelligence, it's fully mimicking grammar structure on what it's been trained. Anthropomorphic feelings will fill the void o/. It was expected somehow, but I needed to see it lmao. (in before sentient AI-agi take the world

)

Deleted member 2282952 · Jul 1, 2023

I have also extensively tested it, 24 GB VRAM is apparently not enough, and with a cap that prevents fallback, the processing time is just impossibly long.

Personality prompting is kinda good, but can't really tell because there is a significant limit on the input size it can process.

There is potential, but unusable right now.

EDIT: the 30B model, didn't see your 13b message - will check it out

Deleted member 2282952 · Jul 1, 2023

Sooooo, after spending more time on it, this is the quickest and most functional model for me:

You must be registered to see the links

The 30B+ models are just nuts. Take way too long, even on powerful systems.

But if you wanna go wild and get 1 response every hour unless you run a crypto mining farm:

You must be registered to see the links

Deleted member 1121028 · Jul 1, 2023

FunDi said:
I have also extensively tested it, 24 GB VRAM is apparently not enough, and with a cap that prevents fallback, the processing time is just impossibly long.

Personality prompting is kinda good, but can't really tell because there is a significant limit on the input size it can process.

There is potential, but unusable right now.

EDIT: the 30B model, didn't see your 13b message - will check it out

Yeah to use 30/33B-8K model on 24Go VRAM, you need to limit it to 4k context (max_seq_len to 4096 instead of 8192, and compress_pos_emb to 2 instead of 4). I found ExLlama to be magnitude faster than AutoGPTQ. If you get 1 hour response something is going wrong (even on 30/33B model). Do you see your CUDA cores being used?

I do like Chrono-Hermes model 'cause it has been trained to make long descriptive sentence, not so bad even in low-medium context (that what I was looking for). I use it in insctruct mode, so not "chat" so to say.

I found adding too much context being counter productive (at least right now). Higher hallucinations rate, grammatical structure crumbling... It's even weirder when you reach full 8k tokens context as it seems to bleed almost raw output from dataset (?). I guess it's still highly experimental but I was curious.

I think right now there is two majors problems (I may be completely wrong it's just ramblings). LLM merged on another LLM merged on another LLM, and so on... To make jack-of-all trade model instead of focusing on one task. And poor quality of dataset (main problem imho), as most open source model are trained on public available ones.

There also a feeling of pushing bigger and bigger numbers (like scaling is all the rage, Amazon and Nvidia must have a smile lol) - that looks good on paper and maybe attract fundings that way - but with no other purpose of being bigger.

I guess is too early to say anything and it's definitely had potential - just a bit bummed that there is very little intelligence in the artificial intelligence, at least right now. I wanted SkyNet, I got google advanced search

Deleted member 2282952 · Jul 1, 2023

Alrighty, after playing around with settings and models, the 13B Vicuna model is amazing, but SLIGHTLY bad when it comes to writing dialogue, because it would just randomly ignore very specific context and just go and do its own thing.

This has been my favorite story it generated. It can write about anything, no matter how messed up the request, so fun (I will delete if this stuff is not allowed):

no__name said:
I do like Chrono-Hermes model 'cause it has been trained to make long descriptive sentence, not so bad even in low-medium context (that what I was looking for). I use it in insctruct mode, so not "chat" so to say.

Too moralistic for my tastes.

no__name said:
I found adding too much context being counter productive (at least right now). Higher hallucinations rate, grammatical structure crumbling... It's even weirder when you reach full 8k tokens context as it seems to bleed almost raw output from dataset (?). I guess it's still highly experimental but I was curious.

Unfortunately, yes. I learned that ChatGPT is pretty good in helping you prompt the Vicuna model better more in CompSci terms (bunch of requests in IF statmenets)

no__name said:
I think right now there is two majors problems (I may be completely wrong it's just ramblings). LLM merged on another LLM merged on another LLM, and so on... To make jack-of-all trade model instead of focusing on one task. And poor quality of dataset (main problem imho), as most open source model are trained on public available ones.

There is so much info I am not even bothering with how they are trained, since in my understanding, it is a pretty boring exercise of labeling - boooring

no__name said:
There also a feeling of pushing bigger and bigger numbers (like scaling is all the rage, Amazon and Nvidia must have a smile lol) - that looks good on paper and maybe attract fundings that way - but with no other purpose of being bigger.

It's like man with small dicks buying huge cars, you spend more money, but the part you care about still doesn't work ... unless you can invest 10k for an enlargement surgery (very complicated metaphor).

Though, if I would be loaded, I would get 3 SLI RTX 4090 and be the king of the universe.

no__name said:
I guess is too early to say anything and it's definitely had potential - just a bit bummed that there is very little intelligence in the artificial intelligence, at least right now. I wanted SkyNet, I got google advanced search

Better contextualization and optimization - I think that the mainstream future of AI are generalized moderated web services and open source local applications that are completely unrestricted BUT limited by hardware.

I think 'intelligence' is the wrong word in AI - the current iteration is an ability to perform a singular task with an average result being better than human, or at least decreasing the total time required to complete the task with less effort and less requirements for original skills. I kinda phrased it in a complicated way, but this is they way I see it.

Deleted member 2282952 · Jul 2, 2023

This model is equivalent to GPT-4!

You must be registered to see the links

Ocdil · Jul 3, 2023

FunDi said:
This model is equivalent to GPT-4!

You must be registered to see the links

Great, thank you man, but I have an RTX3070, isn't there a 7B version?

[AI] Uncensored text generation via Oobabooga

Well-Known Member

Well-Known Member

Well-Known Member

Well-Known Member

Well-Known Member

Well-Known Member

Well-Known Member

Newbie

Well-Known Member

Well-Known Member

Developing I SCREAM

Newbie

Well-Known Member

Well-Known Member

Developing I SCREAM

Developing I SCREAM

Well-Known Member

Developing I SCREAM

Developing I SCREAM

Member