So this didn't go as expected...
FYI this went from a tutorial to a plea for help lol
Couldn't determine on a universal themed Big Brother thread so I guess I'll post this in all the modding threads so long as I get some positive feedback on the manner.. don't know who has thought of this or knew about this..
If the pros know this, then I provide for the noobs.
After I figured out how damn easy it was to produce any sequence of images so long as you had identified the dialog that belonged said sequence (thanks modders) the next step for me was to further the simplification process of going about all this.
Naturally, the first task to simplify this process would be to disable word wrap.
Before:
After:
The next thing to do is put all the sequences in a partial order. Not 100 percent order as there is no such thing within this game, but rather a more sensible order in that sequences follow each other and they do it in a way that follows the events of that chapter (or version). What I mean by this is DS currently is prefacing the scene identifier [ 05.aboutvic.0 ] with the number that corresponds to that version. In the example I used it is obviously 05. He hasn't been consistent in his scene identifier naming scheme, but it would appear that he is going to stick with the number prefix.
Using the TextFX plugin or features of Notepad++ (Pretty sure it comes stock with n++ as it doesn't exist within the plugins menu with all the other plugins; but still easy if I was wrong as there is a plugin manager that auto gets) we can use the ordering feature.
- First, CTRL + A to select all (mind you that when we start the sort this will take a while as there are tons of lines of text, so even if n++ appears to be not reponding just wait. Think it took me a min or two but I have i7)
- Go to TextFX -> TextFX Tools -> Sort lines as case insensitive (at columns)
dia.txt should now look like this:
The last thing that we can do is remove all the Russian text that only creates noise (IMO). We can achieve this with the awesome gift of the coding gods, RegEx, or regular expressions.
Now IDK the difference between Java RegEx, HTML5 RegEx or C# Regex, but I do know how to make regex work in n++ which I believe is a partial RegEx engine.
Bare with me..
After doing this writeup I realized I had my other one wrong. Now I ask for some help if we have RegEx ninjas in here by any chance. Only I would go from thinking I was helping to asking for help lol. At least I'm honest about it and don't change the nature of the message.
Here is the problem with my other RegEx:
05.aft2kir.7@100@
We need the first @, the one followed by #'s (100 in this case) to be ignored and start at the second
Since the scene identifier has two ampersands ' @ ', the RegEx finds the first and ignores the second as it reads all characters between ' @ ' and ' | '. Now from research, it appears that there is no way to tell RegEx to skip first occurrence of a match, ' @ ' in this case, and have it start at the second, the one we need it to stop at. I could have got this wrong thought, the idea that we can't do this.
If this is the case, unless you tell me otherwise, it seems we can use IF statement to say:
We want the ampersand that is followed by characters and not numbers; i.e. IF #'s follow @ ignore, if anything that is NOT numbers (I go this route because it seems to be easier than the inclusion of the RU Unicode character set) follows @ START.
I know stack exchange is more ideal for this, but I will not go there as I have no attempted equation as I am a RegEx noob, not a Ninja.
Here is how the text looks with the one grouping removed:
Well shit, I guess it still looks better no word wrap and sorted alpha... FML