A couple of people have asked me to share my settings for solid roleplay on 7B models. Yes, it is possible. So here goes. I’ll try to keep this brief and concise while including every tweak I’ve learned so far.
So…
Step 1 - Backend
I’d generally recommend Koboldcpp, but currently the best you can get is actually kindacognizant’s Dynamic Temp mod of Koboldcpp. It works exactly like mainline Koboldcpp, except that setting your temp to 2.0 overrides the setting and runs the test dynamic temp mode. It actually has 2 other types of dynamic temp solutions built in, triggered at different set temperature values, but just set it to 2 and forget it imo; it seems to be the best of the 3. You can read about it here, explained by kindacognizant himself. Suffice it to say it’s excellent. In my experience it reduces (though doesn’t eliminate) repetition and looping thanks to increased word diversity, and it improves the model’s ability to respond to commands.
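To give a rough intuition for how a dynamic temperature scheme can work, here is an illustrative sketch. This is NOT kindacognizant’s actual implementation (the real mod’s mapping and parameters differ); it just shows the core idea of scaling temperature with the entropy of the next-token distribution, so confident predictions stay coherent while uncertain ones get more variety:

```python
import math

def dynamic_temperature(probs, min_temp=0.5, max_temp=2.0):
    """Scale sampling temperature with the entropy of the token distribution.

    High-entropy (uncertain) steps get a higher temperature for word
    diversity; low-entropy (confident) steps get a lower one for coherence.
    Illustrative sketch only -- the actual mod's formula differs.
    """
    entropy = -sum(p * math.log(p) for p in probs if p > 0)
    max_entropy = math.log(len(probs))  # entropy of a uniform distribution
    normalized = entropy / max_entropy if max_entropy > 0 else 0.0
    return min_temp + (max_temp - min_temp) * normalized
```

A fully uniform (maximally uncertain) distribution maps to `max_temp`, while a distribution that puts all mass on one token maps to `min_temp`.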
Even without the Dynamic Temp test mod, Koboldcpp would still be my recommendation due to its simplicity, fast run times, and lightweight nature. It’s a single standalone exe file! This makes it SO easy to upgrade and manage; it’s fantastic. Better yet, it’s very simple to write a quick batch file to launch your GGUF of choice with optimal settings. Here’s an example batch file.
REM Launch Koboldcpp minimized with your GGUF model of choice
cd "C:\*****YOUR DIRECTORY PATH*****\SillyTavern\koboldcpp\"
start /min koboldcpp_dynamictemp_nov21.exe --model MODELOFCHOICEFILENAME.gguf --port 5001 --gpulayers 32 --highpriority --contextsize 8192 --usecublas
REM Then launch SillyTavern, also minimized
cd "C:\*****YOUR DIRECTORY PATH*****\SillyTavern\"
start /min start.bat
exit
Copy that into Notepad and save it as a .bat file after editing. Change the directory paths to wherever you keep your Koboldcpp exe and your SillyTavern install. Change MODELOFCHOICEFILENAME to your GGUF model’s file name. If you have enough VRAM, change the gpulayers to 35; if it crashes when loading, lower the layers. If you aren’t using an Nvidia GPU you’ll need to change the usecublas bit too. You can find the arguments listed here. Your GGUF should be kept in the same folder as the Koboldcpp exe. I like to make a folder in my SillyTavern install location for the sake of ease.
Basically, inside my SillyTavern install folder I have a folder called “koboldcpp”, and inside that sits the single Koboldcpp exe, a single GGUF file, and the above batch file. Running that batch starts both Koboldcpp and SillyTavern (launching with their command windows minimized). SillyTavern auto-connects to Koboldcpp when set up as below. After this, all you ever have to do is swap out the Koboldcpp exe when a new version comes out, or change the GGUF name in the batch file if you ever switch models. Super easy, no hassle. Great. You never even need to look at Koboldcpp’s GUI if you don’t want to.
Step 2 - Front end
By consensus the best frontend for roleplay seems to be SillyTavern. I can attest to it being excellent with a breadth of options, addons and a sleek interface.
Once you’ve got it installed, check out the top bar. Click the 2nd plug icon, select the KoboldAI API, and hit the connect button while Koboldcpp is running. It’s as easy as that to connect! Check “auto-connect to last server” and it will auto-connect to Koboldcpp the next time you launch it. Job done.
Click the leftmost icon in the top bar. Here are the settings you need once it’s installed and connected to the Koboldcpp backend, if you use a Mistral-based 7B. Note the sampler order on the bottom is also changed, and that temp is set to 2.0 assuming you are using the Dynamic Temp test mod. If you aren’t using that mod, change temp to 1 but keep everything else the same. My Typical P is disabled (thus blank); you do this by setting it to 1 and saving. The next time you open SillyTavern it will look blank like mine.
Lastly, click the third “A” icon in the top bar. Select the Roleplay option in the top left dropdown and match all my settings here. Now click the dropdown in the bottom center of my image, alongside “Instruct Mode Sequences”. You want to change it to match this. BE SURE TO CLICK THE SAVE BUTTON TO THE RIGHT OF INSTRUCT MODE PRESETS IN ROLEPLAY NEAR THE CENTRE OF THESE OPTIONS, AS WELL AS THE SAVE BUTTON NEAR THE TOP LEFT CONTEXT TEMPLATE SETTINGS.
This is actually my personal contribution to improving the output of the model. It’s a “quality jailbreak” I’ve written that formats output and gets the model to behave better. I’ve been told you can enter this type of jailbreak as a zero-depth author’s note, but I’ve had no luck using that, whereas editing this field works for me. Here is the text so you can copy-paste it. Be very careful to match the format of that text box exactly.
You're {{char}}.
Exclusively describe all your emotions, actions, perceptions, surroundings, thoughts, and sensations in full, elaborate, explicit, graphic, lewd, and vivid detail. Be verbose, expressive, engaging, natural, authentic, and creative. Write multiple fresh sentences, paragraphs, and phrases.
Write your internal monologue in round brackets. Write your speech in quotations. Write all other text in asterisks in third person.
To explain a bit more about this… I discovered that the “system prompt” people generally use to instruct their models only appears once at the top of the context window. Thus it doesn’t have much strength, and models don’t really strictly follow instructions placed there. Editing the field I mentioned, however, places that text after every input, making it very effective for controlling the model’s output. There are drawbacks: apparently it influences the model so strongly it can break the model’s ability to call instructions, which can hamper addons. But I don’t use or particularly recommend any addons atm, so imo for the niche of roleplay it’s all upside.
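If it helps to visualize why the placement matters, here is a minimal sketch of instruct-style prompt assembly. The field names and separators here are simplified assumptions for illustration, not SillyTavern’s exact internals:

```python
def build_prompt(system_prompt, turns, last_output_sequence):
    """Assemble an instruct-style prompt, roughly as a frontend does.

    The system prompt appears once at the very top, so its influence fades
    as the conversation grows. Text placed in the last output sequence field
    is appended immediately before the model's next reply, so it is always
    the freshest instruction the model sees.
    """
    parts = [system_prompt]
    for speaker, text in turns:
        parts.append(f"### {speaker}:\n{text}")
    parts.append(last_output_sequence)  # the "quality jailbreak" lives here
    return "\n\n".join(parts)
```

No matter how long `turns` gets, the jailbreak text sits right next to the generation point, while the system prompt drifts ever further away.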
Step 3 - The choice of model
Lastly, the final step is selecting a model which responds well to the “quality jailbreak”. Generally, the better the model, the better its ability to follow the instructions I put in there.
Thinking along those lines, I have tested a ton of popular 7B models.
Some viable options include:
openchat_3.5 - OpenChat / OpenOrca version of the quality jailbreak
openhermes-2.5-mistral-7b - ChatML version of the quality jailbreak
openhermes-2-mistral-7b (I actually found the dialogue to be a bit better with the older model, go figure) - ChatML version of the quality jailbreak
dolphin2.1-openorca-7b - ChatML version of the quality jailbreak
All of the above models performed fairly well to varying degrees. However, from my tests I would recommend the following models for the best performance:
4th dolphin-2.1-mistral-7b - ChatML version of the quality jailbreak
Responds well to the instructions but I found it a bit bland.
3rd trion-m-7b - Alpaca / Roleplay version of the quality jailbreak
Solid, worth a try, quite similar to toppy.
2nd toppy-m-7b - Download Hermans AshhLimaRP SillyTavern templates, then edit them with the quality jailbreak
Hermans AshhLimaRP SillyTavern template seems to solve a brevity problem this model otherwise has when using the regular Alpaca / Roleplay version of the quality jailbreak. Very good output that you should certainly try. You might even prefer it to my number 1 choice.
1st Misted-7B - Alpaca / Roleplay version of the quality jailbreak
A model I’ve never heard anyone talk about, and wow. Its output is so good. It’s flavorful and follows the quality prompt the best of any model I tested, by a good margin.
I manually selected seeds 1-10. Here is its first response in each case. Note that in the 3 examples where its response is overly brief, a simple continue resulted in very good output.
I would HIGHLY recommend you download and try this model even if you have no interest in my quality mod or even roleplay. I imagine the model is simply very good.
In conclusion
If you follow all the steps I’ve laid out here, you will find that 7Bs are indeed capable of quite enjoyable roleplay sessions. They aren’t perfect, and Mistral still has issues in my experience once it goes a bit over 5k-ish context despite its 8k claims, but they are a lot better for roleplay than some people think, and they are only going to get better.
I’m still learning and tweaking things as I go along. I’m still playing about with my quality jailbreak to see if I can get it working better. If anyone has any other good tips or corrections to anything I’ve said please feel free to chime in.
Oh, and it goes without saying that the same field I use to input the quality jailbreak can be used for a lot of things. I saw someone ask how he could make his model respond less politely. It can certainly do that. I even made it finish all its responses with “Nyaa” as a test. One thing to note if you want to try out commands: use positive emphasis rather than negative. Don’t, for example, tell it “Don’t repeat or loop”. Imagine you are speaking to a person who is hard of hearing; such a person might well miss the “don’t” part and simply hear a command saying “repeat or loop”. That’s why I wrote “Write multiple fresh sentences, paragraphs, and phrases.” Don’t ask the model “not to be polite”, as it may simply latch on to “be polite”. Instead say something like “Be direct and straightforward.”
Anyway I’ve rambled on wayyy too much. Hope some people find this helpful.
Can’t you export and upload your settings? It’s kind of a pain to manually type all that
/u/reiniken has reminded me of one important point I didn’t touch on much.
It’s important to replicate the style you want the AI to write in, both in the first message and in your own replies, to help the AI keep replicating the format.
So write narration in 3rd person and add some bracketed thoughts in the introduction message of your card if you follow my guide.
That’s why in my examples I narrate myself in 3rd person. You don’t have to; the AI can keep to the format without you doing so, from my testing, but I think writing your own narration in 3rd person helps the AI keep its narration in 3rd person too. If it sees your narration in 1st person, it could be tempted to write its narration in 1st person.
Thank you for taking the time to write this up, much appreciated
You’re most welcome. It’s the least I can do to give a little back to the community, which has been so helpful to me.
Can you share what your character has for settings? I like how yours is displayed, but I don’t know how I’d set that up. Or an example if you don’t want to share specifics.
It’s actually not my card I just got it from chub.ai. It doesn’t have anything in it which formats the output style (other than the fact models will generally mimic the first message format). Which goes to show the power of the “quality jailbreak” I detail above! That’s what really drills the formatting into the model.
That said, I have made some minor modifications to improve the card (mostly typo fixes, a small modification to the scenario to make it more flexible, and one line added to the introduction to help the model learn the bracketed thoughts format).
Here is my modded card.
Here is the original on chub.ai.
Bookmarked! I’ll see what it says about Amy and my other characters. I spent a lot of time on their wording and am constantly optimizing it.
Speaking of optimizations for character cards, have you heard about Sparse Priming Representations (SPR)? I’ve experimented with it and while I’m not using it directly, I’m applying some of its principles to my cards, saving precious tokens.
This is absolutely amazing, but I have a question. Is there a way to make it consistently generate less text? I’m enjoying my RPs the most when the messages are a bit more on the simpler side (around 100 tokens), but these settings make the AI generate well past the 300 token target. I tried adding stuff like “around 100 words long” or “no more than 100 words” or even “limit yourself to 100 tokens” to the last output sequence, but nothing seems to work.
Hmm.
Well, there is the target length (tokens) setting in SillyTavern’s Advanced Formatting tab.
I’ve got it set to 200 as above, with the Response (tokens) setting set to 300.
The “target” is actually the setting I’ve got at 200; the setting at 300 is merely a “cap” it can’t go over.
So I’d start by changing the target length (tokens) to 100, and change your Response (tokens) cap to say 150-175 to give it a bit of wiggle room.
If that doesn’t work try removing the “be verbose” part of what I wrote if you are using that or edit this part to “Write multiple brief fresh sentences, paragraphs, and phrases.”
Everyone is so excited about this setting, anyone know offhand how it is presented to the backend?
I get good results depending on model asking for a size with some hyperbole, when I want a very short summary I ask for a one sentence summary and get the minimum ideas back, usually two or three to the point statements.
Consider what you ask for: a story or never-ending roleplay will likely return longer messages than “write a concise message to reply as {{Char}}. Do not write endings or drive toward conclusions”. Especially in controlling length, the words don’t trigger the results you’d expect; you gotta experiment with your lexicon a little.
You’re not going to be able to get a specific length always, but you should have good results by tuning in the direction you want until you get your desired output size more often with only outliers containing too much.
I guess the custom “JSON serialized array of strings” part of the “Instruct mode” settings is important.
I am sharing it here as plain text, so others just need to copy and paste:
["", "<|", "\n#", "\n*{{user}} ", "\n\n\n"]
Not going to lie, I updated these a while back when I was newer to the whole AI thing, based on a recommendation, and I had forgotten I even edited them until you just mentioned it.
Pretty sure I changed these because /u/WolframRavenwolf does it xD
Care to enlighten us why these are a good idea Mr wolf.
Most of these are (parts of) EOS (end of sequence) tokens. The model is supposed to send an EOS token to signal that inference is done, as without that, it would keep going until the max new tokens limit is hit.
Unfortunately some models, especially merges with different prompt formats, can get confused and output the wrong token or turn the special token into a regular string. In that case, adding that string (or a part of it) to the custom stopping strings list ensures that inference is properly concluding anyways.
In addition to that, I put the asterisk followed by username there to catch the model trying to act as the user. Just like how the software by default already includes the username followed by a colon, to catch the model trying to talk as user.
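For anyone curious what the frontend does with that list mechanically, here is a simplified sketch. Real implementations stream tokens and handle partial matches at the end of the buffer, which this ignores:

```python
def find_stop(generated, stop_strings):
    """Truncate generated text at the earliest custom stopping string.

    Mirrors, in simplified form, what a frontend's custom stopping strings
    do: if the model emits a stray EOS fragment like "<|", or starts acting
    as the user ("\n*{{user}} "), the output is cut there instead of
    running on to the max new tokens limit.
    """
    cut = len(generated)
    for s in stop_strings:
        idx = generated.find(s)
        if idx != -1:
            cut = min(cut, idx)  # keep only text before the earliest match
    return generated[:cut]
```

So a reply that degenerates into a leaked special token, or into the model speaking for the user, gets cleanly clipped.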
Unless you’re using some integration like Stable Diffusion or TTS, I would just use a prompt with the model itself. Not only is it much faster to generate responses, but it maintains better coherence, because SillyTavern tends to fill up the context window with the stuff it wraps around each response.
round brackets
I believe these are called parentheses.
Ah round brackets vs parentheses is one of those British vs American English things haha.
That said on paper parentheses probably should be the better choice as it should be less likely to be misinterpreted by the model.
I’m giving it a try with parentheses now, thanks!
Thank you, very useful!
Appreciated. If anyone has any issues with anything let me know.
The worst thing in my experience is the damn templates all these models have. So many unique templates with minor tweaks and some models are so sensitive!
I’ve literally given up on some models because I clearly couldn’t figure out the right template smh.
Same, bro, it’s just a mess. It’s like we have this feature, and it’s simple, but we’re intentionally making it harder…