I’ve been using self-hosted LLMs for roleplay, but I run into the same problems every time, no matter which model and parameter preset I use.

I’m using:

Pygmalion 13B AWQ

Mistral 7B AWQ

SynthIA 13B AWQ [Favourite]

WizardLM 7B AWQ

  1. It mixes up who’s who, and often starts to behave like the user.

  2. It writes in third person or as a narrator.

  3. Sometimes it generates exactly the same reply, word for word, back to back, even though new input was given.

  4. It starts generating something closer to a dialogue or screenplay script instead of carrying on a normal conversation.

Does anyone have any solutions for these?

  • Susp-icious_-31User@alien.topB · 1 year ago

    It’s just a low-parameter-count problem. If you’ve got the RAM for it, I highly suggest dolphin-2_2-yi-34b. Now that koboldcpp has context shifting, you also don’t have to wait for the whole prompt to be reprocessed every turn. Also make sure you’re using an instruct mode such as Roleplay (which uses the Alpaca format) or whichever official prompt format the model uses.
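
For anyone building prompts by hand rather than through a frontend preset, here is a rough sketch of what an Alpaca-style roleplay prompt could look like. The header and the `### Instruction:` / `### Response:` markers are the standard Alpaca template; everything else (the function names, the wording of the instruction, the `Name: text` transcript layout, and the `trim_reply` helper) is my own assumption for illustration, not what any particular preset actually sends.

```python
# Standard Alpaca header (no-input variant).
ALPACA_HEADER = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request."
)


def build_alpaca_prompt(system, history, user_name, char_name):
    """Build an Alpaca-format prompt for a roleplay turn.

    history is a list of (speaker, text) tuples, oldest first.
    The instruction wording below is a hypothetical example.
    """
    transcript = "\n".join(f"{speaker}: {text}" for speaker, text in history)
    instruction = (
        f"{system}\n"
        f"Write {char_name}'s next reply in first person. "
        f"Never write lines for {user_name}.\n\n"
        f"{transcript}"
    )
    # Ending on "{char_name}:" nudges the model to answer as the character.
    return (
        f"{ALPACA_HEADER}\n\n"
        f"### Instruction:\n{instruction}\n\n"
        f"### Response:\n{char_name}:"
    )


def trim_reply(raw, user_name):
    """Cut the generation at the point where the model starts a user turn."""
    marker = f"\n{user_name}:"
    cut = raw.find(marker)
    return (raw if cut == -1 else raw[:cut]).strip()
```

Ending the prompt on the character's name and trimming anything after a `\nUser:` line will not fix a model that is too small to follow instructions, but it may reduce how often the reply drifts into speaking for the user or into a multi-speaker script.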