Messing around with Yi-34B-based models (Nous-Capybara, Dolphin 2.2) lately, I’ve been experiencing repetition in model output, where sections of previous outputs are included in later generations.

This appears to persist with both GGUF and EXL2 quants, and happens regardless of Sampling Parameters or Mirostat Tau settings.

I was wondering if anyone else has experienced similar issues with the latest finetunes, and if they were able to resolve the issue. The models appear to be very promising from Wolfram’s evaluation, so I’m wondering what error I could be making.

Currently using Text Generation Web UI with SillyTavern as a front-end, with Mirostat at Tau values between 2 and 5, or the Midnight Enigma preset with Rep. Penalty at 1.0.

  • Ravenpest@alien.top
    1 year ago

    No issues here, just a lot of confidence on certain tokens but overall very little repetition. I use Koboldcpp, Q5_K_M. Don’t abuse temp; the model seems to be exceedingly sensitive, and the smallest imbalance breaks its flow. Try temp 0.9, rep pen 1.11, top k 0, min-p 0.1, typical 1, tfs 1.

    • estacks@alien.top
      1 year ago

      I’ll have to try these settings. I have OP’s problems too, and I always have to crank the temperature up to get it to work. Then it gets schizophrenia a few messages later. Thanks!

      • Ravenpest@alien.top
        1 year ago

        High temp does more harm than good. I would suggest looking into what the other settings do before raising it, no matter the model.

    • HvskyAI@alien.top (OP)
      1 year ago

      I see, the model does tend to run a bit hot as-is. I’ll go ahead and try these settings out tomorrow.
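
For anyone wondering what the suggested settings (temp 0.9, rep pen 1.11, min-p 0.1) actually do, here is a rough, simplified sketch in plain Python. It is not the code used by Koboldcpp or Text Generation Web UI; the function names are made up for illustration, and real backends differ in sampler order and details. It only shows the general idea: repetition penalty pushes down logits of tokens already seen, temperature flattens or sharpens the distribution, and min-p drops tokens whose probability is below a fraction of the top token's probability.

```python
import math


def apply_repetition_penalty(logits, seen_token_ids, penalty=1.11):
    """Make already-seen tokens less likely (hypothetical helper).

    Positive logits are divided by the penalty and negative logits are
    multiplied by it, so the token becomes less likely either way.
    """
    out = list(logits)
    for i in set(seen_token_ids):
        out[i] = out[i] / penalty if out[i] > 0 else out[i] * penalty
    return out


def softmax(logits, temperature=1.0):
    """Turn logits into probabilities after dividing by the temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def min_p_filter(probs, min_p=0.1):
    """Zero out tokens below min_p * (top probability), then renormalize."""
    threshold = min_p * max(probs)
    kept = [p if p >= threshold else 0.0 for p in probs]
    total = sum(kept)
    return [p / total for p in kept]


# Toy logits for a 4-token vocabulary (made-up numbers).
logits = [4.0, 3.0, 1.0, 0.0]

cool = min_p_filter(softmax(logits, temperature=0.9), min_p=0.1)
hot = min_p_filter(softmax(logits, temperature=2.0), min_p=0.1)

print("temp 0.9:", [round(p, 3) for p in cool])
print("temp 2.0:", [round(p, 3) for p in hot])
```

With these toy numbers, the lower temperature lets min-p cut the two unlikely tokens entirely, while the higher temperature flattens the distribution enough that all four tokens pass the filter. That is one way to see why cranking the temperature can let low-quality tokens back in, even with min-p set.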