so I got this shiny new GPU and I want to push it to the limit. What’s the most powerful, smartest model out there? Ideally something with as much long-term memory as possible. I’m coming off of ChatGPT 4 and want something local and uncensored

  • crackinthekraken@alien.topOPB
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 year ago

    I’m willing to wait for quality so that’s no problem!

    Where can I go to find these models? And how do I set them up and get them running?

    • Sea_Particular_4014@alien.topB
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      If you’re on Windows, I’d download KoboldCPP and TheBloke’s GGUF models from HuggingFace.

      Then you just launch KoboldCPP, select the .gguf file, select your GPU, enter the number of layers to offload, set the context size (4096 for those), etc and launch it.

      Then you’re good to start messing around. Can use the Kobold interface that’ll pop up or use it through the API with something like SillyTavern.