As requested, this is the subreddit’s second megathread for model discussion. This megathread will now be posted at least once a month to keep discussion current and reduce duplicate posts.

I also saw that we hit 80,000 members recently! Thanks to every member for joining and making this happen.


Welcome to the r/LocalLLaMA Models Megathread

What models are you currently using and why? Do you use 7B, 13B, 33B, 34B, or 70B? Share any and all recommendations you have!

Examples of popular categories:

  • Assistant chatting

  • Chatting

  • Coding

  • Language-specific

  • Misc. professional use

  • Role-playing

  • Storytelling

  • Visual instruction


Have feedback or suggestions for other discussion topics? All suggestions are appreciated and can be sent to modmail.

^(P.S. LocalLLaMA is looking for someone who can manage Discord. If you have experience modding Discord servers, your help would be welcome. Send a message if interested.)


Previous Thread | New Models

  • ehlowrld@alien.top · 1 year ago

    TheBloke/mistral-7B-finetuned-orca-dpo-v2-GGUF

    Makes most 13B models bite the dust. I use it for a local application, so inference is CPU-only using llama.cpp with CLBlast support compiled in. It generates about 10 tokens/sec on a Dell laptop with an Intel i7.
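
For anyone wanting to reproduce a similar CPU-only setup, here is a minimal sketch using llama-cpp-python (the Python bindings for llama.cpp). The GGUF filename, quantization level, context size, and prompt are assumptions; substitute whichever file you download from the TheBloke/mistral-7B-finetuned-orca-dpo-v2-GGUF repo above.

```python
# Minimal CPU-only inference sketch with llama-cpp-python.
# The model filename below is an assumption; use the quantized GGUF file
# you actually downloaded from TheBloke/mistral-7B-finetuned-orca-dpo-v2-GGUF.
from llama_cpp import Llama

llm = Llama(
    model_path="mistral-7b-finetuned-orca-dpo-v2.Q4_K_M.gguf",  # assumed filename
    n_ctx=2048,    # context window
    n_threads=8,   # match your CPU core count
)

output = llm(
    "Q: Explain what DPO fine-tuning is in one sentence. A:",
    max_tokens=96,
    stop=["Q:"],
)
print(output["choices"][0]["text"].strip())
```

Throughput will vary with the quantization chosen and the number of threads; the roughly 10 tokens/sec figure above is for the commenter's specific laptop.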