Look for a model better than MythoMax for Chat/RP

Maxumilian@alien.top · 1 year ago

Look for a model better than MythoMax for Chat/RP

Material1276@alien.top · 1 year ago

Heres a link to a up to date ranking of models for RP. Currently 400+ models ranked.

http://ayumi.m8geil.de/ayumi_bench_v3_results.html

spatenkloete@alien.top · 1 year ago

I really liked Echidna-Tiefighter. Characters act way more natural than with any other 13B model I tried.

Tacx79@alien.top · 1 year ago

What about Cat 13b 1.0? It slipped through here without much attention but it looks really good, with 16gb you could run q8

Herr_Drosselmeyer@alien.top · 1 year ago

with 16gb you could run q8

Not really though. Any kind of context will push you over 16gb. Or I’m doing something wrong.

Tacx79@alien.top · 1 year ago

GGUF? Even on gtx 1080 you get like 4t/s with q8 which is almost as fast as average person read speed, with 16gb it should be 4-5x faster

Herr_Drosselmeyer@alien.top · 1 year ago

Hadn’t thought of that. I have 24gb so I’ve always used GPTQ and with that, you really need more than 16gb.

BackyardAnarchist@alien.top · 1 year ago

these are my suggestions.

https://huggingface.co/TheBloke/cat-v1.0-13B-GPTQ

https://huggingface.co/TheBloke/Augmental-Unholy-13B-GPTQ

https://huggingface.co/TheBloke/HornyEchidna-13B-v0.1-GPTQ

and the one i keep coming back too but can barely run.

https://huggingface.co/TheBloke/MXLewd-L2-20B-GPTQ

WolframRavenwolf@alien.top · 1 year ago

Chat/RP is one of my main use cases so I test for that specifically - check out my latest LLM Comparison/Test which includes links to my previous tests.