Hyddro26@alien.topB to LocalLLaMAEnglish · 1 year agoWhat is considered the best uncensored LLM right now?message-squaremessage-square38fedilinkarrow-up11arrow-down10file-text
arrow-up11arrow-down1message-squareWhat is considered the best uncensored LLM right now?Hyddro26@alien.topB to LocalLLaMAEnglish · 1 year agomessage-square38fedilinkfile-text
minus-squareLienniTa@alien.topBlinkfedilinkEnglisharrow-up1·1 year agogguf goliath will give you best answers but will be very slow. you can unload like 40 layers to vram and your ram will still be a speed bottleneck, but i think 2 t/s are possible on 2 bit quant.
gguf goliath will give you best answers but will be very slow. you can unload like 40 layers to vram and your ram will still be a speed bottleneck, but i think 2 t/s are possible on 2 bit quant.