marblemunkey@alien.top to LocalLLaMA • Question about GGUF, gpu offload and performance • 1 year ago
It’s been a couple of months since I used partial (less-than-complete) GPU offloading. When I was using my Alienware laptop (8th-gen i7, 2060 6GB) to run 13B models with 13 of 25 layers offloaded, I was getting 1-2 t/s, so yours sounds low.