alchemist1e9@alien.topB to LocalLLaMAEnglish · 1 year agoExLlamaV2: The Fastest Library to Run LLMstowardsdatascience.comexternal-linkmessage-square22fedilinkarrow-up11arrow-down10file-text
arrow-up11arrow-down1external-linkExLlamaV2: The Fastest Library to Run LLMstowardsdatascience.comalchemist1e9@alien.topB to LocalLLaMAEnglish · 1 year agomessage-square22fedilinkfile-text
minus-squareCardAnarchist@alien.topBlinkfedilinkEnglisharrow-up1·1 year agoCan you offload layers with this like GGUF? I don’t have much VRAM / RAM so even when running a 7B I have to partially offload layers.
Can you offload layers with this like GGUF?
I don’t have much VRAM / RAM so even when running a 7B I have to partially offload layers.