lans_throwaway@alien.topB to LocalLLaMAEnglish · 1 year agoLook ahead decoding offers massive (~1.5x) speedup for inferencelmsys.orgexternal-linkmessage-square4fedilinkarrow-up11arrow-down10cross-posted to: localllama
arrow-up11arrow-down1external-linkLook ahead decoding offers massive (~1.5x) speedup for inferencelmsys.orglans_throwaway@alien.topB to LocalLLaMAEnglish · 1 year agomessage-square4fedilinkcross-posted to: localllama
minus-squarewind_dude@alien.topBlinkfedilinkEnglisharrow-up1·1 year agoWhat would happen if you replace the decoder during finetuning? Would you also see a speed up but at the expense of vram?
What would happen if you replace the decoder during finetuning? Would you also see a speed up but at the expense of vram?