currytrash97@alien.topB to LocalLLaMAEnglish · 2 years agoA100 inference is much slower than expected with small batch sizeplus-squaremessage-squaremessage-square2linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareA100 inference is much slower than expected with small batch sizeplus-squarecurrytrash97@alien.topB to LocalLLaMAEnglish · 2 years agomessage-square2linkfedilink