A100 inference is much slower than expected with small batch size
currytrash97@alien.top to LocalLLaMA · English · 1 year ago · 2 comments