panchovix@alien.topB to LocalLLaMAEnglish · 1 year agoTabbyAPI released! A pure LLM API for exllama v2.github.comexternal-linkmessage-square6fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkTabbyAPI released! A pure LLM API for exllama v2.github.companchovix@alien.topB to LocalLLaMAEnglish · 1 year agomessage-square6fedilink
minus-squaretronathan@alien.topBlinkfedilinkEnglisharrow-up1·1 year ago Gradio is a 70MB requirement That doesn’t make it fast, just small. Inefficient code can be compact.
That doesn’t make it fast, just small. Inefficient code can be compact.