panchovix@alien.top to LocalLLaMA · English · 2 years ago

TabbyAPI released! A pure LLM API for exllama v2.

github.com

GitHub - theroyallab/tabbyAPI
github.com
Contribute to theroyallab/tabbyAPI development by creating an account on GitHub.
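TabbyAPI serves exllamav2 models behind an OpenAI-style HTTP API, so a client just POSTs JSON to a local endpoint. Below is a minimal sketch of building such a request; the base URL, port, model name, and API key are assumptions for illustration — check the repository's configuration for the real values.

```python
import json

# Hypothetical defaults -- TabbyAPI's actual host, port, and auth scheme
# may differ; consult the repo's config for the real values.
BASE_URL = "http://localhost:5000/v1"

def build_chat_request(prompt, model="my-exl2-model", api_key="secret",
                       max_tokens=200, temperature=0.7):
    """Build an OpenAI-style chat-completion request for a local server."""
    url = f"{BASE_URL}/chat/completions"
    headers = {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {api_key}",
    }
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
        "temperature": temperature,
    })
    return url, headers, body

# To actually send it (requires a running server):
#   import urllib.request
#   url, headers, body = build_chat_request("Hello")
#   req = urllib.request.Request(url, data=body.encode(), headers=headers)
#   print(urllib.request.urlopen(req).read().decode())
```

Because the wire format mirrors OpenAI's, existing client libraries can usually be pointed at the local base URL without other changes.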
  • a_beautiful_rhind@alien.top · 2 years ago

    Nice, a lightweight loader. It will free us from Gradio.

    • oobabooga4@alien.top · 2 years ago

      Gradio is a 70 MB requirement, FYI. It has become common to see people call text-generation-webui “bloated”, when most of the installation size is in fact due to PyTorch and the CUDA runtime libraries.

      https://preview.redd.it/pgfsdld7xw0c1.png?width=370&format=png&auto=webp&s=c50a14804350a1391d57d0feac8a32a5dcf36f68

      • tronathan@alien.top · 2 years ago

        Gradio is a 70MB requirement

        That doesn’t make it fast, just small. Inefficient code can be compact.

      • kpodkanowicz@alien.top · 2 years ago

        I think there is room for everyone. Text Gen is a piece of art; it’s the only thing in the whole space that always works and is reliable. However, if I’m building an agent and creating a Docker build, I cannot afford to bundle Text Gen, etc.
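The installation-size point above is easy to check in your own environment. A minimal sketch using only the standard library's package metadata (the package names queried are illustrative — use whatever is installed in your venv, e.g. torch or gradio):

```python
import importlib.metadata as md  # stdlib, Python 3.8+
from pathlib import Path

def package_size(dist_name):
    """Sum the on-disk size, in bytes, of a pip-installed package's files."""
    dist = md.distribution(dist_name)
    total = 0
    for f in dist.files or []:          # files may be None for some installs
        p = Path(dist.locate_file(f))
        if p.is_file():
            total += p.stat().st_size
    return total

if __name__ == "__main__":
    # Illustrative names; only query packages actually present.
    for name in ("pip", "torch", "gradio"):
        try:
            print(f"{name}: {package_size(name) / 1e6:.1f} MB")
        except md.PackageNotFoundError:
            pass
```

Running this against a text-generation-webui environment would show where the megabytes actually go.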

LocalLLaMA

localllama

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !localllama@poweruser.forum

A community for discussing Llama, the family of large language models created by Meta AI.

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 4 users / day
  • 4 users / week
  • 4 users / month
  • 4 users / 6 months
  • 1 local subscriber
  • 5 subscribers
  • 1.02K Posts
  • 5.82K Comments
  • Modlog
  • mods:
  • communick
  • BE: 0.19.11
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org