OldAd9530 to LocalLLaMA • "is there any other tools like vLLM or TensorRT that can be used to speed up LLM inference?" • 1 year ago

Do you have any idea why MLC isn't a more widely used format? It seems so much faster than GGUF or ExLlama, yet everyone defaults to those.