I know that vLLM and TensorRT can be used to speed up LLM inference. I'm trying to find other tools that do similar things so I can compare them. Do you guys have any suggestions? Here's what I have so far:
vLLM: speeds up inference (quick usage sketch below)
TensorRT: speeds up inference
DeepSpeed: speeds up the training phase
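For anyone comparing these, here's a minimal sketch of what the vLLM side looks like, using its offline batch generation API. The model name and prompts are just placeholders; swap in whatever you're actually benchmarking:

```python
# Minimal vLLM offline inference sketch (assumes `pip install vllm` and a CUDA GPU).
# The model name below is only an example; substitute the model you want to test.
from vllm import LLM, SamplingParams

prompts = [
    "Explain what PagedAttention does in one sentence.",
    "Why is batched LLM inference faster than one-at-a-time?",
]
sampling_params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

# vLLM batches the prompts internally and manages the KV cache with PagedAttention,
# which is where most of its throughput advantage comes from.
llm = LLM(model="facebook/opt-125m")
outputs = llm.generate(prompts, sampling_params)

for output in outputs:
    print(output.prompt, "->", output.outputs[0].text)
```

TensorRT takes a different route (compiling the model into an optimized engine ahead of time), so a fair comparison should measure both latency and throughput at the batch sizes you care about.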
Do you have any idea why MLC isn't a more widely used format? It seems so much faster than GGUF or ExLlama, yet everyone defaults to those.
That's an excellent question.