@aliencaocao - Power User

0 Posts
1 Comment

Joined 1 year ago

Cake day: November 14th, 2023

You are not logged in. If you use a Fediverse account that is able to follow users, you can follow this user.

OverviewCommentsPosts

aliencaocao@alien.topBtoLocalLLaMA•NVidia H200 achieves nearly 12,000 tokens/sec on Llama2-13B with TensorRT-LLM
link
fedilink
English
arrow-up
1·
1 year ago
Batchsize 1024 though…not for personal use case

link
fedilink