chewbie@alien.top to LocalLLaMA • "Macs with 32GB of memory can run 70B models with the GPU" • 1 year ago
Does anyone know how many streams of LLaMA 2 70B an Apple Mac Studio can run in parallel? Does each completion need the same amount of RAM, or does llama.cpp manage to share it between the different streams?
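For reference, llama.cpp's bundled HTTP server exposes parallel decoding through its slot mechanism: the model weights are loaded once and shared by all slots, while each slot gets its own share of the KV cache. A minimal sketch of launching it with several slots (the model filename here is a placeholder; flag behavior as documented in `llama-server --help`):

```shell
# Serve several completions concurrently from one copy of the weights
# using llama.cpp's llama-server (model path is a placeholder).
# -c 8192 : total KV-cache context, split evenly across the slots
# -np 4   : four parallel slots, so each slot gets 8192 / 4 = 2048 tokens
# -ngl 99 : offload all layers to the GPU (Metal on Apple Silicon)
./llama-server -m llama-2-70b.Q4_K_M.gguf -c 8192 -np 4 -ngl 99
```

So per-stream memory grows only by that stream's KV-cache slice, not by another full copy of the 70B weights; the practical limit on a given Mac is how much total context fits in the remaining unified memory.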