JoseConseco_@alien.top to LocalLLaMA • ExLlamaV2: The Fastest Library to Run LLMs • 1 year ago
So how much VRAM would be required for a 34B model or a 14B model? I assume no CPU offloading, right? With my 12 GB of VRAM, I guess I could only fit 14-billion-parameter models, and maybe not even that.
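For a rough figure, here is a back-of-envelope sketch (my own assumptions, not anything from ExLlamaV2 itself): weight memory is roughly parameter count × bits per weight, plus a couple of GB of overhead for the KV cache, activations, and the CUDA context.

```python
# Rough VRAM estimate for a quantized model (hypothetical helper, illustrative only).
# Assumption: weights dominate; a flat overhead_gb covers KV cache, activations, CUDA context.

def estimate_vram_gb(params_billions: float, bits_per_weight: float,
                     overhead_gb: float = 2.0) -> float:
    """Return an approximate VRAM requirement in GiB."""
    weight_gib = params_billions * 1e9 * bits_per_weight / 8 / 1024**3
    return weight_gib + overhead_gb

# Ballpark numbers for the sizes mentioned above, at common quantization levels
for size in (14, 34):
    for bpw in (4.0, 5.0):
        print(f"{size}B @ {bpw} bpw ≈ {estimate_vram_gb(size, bpw):.1f} GiB")
```

By this estimate a 14B model at ~4 bits per weight lands around 8-9 GiB, which fits in 12 GB of VRAM, while a 34B model at 4 bpw needs roughly 17-18 GiB and would not fit without offloading. Treat these as ballpark figures; actual usage depends on context length and the specific quantization.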