100B, 220B, and 600B models on huggingface!

Illustrious_Sand6784@alien.top · 2 years ago

100B, 220B, and 600B models on huggingface!

You_Wen_AzzHu@alien.top · 2 years ago

We need some 4090s with 500gb VRAM modified in China if possible.

iCantHack@alien.top · 2 years ago

I wonder if there’s any real demand for even 48GB 4090s enough to incentives somebody to do it. I bet the hardware/electronics part of it is trivial, tho.

BangkokPadang@alien.top · 2 years ago

If people started doing this with any regularity, nVidia would intentionally bork the drivers.

mpasila@alien.top · 2 years ago

the devs mentioned that the 600B model takes about 1,3TB space alone…

9wR8xO@alien.top · 2 years ago

Make it 0.01bpm quantized and you will fit in good ol’ 3090.

MannowLawn@alien.top · 2 years ago

Give it 5 years with the Mac Studio. Next year 256gb, will go up real quick.

BangkokPadang@alien.top · 2 years ago

Honestly, a 4bit quantized version of the 220B model should run on a 192GB M2 Studio, assuming these models could even work with a current transformer/loader.

LocoMod@alien.top · 2 years ago

We need some hero to develop an app that downloads more GPU memory like those apps back in the 90’s. /s