Quantizing 70b models to 4-bit, how much does performance degrade?

ae_dataviz@alien.top · 2 years ago

Quantizing 70b models to 4-bit, how much does performance degrade?

Dry-Vermicelli-682@alien.top · 2 years ago

44GB of GPU VRAM? WTH GPU has 44GB other than stupid expensive ones? Are average folks running $25K GPUS at home? Or those running these like working for company’s with lots of money and building small GPU servers to run these?

MiniEval_@alien.top · 2 years ago

Dual 3090/4090s. Still pricey as hell, but not out of reach for some folks.

Dry-Vermicelli-682@alien.top · 2 years ago

So anyone wanting to play around with this at home, has to expect to drop about 4K or so for GPUs and a setup?

drifter_VR@alien.top · 2 years ago

I can get 2 3090 for 1200€ here on the second-hand market