I really wish there were a site where you could plug in your hardware and see what t/s speed you could expect from it, so if anyone has a link like that, I’d be interested. I haven’t been able to find one, and I feel like I’m pretty much a noob when it comes to understanding which hardware specs matter for local fine-tuning, inference, and running models, so please bear with me as I ask a bunch of probably dumb questions.
Broadly and in order, I think single-GPU VRAM matters most (the more GB the better), then system RAM (same, but speed matters too, I think?), then PCIe bus bandwidth in GB/s, then additional GPUs (for roughly 60%, then 30%, then decreasing speedups from there), and finally CPU and/or NVMe space might matter a little. Does that sound broadly correct?
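From what I’ve read, single-batch generation is mostly memory-bandwidth-bound, so here’s the back-of-envelope I’ve been using in place of the site I wish existed. It’s a minimal sketch; the efficiency factor and example numbers are my guesses, not measurements:

```python
# Back-of-envelope tokens/s estimate for single-batch inference.
# Assumption: generation is memory-bandwidth-bound, so each token
# requires streaming the full set of model weights once.

def estimate_tps(model_size_gb: float, mem_bandwidth_gbs: float,
                 efficiency: float = 0.6) -> float:
    """Rough upper bound on tokens/s; efficiency is a fudge factor."""
    return efficiency * mem_bandwidth_gbs / model_size_gb

# Illustrative: a 13B model at 4-bit (~8 GB of weights) on a 3090
# (~936 GB/s of VRAM bandwidth):
print(estimate_tps(8, 936))  # ~70 t/s ballpark
# Same model spilled to dual-channel DDR4 (~50 GB/s):
print(estimate_tps(8, 50))   # ~4 t/s ballpark
```

If that heuristic is right, it’s also why VRAM comes first in my ordering: falling off the GPU costs you an order of magnitude.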
So the situation is I’ve got a ton of 30-series NVIDIA GPUs from a mining operation I wrapped up.
I could never sell them on r/hardwareswap or anywhere else, because nobody would buy in bulk, and I’m sure as hell not wasting my time selling and shipping 75+ individual GPUs to whoever. I do have racks and mobos and power supplies and whatever too, but I don’t think that matters. I also have a decent amount of 6800 and 6700 XT and 5700 XT AMD cards, but I don’t think that matters either; please correct me if I’m wrong.
I’d like to use as many GPUs as possible for local fine-tuning and inference, and am trying to figure out the best path for that. After reading about PCIe bandwidth and the diminishing speedups from a 2nd and 3rd GPU, I’m afraid the real answer is “sell some GPUs and buy an M2 Ultra Mac Pro” or something like that, but if we couldn’t go that route, what is the best path forward?
An EPYC server build with as many 3090s and 3080s as I can fit, and either 96 GB (2 sticks at full DDR5 speed) or 192 GB (4 sticks, which drops to roughly DDR4-class speeds) of RAM? Which RAM config is better? I think the DDR5 vs DDR4 speed actually makes a difference, but I’m not sure how much of a difference.
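If I’m doing the bandwidth math right, the difference only bites when layers spill out of VRAM into system RAM, since bandwidth is just channels × transfer rate × 8 bytes. A quick sketch; the two transfer rates below are my assumptions for the configs, not spec-sheet numbers:

```python
# System RAM bandwidth = channels * MT/s * 8 bytes per transfer.
# Both configs are dual-channel; 4 sticks just forces a lower clock.

def ram_bandwidth_gbs(channels: int, mt_per_s: int) -> float:
    return channels * mt_per_s * 8 / 1000  # GB/s

print(ram_bandwidth_gbs(2, 6000))  # 2 sticks at DDR5-6000: ~96 GB/s
print(ram_bandwidth_gbs(2, 3600))  # 4 sticks throttled to ~3600 MT/s: ~58 GB/s
```

So very roughly a 1.5x+ difference in offload speed between the two configs, if those clocks hold.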
Researching EPYC mobos, I think I can fit maybe 6 or 7 GPUs into an EPYC build; does that sound about right? Anyone know of any PCIe-rich mobos or architectures that I could fit notably more GPUs than that into? I do have a bunch of mining mobos, but don’t think they’re usable?
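Here’s my rough lane math for why 6-7 sounds plausible; single-socket EPYC exposes 128 PCIe lanes, and the lanes I reserve for other devices are a guess:

```python
# How many GPUs fit in a 128-lane EPYC lane budget at various widths.
TOTAL_LANES = 128
RESERVED = 16  # assumed set aside for NVMe / NIC / chipset

for width in (16, 8, 4):
    print(f"x{width}: {(TOTAL_LANES - RESERVED) // width} GPUs")
# x16: 7 GPUs, x8: 14, x4: 28 -- so physical slots, risers, and
# power are usually the real limit, not lanes.
```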
I’m pretty sure nothing like a Beowulf cluster of mining boards + GPUs is usable for model fine-tuning / running, is that correct?
I also have a Threadripper Linux box I could upgrade that can currently fit 4-6 GPUs, and I could upgrade to an AM5 mobo and a 7950X3D CPU pretty easily. I don’t know how this stacks up against an EPYC build; does anyone have any ideas on that?
I looked up my current Linux box’s mobo and the PCIe lanes only have 32 GB/s of bandwidth, so I think a mobo upgrade to AM5 with 128 GB/s would be necessary to get decent speeds. Does that sound right?
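For anyone else checking their board, here’s the per-slot math I used, if I’ve got it right. Per-lane throughput roughly doubles each PCIe generation; these are unidirectional figures, and I think the 128 GB/s number I quoted is a Gen5 x16 slot counted in both directions:

```python
# Approximate unidirectional PCIe bandwidth in GB/s per lane, by gen.
GBS_PER_LANE = {3: 0.985, 4: 1.969, 5: 3.938}

def slot_bandwidth(gen: int, lanes: int) -> float:
    return GBS_PER_LANE[gen] * lanes

print(slot_bandwidth(4, 16))  # ~31.5 GB/s: the "32 GB/s" on my current board
print(slot_bandwidth(5, 16))  # ~63 GB/s one way (~126 GB/s both directions)
```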
Sorry for all the questions and my general lack of knowledge; any guidance or suggestions on maximizing a bunch of GPUs are very welcome.
Magic number is about 8-10 in a single box, with a drop in perf when GPUs span CPUs. Probably needs risers to fit.
More than 6 is not really needed unless going for fine-tuning or full precision.
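Rough weights-only math for why ~6 cards covers quantized and even fp16 inference but fine-tuning blows way past it; the 70B model is just an illustration, and this ignores KV cache and activations:

```python
# Rough VRAM needs in GB (weights only) for a 70B-parameter model.
params_b = 70  # billions of parameters

print(params_b * 0.5)  # 4-bit quant: ~35 GB  -> 2x 24 GB 3090s
print(params_b * 2)    # fp16 "full precision": ~140 GB -> ~6x 3090s
print(params_b * 16)   # naive full fine-tune (fp32 weights + grads +
                       # Adam moments, ~16 B/param): ~1120 GB -> cluster
```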
Besides EPYC boards, old Xeons work as well. Supermicro made some servers for this. They can be had used, and you get your cooling and shiznit all in one package.
https://www.supermicro.com/en/products/system/4U/4029/SYS-4029GP-TRT.cfm
https://www.supermicro.com/products/system/4U/4028/SYS-4028GR-TRT.cfm
“sell some GPUs and buy an M2 Ultra Mac Pro”
lol, no. Or maybe yes. Make a GPU server and sell the rest to buy a 192 GB Mac for the best of both worlds.
I’m sure as hell not wasting my time selling and shipping 75+ individual GPUs to whoever
If these were all 3090s at ~$700 apiece used, 75 cards is about $52k of hardware. People don’t make that in a year.
Thanks much for the links to specific mobos, I appreciate it. With riser splitters, I can see fitting 10 GPUs in.
It really sounds like people think the “sell them and get real hardware” route is the best, but I really don’t have the time. Plus, they’re not all 3090s; there’s a ton of lesser 30-series cards like 3070s and 3060s too.
Standing offer to anyone in the thread: if you want to make an easy 25%, I’ll sell the lot of them to you for 75% of the average used price on eBay, and you can sell them all individually yourself.
Separate out the 3090s for yourself, keep a couple of lesser cards for fun stuff, and then put the rest up as a lot on eBay at their approximate price. Beats trusting random internet strangers.
I’m sure something like 10 3060s will sell if you price it right.
Yeah, I’m pretty much doing this (keeping all the 3090s and water-cooled 3080s), so your advice is solid.
When I first wrapped up my mining operation, I had my niece try selling a giant lot of the GPUs for a 15% cut (she sells stuff on eBay), and we didn’t get any takers in a couple of months with progressively lowered prices. She sold about $5k worth of cards individually, but that was it; it wasn’t really worth it.
Guess it’s worth trying again, splitting them into 3060 Ti and 3070 and whatever lots, next time I’m traveling to her city and can drop them off to her.
One GPU per layer is an interesting approach ;)
You could build an InfiniBand cluster. The 3090s would give you the most bang for the buck, though it’s a lot more work than trading out for A100s, and the extra hardware will cost you. You can get 9 GPUs on a single EPYC server mobo and still have good bandwidth. So we are talking about manually sourcing and building 10 boxes.
But unless you are training stuff and have cheap electricity, a cluster probably doesn’t make sense. No idea why you would need ~1800 GB of VRAM.
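For the math: ~1800 GB assumes every card in the lot were a 24 GB 3090. A quick sketch of the cluster sizing, where usable GPUs per box is my guess:

```python
import math

# Cluster sizing for the lot, assuming all 75 cards were 24 GB 3090s.
num_gpus = 75
vram_per_gpu_gb = 24
gpus_per_box = 8  # assumed usable per EPYC box after slots/power

print(num_gpus * vram_per_gpu_gb)          # 1800 GB total VRAM
print(math.ceil(num_gpus / gpus_per_box))  # 10 boxes to build
```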
No idea why you would need ~1800 GB of VRAM.
Homeboy’s waifu is gonna be THICC.
Thanks for pointing me to InfiniBand, another thing for me to research. Sounds like a high-bandwidth interconnect for coordinating nodes in a cluster, so sort of like the Beowulf cluster idea.
I actually do have cheap electricity thanks to solar plus a honking big LiFePO4 battery bank.
Is this what AWS and other places where you can rent time on H100s do? Have a bunch of A100 and H100 servers hooked up in arrays with InfiniBand?
Hold onto those 3090s. Juice Labs has a really interesting open-source project that lets you combine multiple GPUs into a virtual GPU over IP.
Going to be giving it a shot myself in the next few weeks. (I have a pretty decent understanding of it at this point; feel like trading a 3090 for setup assistance? 😉)