Serious inquiry: I've been tinkering a lot with finetuning and was wondering if it would be worth to buy a V100 of my own

holistic-engine@alien.top · 2 years ago

Serious inquiry: I've been tinkering a lot with finetuning and was wondering if it would be worth to buy a V100 of my own

ambient_temp_xeno@alien.top · 2 years ago

https://preview.redd.it/qo7pl73erp2c1.png?width=1703&format=png&auto=webp&s=ab2ecf26490fb6b73ee28497d2ea1610b754de59

az226@alien.top · 2 years ago

A6000 being worse than 3090 doesn’t make any sense.

freecodeio@alien.top · 2 years ago

So basically either 4090 or H100

holistic-engine@alien.top · 2 years ago

Yeah, perhaps If I am crazy enough I could just buy 3 of those and call it a day

FullOf_Bad_Ideas@alien.top · 2 years ago

I can’t corraborate results for Pascal cards. They had very limited FP16 performance, usually 1:64 of FP32 performance. Switching over to rtx 3090 ti from gtx 1080 got me around 10-20x gains in qlora training, assuming keeping the exact same batch size and ctx length, changing only calculations from fp16 to bf16.

ambient_temp_xeno@alien.top · 2 years ago

I’m not sure where this chart is from, but I remember it was made before qlora even existed.

aikitoria@alien.top · 2 years ago

Is there any such benchmark that includes both the 4090/A100 and a mac with M2 Ultra / M3 Max? I’ve searched quite a bit but didn’t find anyone comparing them on similar setups, it seems very interesting due to the large unified memory.

Mescallan@alien.top · 2 years ago

Man those h100s really are on another level. I shudder to think where are in 5 years.