How bottlenecked are LLMs by CPU clock? (Budget options to host multiple GPUs)

Infinite100p@alien.top · 2 years ago

Imaginary_Bench_7294@alien.top · 2 years ago

So that really depends. You’re talking about running a multi-GPU setup. If the whole model fits on the GPUs, then your processor will not be a bottleneck at all. The clock speed of the PCIe bus is independent of the CPU cores, unless you’re messing with overclocking. That’s why they advertise PCIe 3.0, 4.0, 5.0, etc.: the PCIe version dictates the bandwidth per lane.
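To put rough numbers on "bandwidth per lane", here is a small sketch that derives the approximate per-direction throughput from the published transfer rates (8, 16 and 32 GT/s for PCIe 3.0, 4.0 and 5.0) and the 128b/130b line encoding those generations use. The function names are made up for illustration, and real-world throughput will be somewhat lower because of protocol overhead.

```python
# Approximate PCIe bandwidth per direction, before protocol overhead.
# Transfer rates are in GT/s; PCIe 3.0 through 5.0 use 128b/130b encoding.
TRANSFER_RATE_GT_S = {"3.0": 8.0, "4.0": 16.0, "5.0": 32.0}
ENCODING_EFFICIENCY = 128 / 130


def lane_bandwidth_gb_s(version: str) -> float:
    """Approximate GB/s for a single lane of the given PCIe version."""
    # One transfer carries one bit per lane; divide by 8 to get bytes.
    return TRANSFER_RATE_GT_S[version] * ENCODING_EFFICIENCY / 8


def link_bandwidth_gb_s(version: str, lanes: int) -> float:
    """Approximate GB/s for a link with the given number of lanes."""
    return lane_bandwidth_gb_s(version) * lanes


if __name__ == "__main__":
    for version in TRANSFER_RATE_GT_S:
        for lanes in (4, 8, 16):
            bandwidth = link_bandwidth_gb_s(version, lanes)
            print(f"PCIe {version} x{lanes}: ~{bandwidth:.1f} GB/s")
```

So a PCIe 4.0 x8 slot gives about the same bandwidth as a PCIe 3.0 x16 slot, which matters when a budget board splits its lanes between several cards.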

That being said, multi-GPU setups do introduce some overhead. If a model is split between GPUs, the PCIe interface becomes a modest bottleneck as they pass data back and forth. The more GPUs the model is split across, the greater the bottleneck.
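A back-of-the-envelope sketch of why the overhead grows with GPU count, under some loud assumptions: the model is split layer-wise, so each generated token's hidden state crosses every GPU boundary once, and each hop pays a fixed latency on top of the wire time. The defaults (hidden size 8192, 2 bytes per value, 15.75 GB/s link, 10 µs per hop) are hypothetical placeholders, not measurements, so swap in your own.

```python
def per_token_transfer_us(
    n_gpus: int,
    hidden_size: int = 8192,       # assumed hidden dimension of the model
    bytes_per_value: int = 2,      # assumed fp16 activations
    link_gb_s: float = 15.75,      # assumed link speed (roughly PCIe 3.0 x16)
    hop_latency_us: float = 10.0,  # assumed fixed cost per GPU-to-GPU hop
) -> float:
    """Estimated microseconds per token spent moving activations between GPUs."""
    # A layer-wise split across n GPUs has n - 1 boundaries to cross.
    hops = max(n_gpus - 1, 0)
    payload_bytes = hidden_size * bytes_per_value
    wire_time_us = payload_bytes / (link_gb_s * 1e9) * 1e6
    return hops * (wire_time_us + hop_latency_us)


if __name__ == "__main__":
    for n_gpus in (1, 2, 4, 8):
        print(f"{n_gpus} GPU(s): ~{per_token_transfer_us(n_gpus):.1f} us per token")
```

Under these assumptions the cost scales linearly with the number of boundaries and stays small compared with the time a GPU spends computing a token, which fits the "modest bottleneck" description.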