AutomataManifold@alien.topB to LocalLLaMA · English · 3 years ago

GitHub - S-LoRA/S-LoRA: S-LoRA: Serving Thousands of Concurrent LoRA Adapters



dreamingleo12@alien.topB
3 years ago
I’m wondering, though: from an engineering perspective, when traffic is high, wouldn’t this cause a lot of adapter weight switching? Throughput would basically be limited by host-to-device bandwidth.
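To put a rough number on that concern, here is a back-of-envelope sketch in Python. Everything in it is an assumption for illustration, not something taken from the S-LoRA repo: a 7B-class model (hidden size 4096, 32 layers), rank-16 adapters on four square attention projections per layer, fp16 weights, and about 25 GB/s of effective host-to-device bandwidth. The function names are made up for this example.

```python
def lora_adapter_bytes(hidden_dim, num_layers, rank,
                       matrices_per_layer=4, bytes_per_param=2):
    """Approximate size of one LoRA adapter in bytes.

    Assumes every adapted matrix is square (hidden_dim x hidden_dim), so each
    one contributes an A matrix (rank x hidden_dim) and a B matrix
    (hidden_dim x rank), i.e. 2 * rank * hidden_dim parameters.
    """
    params_per_matrix = 2 * rank * hidden_dim
    return params_per_matrix * matrices_per_layer * num_layers * bytes_per_param


def swap_seconds(adapter_bytes, bandwidth_bytes_per_s):
    """Time to copy one adapter from host to device at a given bandwidth."""
    return adapter_bytes / bandwidth_bytes_per_s


# Assumed configuration (hypothetical, for illustration only).
size = lora_adapter_bytes(hidden_dim=4096, num_layers=32, rank=16)
t = swap_seconds(size, bandwidth_bytes_per_s=25e9)  # assumed effective bandwidth

print(f"adapter size: {size / 2**20:.0f} MiB")
print(f"one swap:     {t * 1e3:.2f} ms")
print(f"swaps/sec:    {1 / t:.0f}")
```

Under these assumptions a single adapter is about 32 MiB and one swap takes a bit over a millisecond, so whether the bus becomes the bottleneck depends on how often a request needs an adapter that is not already resident on the GPU, and on how much larger the adapters are than this example.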