Following the release of the Dimensity 9300 and S8G3 phones, I expect LLMs running on mobile phones to grow in popularity, since quantized 3B or 7B models can already run on high-end phones from the last five years. But even though it's possible, there are a few concerns, including power consumption and storage size. I've seen posts about people successfully running LLMs on mobile devices, but I seldom see discussion of future trends. What are your thoughts?
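For what it's worth, here is a rough back-of-envelope on the storage concern (a sketch only, assuming 4-bit weights plus about 10% overhead; actual sizes vary by quantization format):

    # Rough on-disk size of a quantized model (illustrative assumptions, not measurements)
    def quantized_size_gb(params_billion, bits_per_weight=4, overhead=1.1):
        total_bytes = params_billion * 1e9 * bits_per_weight / 8 * overhead
        return total_bytes / 1e9

    print(quantized_size_gb(3))  # ~1.65 GB for a 3B model
    print(quantized_size_gb(7))  # ~3.85 GB for a 7B model

So a 7B model takes a noticeable chunk of a phone's storage before you even count the RAM needed for the KV cache.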
Latency is one consideration with going over the internet.
Any model that can run locally doesn't need a round trip to a datacenter. That advantage of course depends on the available compute power.
At current capabilities it's faster to query a server on the opposite hemisphere than to generate locally.
The round-trip latency of an HTTP request (or gRPC, or whatever, pick your poison) is utterly insignificant compared to the time it takes to run inference, even for the smallest models with the fastest inference.
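To put rough numbers on that (illustrative figures only; throughput and RTT vary widely by device, model, and network):

    # Compare end-to-end time for one response: local generation vs. remote server + round trip
    response_tokens = 200
    local_tok_per_s = 8.0    # assumed on-device speed for a small quantized model
    server_tok_per_s = 80.0  # assumed datacenter GPU speed
    rtt_s = 0.2              # assumed round trip to a server on the other side of the world

    local_time = response_tokens / local_tok_per_s            # 25.0 s
    remote_time = rtt_s + response_tokens / server_tok_per_s  # 2.7 s
    print(f"local: {local_time:.1f}s, remote: {remote_time:.1f}s")

With those assumptions the round trip is a rounding error next to generation time, which is the point above.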