I have an 8GB M1 MacBook Air and a 16GB MBP (that I haven’t turned in for repair yet) that I’d like to run an LLM on, to ask questions and get answers from the notes in my Obsidian vault (100s of markdown files). I’ve been lurking this subreddit, but I’m not sure whether LLMs under 7B that fit in 1-4GB of RAM would be too low quality for this.
Quick answer: No.
Longer answer: It depends. Passing the whole vault in as context won’t work; hundreds of files is far more data than a context window can hold, among other problems. What you could do instead is index the notes in a database and use a model that turns your question into a SQL query, then either show the results directly or have a second model (a quantized 7B) interpret them.
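For the SQL route, here’s a rough sketch of how that pipeline could look, assuming llama-cpp-python and SQLite FTS5 for the index. The model file, paths, and prompts are all placeholders, not a tested setup:

```python
# Hypothetical sketch: index an Obsidian vault into SQLite FTS5, have a local
# model turn a question into a search term, then answer from the hits.
import glob
import sqlite3

from llama_cpp import Llama  # pip install llama-cpp-python

# Index the vault's markdown files into a full-text search table.
db = sqlite3.connect("vault.db")
db.execute("CREATE VIRTUAL TABLE IF NOT EXISTS notes USING fts5(path, body)")
for path in glob.glob("vault/**/*.md", recursive=True):
    with open(path, encoding="utf-8") as f:
        db.execute("INSERT INTO notes (path, body) VALUES (?, ?)", (path, f.read()))
db.commit()

# Any small quantized model that fits in a few GB of RAM; file name is a placeholder.
llm = Llama(model_path="mistral-7b-instruct.Q4_K_M.gguf", n_ctx=2048, verbose=False)

question = "What did I write about spaced repetition?"

# Step 1: have the model turn the question into an FTS5 MATCH term.
sql_prompt = (
    "Convert this question into a single FTS5 MATCH search term for a table "
    f"notes(path, body). Reply with the term only.\nQuestion: {question}\nTerm:"
)
term = llm(sql_prompt, max_tokens=16, stop=["\n"])["choices"][0]["text"].strip()

# Step 2: query the index and feed the top snippets back for interpretation.
rows = db.execute(
    "SELECT path, snippet(notes, 1, '', '', '...', 32) "
    "FROM notes WHERE notes MATCH ? LIMIT 3",
    (term,),
).fetchall()
context = "\n".join(f"{p}: {s}" for p, s in rows)
answer_prompt = (
    f"Notes:\n{context}\n\nAnswer using only the notes above.\nQ: {question}\nA:"
)
print(llm(answer_prompt, max_tokens=256)["choices"][0]["text"].strip())
```

In practice you’d want to sanitize the generated search term (the model can emit invalid FTS5 syntax), but it shows the two-step shape: one pass to build the query, one to interpret the results.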
But generally, I see the idea of the ‘AI assistant’ come up here regularly, and the real question is whether you want to rely on an LLM that just ‘makes things up’ when accessing your notes. I guess that depends on how important the subject is.