Just read about this project on Twitter and it sounds really interesting:
https://github.com/mozilla-Ocho/llamafile
What do guys think, might be even more simple than Ollama?
Just read about this project on Twitter and it sounds really interesting:
https://github.com/mozilla-Ocho/llamafile
What do guys think, might be even more simple than Ollama?
I find it very strange attaching the gguf file to an exe - it’s a very bad security idea (your antivirus needs to hash 10 GB file) and then on windows you still need to split it to exe and data, because the exe limit is 4GB so basically instead of llama.cpp you are now using llamafile that is llamacpp. Or am I missing something?