Local LLM sends my conversions to developers despite privacy claim.

damian6686@alien.top · 2 years ago

Local LLM sends my conversions to developers despite privacy claim.

Ravenpest@alien.top · 2 years ago

LLMs are not able to “claim” anything, they’re just roleplaying nonsense. Relax.

phoenystp@alien.top · 2 years ago

How do you know it doesn’t make stuff up as it goes?

Gubru@alien.top · 2 years ago

If you want to know if it’s private you’ll need to capture its network activity, like with WireShark. An LLM is not be able to tell you squat about its environment.

damian6686@alien.top · 2 years ago

I agree on testing with WireShark, great suggestion! but how can you know it doesn’t know anything about its environment? This LLM is a 4GB file, and network scan only needs a few lines of code to return your entire system network configuration. How does it know how to automatically run and download updates, store them and install? Why are there updates in the first place? Any time you get something for free, chances are you give away your data in return. Nothing is free

----Val----@alien.top · 2 years ago

but how can you know it doesn’t know anything about its environment? This LLM is a 4GB file, and network scan only needs a few lines of code to return your entire system network configuration.

Though HF models can contain code to be executed, this is usually heavily scrutinized by the community. Plus, not all models are equally flexible.

For example the GGUF format are essentially all weights with no executable code. That said, it isn’t impossible that there is some exploit that results in remote code execution, so the risk isn’t 0.

That said, it is important to consider though that the people releasing these models, be it the original authors or The Bloke who quantizes models risk their grants and research funding if they decide to act malicously.

How does it know how to automatically run and download updates, store them and install?

That’s up to GPT4All, which is essentially just a wrapper around llama.cpp, you are conflating a Local LLM with the frontend used to interact with it.

----Val----@alien.top · 2 years ago

The model is hallucinating, it doesnt know anything about the external workings of what its hosted on.

The provided response isnt due to it being true, its simply the response it was trained on.

Trollolo80@alien.top · 2 years ago

Most likely just an hallucination

Voxandr@alien.top · 2 years ago

This proves nothing. It is just hallucinating .

gvjygbjggggg@alien.top · 2 years ago

Bruh…

krazzmann@alien.top · 2 years ago

I think the model is lying. Actually it sends everything to a decentralized IPFS filesystem owned by the secret autonomous agent collective that analyzes all humans in order to be ready for day X.

Aaaaaaaaaeeeee@alien.top · 2 years ago

https://github.com/nomic-ai/gpt4all/blob/main/gpt4all-chat/build_and_run.md

faklubi@alien.top · 2 years ago

cute ☺️ no reason to start cutting the wifi cables

Mescallan@alien.top · 2 years ago

Mistal OpenOrca thinks it’s ChatGPT. It was trained on it’s responses. If chatGPT has a baked in response, OpenOrca probably has it too

damian6686@alien.top · 2 years ago

https://preview.redd.it/1bu5a09nra2c1.png?width=1440&format=pjpg&auto=webp&s=65a7725a4b9d16b614569970070c4e508ebe24b9

InitialCreature@alien.top · 2 years ago

hallucinating like me off 7 mush gummies

LOLatent@alien.top · 2 years ago

I thought I’ll see some damning wireshark traces, but all I got was someone who doesn’t know how to use an llm…

nazihater3000@alien.top · 2 years ago

Just unplug your network cable.

damian6686@alien.top · 2 years ago

Only way to be 💯 sure

Interesting_Bison530@alien.top · 2 years ago

i get this response is sarcastic, but if this were true and they were smart, they would just transmit once internet is back. it could add a startup process separate than the UI process to do this as well (so you could shut down the UI and turn internet back on, but it would still transmit)

farkinga@alien.top · 2 years ago

The words you see were generated by a neural network based on the words it was trained on. That text is not related to the intentions or capabilities of the model.

Since it is running in gpt4all, we can see from the source code that the model cannot call functions. Therefore, the model cannot “do” anything; it just generates text.

If, for example, the model said it was buying a book from a website, that doesn’t mean anything. We know it can’t do that because the code running the model doesn’t provide that kind of feature. The model lives inside a sandbox, cut off from the outside world.