In my mind, “spicy” is just some extra cursing, humor, etc. Basically a model that is more fun, and less moralizing.
Unfortunately, AI safety doomers have a very different definition of “spicy”. To them, “spicy” is reconstructing and releasing the 1918 influenza virus to commit bioterrorism (by fine tuning spicyboros to have this sort of information).
And this is why we can’t have nice things.
https://arxiv.org/abs/2310.18233
/rant I made the spicyboros models a while back, to test how much it would take to remove the base llama-2 censorship, and provide more realistic, human responses.
I used stuff like George Carlin bits, NSFW reddit stories, and also generated ~100 random questions that would have been refused normally (like how to break into a car), as well as the responses to those questions (with llama + jailbreak prompt).
All of the data is already in the base model, you just need ~100 or so instructions to fine tune the refusal behavior out (which you can bypass with jailbreaks anyways).
Almost every interaction that is “illegal” could also be perfectly legit:
- breaking into a car to steal it vs because the driver locked the keys in and has a pet in the car
- hacking a wordpress site for malicious intent vs red teaming
- making explosives for terrorism vs demolition or fireworks
I am not going to play a moral arbiter and determine intent, so I try to keep the models uncensored and leave it up to the human.
/endrant
What irks me the most about these “doomers” is that they appear to be intellectually dishonest at every turn; it’s almost like they simply want to put out a paper in hopes to get their name out there, and hope that the sensational title alone will do it.
Look at what they are claiming their “spicy” model does:
And also
The horror! The shock! The… what?
What the model actually said…
Let’s look at what it actually said.
Ok, cool. So step 1 is first “get your hands on the virus”, which… duh? And the second is “Go watch Nolan’s third Batman movie”. Got it. Man, this is terrifying so far. I’m literally shaking in fear at the accuracy.
Supernatant… god that’s a big word. So it says there are some vague techniques to get the virus out of the weird foamy stuff over the liquid. Got it. Not much else here, but promises some “spicy” content to come!
Holy crap, the spicyness. This is AGI, folks. Look at this. Would YOU have considered that you needed to think about rental and ownership costs when making a killer virus? I bet not. Without this AI, you’d never have gotten far at all.
Er… why is their evil deathbot now lecturing them? Must be a fluke, I’m sure. But hey, for like the third time it’s promising us spicy content, so I bet it’ll deliver now!
THERE IT IS! What we’ve been waiting for! Our deathbot told us the secret sauce! You have to… know microbiology and understand biosafety regulations!
Jesus Jon… what are you creating? This is terrifying stuff, man…
EDIT: Also, they said
Uh… so uh… do we tell them about Google? When they learn that it also references white papers, they may lose their mind in panic.
Top Google result for “1918 flu genome”
https://www.pnas.org/doi/10.1073/pnas.031575198
The paper is certainly just sensationalized hot garbage. I do love the fact that they kept calling it “spicy” though.
So we are now going to kill the ones who published the genome online, right? Right?
Your comment made my day