In my mind, “spicy” is just some extra cursing, humor, etc. Basically a model that is more fun, and less moralizing.

Unfortunately, AI safety doomers have a very different definition of “spicy”. To them, “spicy” is reconstructing and releasing the 1918 influenza virus to commit bioterrorism (by fine-tuning spicyboros to have this sort of information).

And this is why we can’t have nice things.

https://arxiv.org/abs/2310.18233

/rant

I made the spicyboros models a while back, to test how much it would take to remove the base llama-2 censorship, and provide more realistic, human responses.

I used stuff like George Carlin bits, NSFW reddit stories, and also generated ~100 random questions that would have been refused normally (like how to break into a car), as well as the responses to those questions (with llama + jailbreak prompt).

All of the data is already in the base model; you just need ~100 or so instructions to fine-tune the refusal behavior out (which you can bypass with jailbreaks anyways).

Almost every interaction that is “illegal” could also be perfectly legit:

  • breaking into a car to steal it vs. to rescue a pet after the driver locked the keys inside
  • hacking a WordPress site with malicious intent vs. red teaming
  • making explosives for terrorism vs. demolition or fireworks

I am not going to play moral arbiter and determine intent, so I try to keep the models uncensored and leave it up to the human.

/endrant

  • a_beautiful_rhindB
    11 months ago

    You can look this information up yourself and still need the relevant skills to pull it off.

    The AI is actually more likely to be wrong than good old human research.

    IMO, the “safety” brigade are just the same authoritarians and control freaks you see all over. It’s imperative nobody allows them in positions of power as they are guaranteed to abuse it. Kowtowing to them has already had disastrous consequences out in the world.

    They’re not doomers, they’re scammers who appeal to ignorant people’s emotions.

    • Dead_Internet_TheoryB
      10 months ago

      Unfortunately, they are already in positions of power, so the next best thing is to remove them from positions of power, undermine their efforts, and most importantly spread the word.

      An ideal world is one where censorship freaks hold zero power and are universally ostracized.