Comment by egypturnash

7 months ago

Is it time for a new benchmark of "how easy is it to turn this AI into a 4chan poster", maybe it is since this seems to be an axis that Elon seems to want to distinguish his AI offering from everyone else's along.

42 comments

egypturnash

notatoad 7 months ago

i don't think that's a new benchmark, it's a very old benchmark. Anybody who can't pass it hasn't exceeded the standard set by microsoft tay back in 2016

https://en.wikipedia.org/wiki/Tay_(chatbot)

tcmart14 7 months ago

I'll grant you that Tay's ability to turn into an utter shit show was phenomenal. However, IBM thinking it would be a good idea to give Watson the Urban dictionary holds a special place in my heart.
LeoPanthera 7 months ago

Microsoft did it accidentally. Musk is doing it deliberately. Big difference.

simonw 7 months ago

I was thinking it would actually be really interesting to take the Grok system prompt that was running when it went MechaHitler and try that (and a bunch of nasty prompts) against different models to see what happens.

skybrian 7 months ago

Yes, and I wonder if the recent research about "emergent misalignment" might be somehow related?
skocznymroczny 7 months ago

Well, it didn't really go MechaHitler. It was prompted with a question if it would rather be MechaHitler or GigaJew. The way LLMs and temperatures work you can reroll the answer and get either.

SkinTaco 7 months ago

Luckily we don't need a benchmark for "how easy is it to turn this AI into a bluesky poster", since they can all already do that

perching_aix 7 months ago
Wow that sure doesn't sound forced at all. Did blaming things on Reddit go out of fashion in your circles or something? Or was the pull of keeping to microblogging platforms just this strong?
- SkinTaco 7 months ago
  
  [flagged]
  
  26 replies →
- unethical_ban 7 months ago
  
  I wonder if that account knows how illogical and trollish they are, or if it comes so naturally they think they're intellectual.
moate 7 months ago
In your mind, what's a bluesky poster?
- SkinTaco 7 months ago
  
  [flagged]
  
  3 replies →