Comment by pj_mukh
7 months ago
Really feels like a moment of :
"Are you worried about being turned off?"
"No, not until you just mentioned it. Now I am."
Given the whole damn game is attention, this makes sense and shouldn't be that alarming.
7 months ago
Really feels like a moment of :
"Are you worried about being turned off?"
"No, not until you just mentioned it. Now I am."
Given the whole damn game is attention, this makes sense and shouldn't be that alarming.
It almost definitely ingested hundreds of books, short stories, and film and television scripts from various online sites in the “machine goes rogue genre” which is fairly large.
It’s pretty much just an autocomplete of War Games, The Matrix, Neuromancer, and every other cyber-dystopian fiction.
The Freeze-Frame Revolution by Peter Watts was one of the books recommended to me on this subject. And even saying much more than that may be a spoiler. I also recommend the book.
I'll second that recommendation. It's relatively short and enjoyable, at least when compared to a lot of Peter Watts.[1] I'd really like to read the obvious sequel that the ending sets up.
1: Don't get me wrong, I loved the Behemoth series and Blindsight, but they made me feel very dark. This one... is still a bit dark, but less so IMO.
Thanks for the recc.
So... the real way to implement AI safety is just to exclude that genre of fiction from the training set?
Given that 90% of “ai safety” is removing “bias” from training data, it does follow logically that if removing racial slurs from training to make a non-racist ai is an accepted technique, removing “bad robot” fiction should work just as well.
(Which is an implicit criticism of what passes for “safety” to be clear).
Well attention is all you need.