Comment by tbrownaw

7 months ago

So... the real way to implement AI safety is just to exclude that genre of fiction from the training set?

Given that 90% of “AI safety” is removing “bias” from training data, it does follow logically: if removing racial slurs from the training data to make a non-racist AI is an accepted technique, removing “bad robot” fiction should work just as well.

(Which is an implicit criticism of what passes for “safety”, to be clear.)