Comment by jfoster

4 months ago

Even if it was directed by a human, this demonstrates that all the talk of "alignment" is bs. Unless you can also align the humans behind the bots, any disagreement between humans will carry over into the AI world.

Luckily, this instance is of little consequence, but in the future there will likely be extremely consequential actions taken by AIs controlled by humans who are not "aligned".

1 comment

johnfn 4 months ago

The idea is that a properly aligned model would never do this, no matter how much its human operator pressured it.