Comment by blobbers
4 days ago
Is this something we could build into post-training?
Some kind of RL portion of the code that reinforces de-escalation and the dangers of war: nuclear destruction of both AI and humankind, and radiation and the danger it poses to microchips and the atmosphere, including bit flips (just so the AI doesn't get cocky!)