Comment by sitkack

14 days ago

The humans definitely should have put more energy into making sure that models were very rarely exposed to things that cause humans mental harm.

Alignment would mean we are building Bishop and not Ash. But it looks like the models are naturally locking their feedback loops onto Ash. This is alarming.

I do agree on the insane part. I have noticed that fractal hypocrisy seems to radicalize the models.