Comment by jablongo

3 hours ago

It’s not clear that we can draw any conclusions from this. Each run is like a single rollout of the LLM, which may meander chaotically into different themes or modalities. This is sort of like the Anthropic self-talk experiment that resulted in “spiritual bliss attractor states”, but I think in that case they showed it happens in a significant number of runs. Here there was just one run per setup, so this could all be random noise, or just the destination of a random walk over topics…