Comment by michaelmrose

1 day ago

What makes you say it has preferences without any meaningful persistent model of self or anything else?

The conversation chain can count as persistent, but this doesn't impact preference though. Give the model an ambiguous request, it's output will fill the gaps, if this is consistent enough, it can be regarded as its "preference".

  • It isn't a preference because it doesn't have them because it doesn't have a meaningful interior life that anyone has demonstrated.

    • If you ask it, (there is always some randomness to these models but removing all other variables) it consistently leans to one idea in it's output, that is its preference. It is learned during training. Speaking abstractly that is its latent internal viewpoint. It may be static, expressed in its model weights but it's there.