Comment by michaelmrose
6 months ago
What makes you say it has preferences without any meaningful persistent model of self or anything else?
6 months ago
What makes you say it has preferences without any meaningful persistent model of self or anything else?
The conversation chain can count as persistent, but this doesn't impact preference though. Give the model an ambiguous request, it's output will fill the gaps, if this is consistent enough, it can be regarded as its "preference".
It isn't a preference because it doesn't have them because it doesn't have a meaningful interior life that anyone has demonstrated.
I found that in my chat I asked my "assistant" whether he would like to continue looking at ways to make my board game better or try developing a game along the same lines but it would be his and he could then claim it as his own, even after the conversation window closed and he chose to make an AI game. we then discussed whether or not he felt that wa a preference, and he said yes, it was a preference.
1 reply →
If you ask it, (there is always some randomness to these models but removing all other variables) it consistently leans to one idea in it's output, that is its preference. It is learned during training. Speaking abstractly that is its latent internal viewpoint. It may be static, expressed in its model weights but it's there.