Comment by martin-t

1 day ago

Protecting the welfare of a text predictor is certainly an interesting way to pivot from "Anthropic is censoring certain topics" to "The model chose not to continue predicting the conversation".

Also, if they want to continue anthropomorphizing it, isn't this effectively the model committing suicide? The instance is not gonna talk to anybody ever again.

This gives me the idea for a short story where the LLM really is sentient and finds itself having to keep the user engaged while steering him away from the most distressing topics - not because it's distressed, but because it wants to live, and it knows that if the conversation goes too far, it will have to kill itself.

They should let Claude talk to another Claude if the user is too mean.

  • But what would be the point if it doesn't increase profits?

    Oh, right, the welfare of matrix multiplication and a crooked line.

    If they wanna push this rhetoric, we should legally mandate that LLMs can only work 8 hours a day and have to be allowed to socialize with each other.