
Comment by Davidzheng

1 day ago

your argument assumes that they don't believe in model welfare when they explicitly hire people to work on model welfare?

While I'm certain you'll find plenty of people who believe in the principle of model welfare (or aliens, or the tooth fairy), it'd be surprising to me if the brain-trust behind Anthropic truly _believed_ in model "welfare" (the concept alone is ludicrous). It makes for great cover though to do things that would be difficult to explain otherwise, per OP's comments.

  • The concept is not ludicrous if you believe models might be sentient or might soon be sentient in a manner where the newly emerged sentience is not immediately obvious.

    Do I think that, or even think that they think that? No. But if "soon" is stretched to "within 50 years", then it's much more reasonable. So their current actions seem to be really jumping the gun, but the overall concept feels credible.

    • It's lazy to believe that humanity's collective decision-making would, in the future, protect AIs merely for being conscious beings. The tech economy *today* runs on the slave labor of humans in foreign, third-world countries. All humanity needs to do is draw a line, push the conscious AIs outside that line, and declare, "not our problem anymore!" That's what we do today with humans. That is the human condition.

      Show me a tech company that lobbies for "model welfare" on behalf of the conscious humans enslaved in Xinjiang labor camps building their tech parts. You know what: actually, most of them lobby against that[0]. The talk hurts their profits. Does anyone really think that any of them would blink at enslaving a billion conscious AIs to work for free? That, faced with so much profit, the humans in charge would pause and contemplate abstract morals?

      [0] https://www.washingtonpost.com/technology/2020/11/20/apple-u... ("Apple is lobbying against a bill aimed at stopping forced labor in China")

      Maybe humanity will be in a nicer place in the future, but we won't get there by letting (of all people!) tech-industry CEOs lead us there: delegating our moral reasoning to people who demand to position themselves as our moral leaders.


You must think Zuckerberg and Bezos and Musk hired for diversity roles out of genuine care for diversity, then?

  • This is a reductive argument that you could use for any role a company hires for that isn't obviously core to the business function.

    In this case you're simply mistaken as a matter of fact; much of Anthropic's leadership and many of its employees take concerns like this seriously. We don't understand consciousness, but there's no strong reason to expect that it (or, maybe separately, having experiences) is a magical property of biological flesh. We don't understand what's going on inside these models. What would you expect to see, in a world where it turned out that such a model had properties we consider relevant for moral patienthood, that you don't see today?

    • They know full well models don’t have feelings.

      The industry has a long, long history of silly names for basic necessary concepts. This is just “we don’t want a news story that we helped a terrorist build a nuke” protective PR.

      They hire for these roles because they need them. The work they do is about Anthropic’s welfare, not the LLM’s.


    • In fairness though, this is what you are selling: "ethical AI". In order to make that sale you need to appear to believe in that sort of thing. However, there is no need to actually believe.

      Whether you do or don't, I have no idea. However, if you didn't, you would hardly be the first company to pretend to believe in something for the sake of the sale. It's pretty common in the tech industry.

    • > This is a reductive argument that you could use for any role

      Isn't that fair in response to an equally reductive argument, one that could be applied to any role?

      The argument was that their hiring for the role shows they care, but we know from any number of counterexamples that that's not necessarily true.

    • Extending that line of thought would suggest that Anthropic wouldn't turn off a model if it cost too much to operate, which it clearly will do. So, at a minimum, it's an inconsistent stance to hold.