← Back to context

Comment by simonw

7 hours ago

I wonder why these Anthropic researchers chose GPT-4o for their study.

This is really strange and warrants some skepticism

  • Anthropic paid a team to do a project, and gave them leeway to do it how they wanted. If anything, it's a good signal that Anthropic didn't lean on the scale to have the results go in their favor.

    • Isn’t it technically in their favor if competition is proven bad, even if it would be equally easy to prove their product likely equally bad or even worse?