Comment by mmmore
2 months ago
Sonnet with extended thinking solved it after 30s for me:
https://claude.ai/share/b974bd96-91f4-4d92-9aa8-7bad964e9c5a
Normal Opus solved it:
https://claude.ai/share/a1845cc3-bb5f-4875-b78b-ee7440dbf764
Opus with extended thinking solved it after 7s:
https://claude.ai/share/0cf567ab-9648-4c3a-abd0-3257ed4fbf59
Though it's a weird puzzle to use a benchmark because the answer is so formulaic.
It is formulaic which is why it surprised me that Sonnet failed it. I don't have access to the other models so I'll stick with Gemini for now.