← Back to context Comment by burkaman 6 hours ago It's actually 80% against Opus, 66% average against the 5 models it's tested with. 0 comments burkaman Reply No comments yet Contribute on Hacker News ↗
No comments yet
Contribute on Hacker News ↗