Comment by snqb 21 hours ago how well does it do on frontier models like Opus 4.6? 1 comment snqb Reply GodelNumbering 21 hours ago I have only done functionality testing, no benchmark testing on Opus (decided to pay my rent instead)
GodelNumbering 21 hours ago I have only done functionality testing, no benchmark testing on Opus (decided to pay my rent instead)
I have only done functionality testing, no benchmark testing on Opus (decided to pay my rent instead)