Comment by szundi
9 hours ago
I don’t know what people are doing, but Minimax produced 16 bug reports, of which 15 were false positives (literally mistakes).
In contrast, ChatGPT 5.3 and Opus both have a 90% rate, at least on this same project (embedded).
All other tests were the same. What are you doing with these models?