Comment by MattRix

17 days ago

If you don’t think the quality has improved then you haven’t actually been trying it. Any programmer who knows what they’re doing can immediately tell models like Opus 4.6 and Codex 5.3 are much better than models from a year ago. All the objective metrics (benchmarks etc) agree as well.