Comment by piperswe 9 months ago How much of that is because the models are optimizing specifically for SWE bench? 2 comments piperswe Reply icpmacdo 9 months ago not that much because its getting better at all benchmarks
not that much because its getting better at all benchmarks