piperswe 2 months ago
How much of that is because the models are optimizing specifically for SWE-bench?

icpmacdo 2 months ago
Not that much, because it's getting better at all benchmarks.