← Back to context

Comment by mohsen1

5 months ago

I wonder how many of the solutions that passes SWE-lancer evals would not be accepted by the poster due to low quality

I’ve been trying so many things to automate solving bugs and adding features 100% by AI and I have to admit it’s been a failure. Without someone that can read the code and fully understand the AI generated code and suggests improvements (SWE in the loop) AI code is mostly not good.