Comment by tigranbs
2 months ago
Somehow it writes bad React code and misses to check linting prompts half the time. But surprisingly, the Python coding was great!
2 months ago
Somehow it writes bad React code and misses to check linting prompts half the time. But surprisingly, the Python coding was great!
SWE-bench is all python. Hope is not overly optimized for it.