Comment by nileshtrivedi
3 hours ago
You might like SWE-WebDevBench which tries to do this comprehensive evals for webapp development. https://webdevbench.com/
3 hours ago
You might like SWE-WebDevBench which tries to do this comprehensive evals for webapp development. https://webdevbench.com/
No comments yet
Contribute on Hacker News ↗