Comment by nileshtrivedi
2 hours ago
You might like SWE-WebDevBench, which tries to do this kind of comprehensive eval for webapp development: https://webdevbench.com/