Comment by whinvik
7 hours ago
When we were trying to build our own agents we put quite a bit of effort on evals which was useful.
But switching over to using coding agents we never did the same. Feels like building an eval set will be an important part of what engg orgs do going forward.
No comments yet
Contribute on Hacker News ↗