Comment by tosh

5 hours ago

This might also hint at SWE struggling to capture what “being good at coding” means.

Evals are hard.

> This might also hint at SWE struggling to capture what “being good at coding” means.

My take would be that coding itself is hard, but I'm a software engineer myself so I'm biased.