Comment by leerob
4 hours ago
(I work at Cursor) CursorBench includes many evals from actual engineering tasks from the Cursor team, which include our private codebase. This codebase is held-out from training so models haven't seen it, including Composer.
No comments yet
Contribute on Hacker News ↗