Comment by int_19h
19 hours ago
"Forcing" here basically means giving initial instructions that clearly require passing the tests as a condition of finishing the work. The agent still works uninterrupted.