Comment by int_19h
18 hours ago
"Forcing" here basically means giving initial instructions that clearly require passing the tests as a condition of finishing the work. The agent still works uninterrupted.
18 hours ago
"Forcing" here basically means giving initial instructions that clearly require passing the tests as a condition of finishing the work. The agent still works uninterrupted.
No comments yet
Contribute on Hacker News ↗