← Back to context Comment by Jensson 12 hours ago That doesn't test whether the model can follow and execute a dynamic plan reliably. 0 comments Jensson Reply No comments yet Contribute on Hacker News ↗
No comments yet
Contribute on Hacker News ↗