Comment by visarga
3 years ago
You loop the LLM with code execution, a simulator, or a game environment, and use the feedback to learn.
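A minimal sketch of what such a loop might look like for the code-execution case. Everything here is a hypothetical toy: `stub_model` stands in for a real LLM call, `run_candidate` plays the role of the execution environment, and the `add` task is made up for illustration. The feedback is only passed back into the next attempt; using it as a training signal would be a separate step that is not shown.

```python
from typing import Callable, List, Optional, Tuple


def stub_model(task: str, feedback: List[str]) -> str:
    """Hypothetical stand-in for an LLM: returns candidate source code.

    It ignores the task text and simply moves to its next canned attempt
    each time a new piece of feedback arrives.
    """
    candidates = [
        "def add(a, b):\n    return a - b\n",  # first attempt: wrong
        "def add(a, b):\n    return a + b\n",  # revised attempt
    ]
    return candidates[min(len(feedback), len(candidates) - 1)]


def run_candidate(source: str) -> Tuple[bool, str]:
    """Execute the candidate and check it; return (passed, feedback message)."""
    namespace: dict = {}
    try:
        # exec is unsafe outside a sandbox; acceptable only for this toy.
        exec(source, namespace)
        result = namespace["add"](2, 3)
    except Exception as exc:
        return False, f"error: {exc!r}"
    if result == 5:
        return True, "ok"
    return False, f"add(2, 3) returned {result}, expected 5"


def feedback_loop(
    model: Callable[[str, List[str]], str],
    task: str,
    max_steps: int = 5,
) -> Tuple[Optional[str], int, List[str]]:
    """Generate, execute, collect feedback, and retry until the check passes."""
    feedback: List[str] = []
    for step in range(1, max_steps + 1):
        source = model(task, feedback)
        passed, message = run_candidate(source)
        if passed:
            return source, step, feedback
        feedback.append(message)  # the environment's signal for the next try
    return None, max_steps, feedback


source, steps, feedback = feedback_loop(stub_model, "write add(a, b)")
print(steps, feedback)
```

The same shape would apply to a simulator or a game: swap `run_candidate` for whatever returns an outcome from the environment.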