Comment by blazespin
3 hours ago
if you read the paper that is the intention, to guide stuff like lean.
i don't think llm is a great pure rlvr
3 hours ago
if you read the paper that is the intention, to guide stuff like lean.
i don't think llm is a great pure rlvr
No comments yet
Contribute on Hacker News ↗