Comment by blazespin
2 months ago
if you read the paper that is the intention, to guide stuff like lean.
i don't think llm is a great pure rlvr
2 months ago
if you read the paper that is the intention, to guide stuff like lean.
i don't think llm is a great pure rlvr
No comments yet
Contribute on Hacker News ↗