Comment by nirvdrum
9 hours ago
For anyone else unfamiliar with the term:
RLHF = Reinforcement Learning from Human Feedback
https://en.wikipedia.org/wiki/Reinforcement_learning_from_hu...