Comment by irickt
4 months ago
HN as a huge RLHF data source for our behavior refinement. Yum!
(Reinforcement learning from human feedback)