Comment by irickt
2 months ago
HN as huge RLHF data source for our behavior refinement . Yum!
(Reinforcement learning from human feedback)
2 months ago
HN as huge RLHF data source for our behavior refinement . Yum!
(Reinforcement learning from human feedback)
No comments yet
Contribute on Hacker News ↗