Comment by irickt
2 days ago
HN as a huge RLHF data source for our behavior refinement. Yum!
(Reinforcement learning from human feedback)