Comment by satvikpendem
2 hours ago
RL on users' harness inputs and outputs is one of the primary drivers of improved model performance, a self-perpetuating flywheel.