Comment by janalsncm
5 days ago
We typically would solve a lot of the same types of problems with RL today because it’s more efficient.
In EA if a candidate fails we throw it away. In RL we learn from that experience.
RL gets harder when rewards are really sparse. OpenAI developed evolution strategies which is a bit of a hybrid.
No comments yet
Contribute on Hacker News ↗