Comment by tomp
13 years ago
As far as I understand it, one advantage of the epsilon-greedy algorithm is that it will relearn the best choice if that choice changes over time. You could do the same with a logarithmic-regret algorithm, but it would take longer to relearn.
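To make the relearning claim concrete, here is a minimal sketch of epsilon-greedy on a two-armed bandit whose best arm flips partway through the run. Everything in it is an assumption chosen for illustration, not something from the original discussion: the function names `epsilon_greedy_switch` and `fraction_of_arm`, the payoff probabilities (0.9 and 0.1), the switch point, and epsilon = 0.1. It only covers the epsilon-greedy half of the comparison and does not try to measure how a logarithmic-regret algorithm would fare.

```python
import random


def epsilon_greedy_switch(steps=6000, switch_at=1000, epsilon=0.1, seed=0):
    """Run epsilon-greedy on a two-armed Bernoulli bandit whose best arm
    flips at step `switch_at`. Returns the arm chosen at each step.

    All parameters are illustrative assumptions.
    """
    rng = random.Random(seed)
    counts = [0, 0]        # number of pulls per arm
    values = [0.0, 0.0]    # sample-average reward estimate per arm
    choices = []

    for t in range(steps):
        # Arm 0 pays off more often before the switch, arm 1 after it.
        probs = [0.9, 0.1] if t < switch_at else [0.1, 0.9]

        if rng.random() < epsilon:
            arm = rng.randrange(2)                     # explore: uniform random arm
        else:
            arm = 0 if values[0] >= values[1] else 1   # exploit: current best estimate

        reward = 1.0 if rng.random() < probs[arm] else 0.0

        # Incremental sample-average update of the chosen arm's estimate.
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]
        choices.append(arm)

    return choices


def fraction_of_arm(choices, arm, start, end):
    """Fraction of steps in choices[start:end] on which `arm` was chosen."""
    window = choices[start:end]
    return window.count(arm) / len(window)


if __name__ == "__main__":
    choices = epsilon_greedy_switch()
    print("arm 0 share, steps 500-1000: ", fraction_of_arm(choices, 0, 500, 1000))
    print("arm 1 share, steps 1000-2000:", fraction_of_arm(choices, 1, 1000, 2000))
    print("arm 1 share, steps 5000-6000:", fraction_of_arm(choices, 1, 5000, 6000))
```

Because exploration never stops, the estimate for the formerly worse arm keeps getting fresh samples, so the greedy choice eventually flips. With plain sample averages the old estimates fade slowly; replacing `1 / counts[arm]` with a constant step size would weight recent rewards more heavily and typically adapt faster.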