Comment by timr
17 days ago
You've either missed the point of what I wrote, or you're arguing with someone else.
I'm talking about the difference between epsilon-greedy vs. a more complex optimization scheme within the context of implementing MAB. You're making arguments about A/B testing vs MAB.
One of us definitely misread something.
Let me spell out what I wrote for you:
* In most systems, the cost of implementing MAB >> AB
* The cost if implementing a more complex optimization scheme is only marginally higher than the cost of MAB
Therefore, if you're already going up to MAB, you might as well go all the way.