Comment by highfrequency
3 days ago
Great respect for Ilya, but I don’t see an explicit argument why scaling RL in tons of domains wouldn’t work.
3 days ago
Great respect for Ilya, but I don’t see an explicit argument why scaling RL in tons of domains wouldn’t work.
I think that scaling RL for all common domains is already done to death by big labs.
Not sure why they care about his opinion and discard yours.
They’re just as valid and well informed.
doesnt RL by definition not generalize? thats Ilya's entire criticism of the current paradigm