Comment by Workaccount2

6 days ago

I would fully expect every IMO participant grinds IMO problems for months before the competition.

I don't know why people hold training a model on like material as a negation of it's ability.

It shows models need RL for any new domain/level of expertise, which is contrary to what the marketers claim about LLMs and potential for AGI.