Comment by Workaccount2
6 days ago
I would fully expect every IMO participant grinds IMO problems for months before the competition.
I don't know why people hold training a model on like material as a negation of its ability.
6 days ago
It shows models need RL for any new domain/level of expertise, which is contrary to what the marketers claim about LLMs and potential for AGI.