Slacker News

Comment by Workaccount2

6 days ago

I would fully expect that every IMO participant grinds IMO problems for months before the competition.

I don't know why people treat training a model on similar material as a negation of its ability.

1 comment

meroes  5 days ago

It shows that models need RL for any new domain or level of expertise, which is contrary to what marketers claim about LLMs and their potential for AGI.
