Slacker News Slacker News logo featuring a lazy sloth with a folded newspaper hat
  • top
  • new
  • show
  • ask
  • jobs
Library
← Back to context

Comment by mindwok

6 days ago

Not necessarily. If the RL objective is passing tests then in the context of LLMs it means "correct", or at least "correct based on the tests".

1 comment

mindwok

Reply

otabdeveloper4  6 days ago

Unfortunately that doesn't solve the problem in any way. We don't have an Oracle machine for testing software.

If we did, we could autogenerate code even without an LLM.

Slacker News

Product

  • API Reference
  • Hacker News RSS
  • Source on GitHub

Community

  • Support Ukraine
  • Equal Justice Initiative
  • GiveWell Charities