Slacker News

Comment by ethbr1

18 days ago

Isn't that just RL with extra power-intensive steps? (An entire model chugging away in the goal function)

2 comments

hrn_frs  17 days ago

That's correct, but if successful you'd essentially have updated the LLM's knowledge and capabilities "on the fly".

  • ethbr1  17 days ago

    Maybe we could run off-peak load of that nature, when power is cheaper. Call it dreaming. ;)
