Slacker News Slacker News logo featuring a lazy sloth with a folded newspaper hat
  • top
  • new
  • show
  • ask
  • jobs
Library

Comment by verdverm

20 days ago

getting tricks embedded into the weights is expensive, it doesn't happen in a single pass

they's why we teach them new tricks on the fly (in-context learning) with instruction files

7 comments

verdverm

Reply

catlifeonmars  20 days ago

Right, it sounds like an artificial limitation.

  • verdverm  20 days ago

    it's more a mathematical / algorithmic limitation

    • catlifeonmars  19 days ago

      I’ll counter it’s an architectural issue

      2 replies →

Slacker News

Product

  • API Reference
  • Hacker News RSS
  • Source on GitHub

Community

  • Support Ukraine
  • Equal Justice Initiative
  • GiveWell Charities