Slacker News Slacker News logo featuring a lazy sloth with a folded newspaper hat
  • top
  • new
  • show
  • ask
  • jobs
Library
← Back to context

Comment by littlestymaar

2 months ago

If what you refer to by “on demand training ” is fine tuning, it's going to be much more efficient on a small model than a big one.

1 comment

littlestymaar

Reply

red75prime  2 months ago

LoRA can work with big models. But I mean sample-efficient RL.

Slacker News

Product

  • API Reference
  • Hacker News RSS
  • Source on GitHub

Community

  • Support Ukraine
  • Equal Justice Initiative
  • GiveWell Charities