Slacker News

Comment by clickety_clack

1 day ago

You can train a tokenizer on old data just like you can train a model on old data.

2 comments

wongarsu  20 hours ago

But you can't use an old model with a new tokenizer. Changing the tokenizer implies you trained the model from scratch.
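A toy sketch of why the pairing matters, assuming the model's embedding table is indexed by token ID. The vocabularies, vectors, and the `encode`/`embed` helpers are made up for illustration and don't come from any real model or library.

```python
# Toy illustration: a model's embedding rows are tied to the token IDs
# of the tokenizer it was trained with. All values here are hypothetical.

old_vocab = {"hello": 0, "world": 1, "<unk>": 2}
new_vocab = {"world": 0, "<unk>": 1, "hello": 2}  # same tokens, different IDs

# Embedding table learned with old_vocab: row i belongs to old token ID i.
embeddings = [
    [1.0, 0.0],  # "hello"
    [0.0, 1.0],  # "world"
    [0.5, 0.5],  # "<unk>"
]

def encode(text, vocab):
    """Whitespace-split the text and map each piece to an ID."""
    return [vocab.get(tok, vocab["<unk>"]) for tok in text.split()]

def embed(ids):
    """Look up one embedding row per token ID."""
    return [embeddings[i] for i in ids]

text = "hello world"
old_ids = encode(text, old_vocab)
new_ids = encode(text, new_vocab)

old_vectors = embed(old_ids)
new_vectors = embed(new_ids)  # rows no longer correspond to the right tokens
```

With the new tokenizer, the same text yields different IDs, so the old model looks up rows that were learned for other tokens.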

  • dannyw  14 hours ago

    A little bit of post-training will fix that. Folks on /r/LocalLLaMA have been making effective finetunes with different tokenizers for years.
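One way this could look in practice, as a rough sketch: before fine-tuning, build the new embedding table from the old one so the model doesn't start from random rows. This is a common initialization heuristic, not necessarily what those finetunes did. The vocabularies and the `old_pieces`/`init_row` helpers are hypothetical.

```python
# Hypothetical sketch: build an embedding table for a new vocabulary by
# reusing rows from the old one, as a starting point before fine-tuning.

old_vocab = {"un": 0, "happy": 1, "<unk>": 2}
old_embeddings = [
    [1.0, 0.0],  # "un"
    [0.0, 1.0],  # "happy"
    [0.5, 0.5],  # "<unk>"
]

new_vocab = {"happy": 0, "unhappy": 1, "<unk>": 2}

def old_pieces(token):
    """Greedily split a token into the longest pieces known to old_vocab."""
    pieces, start = [], 0
    while start < len(token):
        for end in range(len(token), start, -1):
            if token[start:end] in old_vocab:
                pieces.append(token[start:end])
                start = end
                break
        else:
            return ["<unk>"]  # no known piece covers this position
    return pieces

def init_row(token):
    """Copy the old row if the token existed, else average its old pieces."""
    if token in old_vocab:
        return list(old_embeddings[old_vocab[token]])
    rows = [old_embeddings[old_vocab[p]] for p in old_pieces(token)]
    dim = len(rows[0])
    return [sum(r[d] for r in rows) / len(rows) for d in range(dim)]

new_embeddings = [None] * len(new_vocab)
for token, idx in new_vocab.items():
    new_embeddings[idx] = init_row(token)
```

The model would still need the post-training step afterwards; this only gives it a less arbitrary starting point.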
