Comment by curioussquirrel
8 hours ago
You're right. There was also an experiment at Meta that tokenized bytes directly, and it didn't hurt performance much in very small models.
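For anyone unfamiliar, "tokenizing bytes directly" roughly means feeding the model the raw UTF-8 bytes as token IDs, so the vocabulary is fixed at 256 entries and no learned tokenizer is needed. A minimal sketch of the idea (the function names are made up for illustration, and this is not the code from the Meta experiment):

```python
def bytes_to_ids(text: str) -> list[int]:
    """Map text to token IDs by using its UTF-8 bytes as-is (each ID is 0..255)."""
    return list(text.encode("utf-8"))


def ids_to_text(ids: list[int]) -> str:
    """Invert bytes_to_ids: pack the IDs back into bytes and decode as UTF-8."""
    return bytes(ids).decode("utf-8")


if __name__ == "__main__":
    ids = bytes_to_ids("héllo")
    print(ids)               # 'é' takes two bytes, so 5 characters become 6 tokens
    print(ids_to_text(ids))  # round-trips back to the original string
```

The obvious tradeoff is sequence length: non-ASCII text in particular expands to several tokens per character, so the model sees longer inputs than it would with a subword vocabulary.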