Comment by fooker
13 hours ago
This is how language models have worked since their inception, and it has been steadily improved since about 2018.
See embedding models.
> they removed the tokenizer altogether
This is an active research topic, with no real solution in sight yet.