Comment by noddybear, 7 days ago:

Aren’t Unicode characters generally treated as 2 tokens to avoid a huge vocabulary?