Comment by Der_Einzige
3 days ago
You clearly didn't read the recent speculative decoding papers, because it's been possible to use any model to speculate for any other model for a while. They solved the tokenization problems that prevented this in the past.