Comment by articlepan
3 days ago
Title is bad, it's the first line of the abstract instead of the paper title. Speculative decoding for LLM inference was published in 2022: https://arxiv.org/abs/2211.17192
This paper seems to be an improvement to speculative decoding but I haven't read it yet.
No comments yet
Contribute on Hacker News ↗