Comment by whimsicalism
6 days ago
I imagine you have to start decoding many speculative completions in parallel to have true low latency.
6 days ago
I imagine you have to start decoding many speculative completions in parallel to have true low latency.
No comments yet
Contribute on Hacker News ↗