Comment by lossolo
1 hour ago
Well, there are multiple token proposals processed in parallel, from which only one is picked, seems like branching to me. The only difference is that in case of CPU there is always only one possible branch that is correct.
No comments yet
Contribute on Hacker News ↗