Comment by machiaweliczny
5 days ago
No it's not because cost is much lower. They do some kind of speculative decoding in monte-carlo way If I had to guess as humans do it this way is my hunch. What I mean it's kinda the way you describe but much more efficient.
No comments yet
Contribute on Hacker News ↗