Comment by biophysboy
6 days ago
Could you not cache the top k outputs given a provided input token set? I thought the randomness was applied at the end by sampling the output distribution.
6 days ago
Could you not cache the top k outputs given a provided input token set? I thought the randomness was applied at the end by sampling the output distribution.
No comments yet
Contribute on Hacker News ↗