Comment by immibis
2 months ago
But not the tokens that don't even feed into your output because they're feeding into someone else's output. Separate items in batches don't get mixed up with each other - they just run the model separately on each item at the same time, like SIMD.
No comments yet
Contribute on Hacker News ↗