Comment by anoncareer0212
15 days ago
Hmmm, I might be rounding off wrong? Or reading it wrong?
IIUC the data we have:
2K tokens / 12 seconds ≈ 167 tokens/s prefill
120K tokens / (10 minutes == 600 seconds) = 200 tokens/s prefill
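
For anyone who wants to double-check the division, here is a quick sketch of the two figures above. It assumes "K" means 1,000 tokens (not 1,024) and that the timings are pure prefill; `prefill_rate` is just a helper name made up for this example.

```python
def prefill_rate(tokens: int, seconds: float) -> float:
    """Return prefill throughput in tokens per second."""
    return tokens / seconds

# Assumption: "2K" and "120K" mean 2,000 and 120,000 tokens.
short_prompt = prefill_rate(2_000, 12)
long_prompt = prefill_rate(120_000, 10 * 60)  # 10 minutes == 600 seconds

print(f"short prompt: {short_prompt:.1f} tokens/s")
print(f"long prompt:  {long_prompt:.1f} tokens/s")
```

So under those assumptions the two data points land within roughly 20% of each other.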