Comment by behnamoh
2 months ago
tok/s speeds in the video:
- 1st message (empty context): 857 tok/s
- 2nd message (2244 tokens in context): 727 tok/s
- 3rd message (2244+1398 tokens in context): 693 tok/s
I'm no expert in diffusion models but this looks like a drastic drop in speed, especially in longer chats (this was just 3 messages).
No comments yet
Contribute on Hacker News ↗