Comment by porphyra
20 hours ago
I think Atlas might also be slightly faster than vLLM:
https://flowtivity.ai/blog/120-tok-s-1m-context-private-ai-d...
20 hours ago
I think Atlas might also be slightly faster than vLLM:
https://flowtivity.ai/blog/120-tok-s-1m-context-private-ai-d...
No comments yet
Contribute on Hacker News ↗