Comment by freehorse
2 hours ago
Let’s say TTFT needed the most improvement. At some point, loading the model with enough context size may take tens of seconds in some macs.
2 hours ago
Let’s say TTFT needed the most improvement. At some point, loading the model with enough context size may take tens of seconds in some macs.
No comments yet
Contribute on Hacker News ↗