← Back to context

Comment by esafak

2 hours ago

Hear, hear. Even if the model fits, a few tokens per second make no sense. Time is money too.

Maybe for a coding agent, but a daily/weekly report on sensitive info?

If it were 2016 and this technology existed but only in 1 t/s, every company would find a way to extract the most leverage out of it.