← Back to context

Comment by ludwik

7 months ago

There is "performance" as in "speed and cost" and performance as in "the model returning quality responses, without getting lost in the weeds". Caching only helps with the former.

"the model returning quality responses, without getting lost in the weeds"

I should edit, but that would be disingenuous. This is exactly what I meant.

thank you!