← Back to context

Comment by mark_l_watson

2 days ago

I don’t use it often either, but sometimes it is very useful. When I caught Covid last fall my wife incorrectly thought I had it three times. I was using a beta Google Gemini, and paying for it, and I asked “read my @gmail and tell me the date ranges when I have had Covid.”

That worked, but to be honest I have tried similar things more recently that didn’t work. Perhaps there is a routing model up front that decides whether or not to use a lot of compute for any given query?

Google also plans on charging more money for APIs for code completion plugins for IntelliJ IDs, etc. this year.

I would like to see AI pricing models be sustainable, not give things away for free, and have lots of control over when I use a lot of compute. I actually have this right now because I usually use LLM APIs and write my own agents for specific tasks.