Comment by avereveard

13 hours ago

I use glm for all code investigations and top level system design of all kinds, and then present finding to confirm and act upon to opus. everything that burns token goes there.

the finding aren't always accurate, but it saves ton of opus token

likewise I have google ai from my photo storage, so I give claude / opencode a skill that uses gemini (agy now) command line for web searches, using their flash model line.