Comment by stevula

14 hours ago

I think that is how the smarter agents do things? Just like Claude/ChatGPT sometimes does a web search they can do other tool calls instead of just making a statistical guess. Of course it doesn’t always make the bright choice between those options though…

4 comments

stevula

fipar 13 hours ago

They will also lie and produce output saying it is based on tool execution, without having actually used the tool.

Yes, another layer to cross-check, say, “in kubectl logs I see …” with an actual k8s tool call can help, that is, when the cross-check layer doesn’t lie.

For the time being, IMHO, human validation in key points is the only way to get good results. This is why the tools make experienced people potentially a lot more efficient (they are quick to spot errors/BS) and inexperienced people potentially more dangerous (they’re more prone to trusting the responses, since the tone is usually very professionally sounding).

WalterBright 14 hours ago

> it doesn’t always make the bright choice

I'm available for a small fee.

sgc 13 hours ago

You must be living in absolute opulence :)
TeamGTN 3 hours ago

You should raise your price