Comment by quadrature

6 months ago

Is the problem mainly with tool use? And are you using it through AI Studio or through the API?

I've found that it hallucinates tool use for tools that aren't available and then gets very confident about the results.

nusl 6 months ago

Via the chat prompt mostly, and sometimes via Copilot. It was quoting me sources and links that didn't exist, and when I told it the links were wrong it doubled down forever, no matter how hard I tried to tell it otherwise. Even sent screenshots, etc.

Kinda just got stuck in a self-confident loop that time. Other times the output is just far worse than Claude's for similar use cases, whereas a couple of months back it was stronger, at least in my subjective experience.