Comment by HarHarVeryFunny
1 year ago
To the extent we've now got the output of the underlying model wrapped in an agent that can evaluate that output, I'd expect it to be able to detect its own hallucinations some of the time and therefore provide an alternate answer.
It's like when an LLM gives you a wrong answer and all it takes is "are you sure?" to get it to generate a different answer.
Of course the underlying problem of the model not knowing what it knows or doesn't know persists, so giving it the ability to reflect on what it just blurted out isn't always going to help. It seems the next step is for them to integrate RAG and tool use into this agentic wrapper, which may help in some cases.
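A rough sketch of the kind of "are you sure?" reflection loop being described, under my own assumptions: call_model is a hypothetical stand-in for whatever chat-completion call the wrapper actually makes, not any specific API, and the YES/NO self-check is just one simple way the agent might evaluate its own output.

    def call_model(prompt: str) -> str:
        """Hypothetical stand-in for a real chat-completion call."""
        raise NotImplementedError

    def answer_with_reflection(question: str, max_retries: int = 1) -> str:
        # First pass: just answer the question.
        answer = call_model(question)
        for _ in range(max_retries):
            # Reflection pass: the wrapper asks the model to judge its own output.
            verdict = call_model(
                f"Question: {question}\nAnswer: {answer}\n"
                "Are you sure this answer is correct? Reply YES or NO."
            )
            if verdict.strip().upper().startswith("YES"):
                break
            # The model doubted itself, so generate an alternative answer.
            answer = call_model(
                f"Your previous answer to '{question}' may have been wrong.\n"
                f"Previous answer: {answer}\n"
                "Give a corrected answer."
            )
        return answer

Note this only helps when the model can actually recognize its own mistake on the second look, which is exactly the "not knowing what it knows" limitation mentioned above.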