Comment by goldenarm

4 hours ago

The non-hallucination rate in AA-omniscience is SOTA, better than Opus 4.7, Gemini 3.1 Pro and GPT5.5! Congrats to the team

5 comments

goldenarm

> The non-hallucination rate in AA-omniscience is SOTA

Note that a perfect "non-hallucination rate" is rather meaningless, as such tests can themselves contain human hallucinations.

It means the model aligns with the possibly-true, possibly-false beliefs of the group that made the test.

rlt 1 hour ago

Well, yes, garbage in garbage out. That's a given and not what's meant by "hallucination" in this context.

referencing this:

(I had to add it to the chart; it wasn't displayed by default. Is it the lowest rate in the dataset or not?)

Truly incredible! Very impressed by their progress. I wonder how many of their own chips they used for training.

I wonder at what level there's a capability state transition. 5%? 1%?