← Back to context

Comment by LeoPanthera

3 months ago

Hopefully obviously, by testing it against objective facts which are nonetheless "controversial" politically.

In the end many of these are "political facts" and not objective like what year was a person born in. The answer to your question is as simple as - come up with the actual list of "facts", and then run a simple eval with every model on them.

The implementation is trivial - the listing down of "political facts" is the hard part.