Comment by cortic

1 month ago

> ChatGPT (o3): Scored 136 on the Mensa Norway test in April 2025

So yes, most people are right in that assumption, at least by the metric of how we generally measure intelligence.


cortic

Does an LLM scoring well on the Mensa test translate to it producing excellent, factual police reports? That is probably not true of humans who do well on the Mensa test, so why would it be true of an LLM?

We should probably verify that rigorously, given that the role itself is about rigorous verification beyond reasonable doubt.

I can immediately, and reasonably, doubt the output of an LLM, pending verification.

gilrain 1 month ago

> the metric of how [the uninformed] generally measure intelligence

cortic 1 month ago

How do the informed measure intelligence?
I know I'm too late to ask this question, but I suspect it's either feelings and intuitions, which are just a primitive IQ test, or some kind of aptitude test, which is just a different flavor of IQ test.

vid 1 month ago

Court reports should be as much about human sensibility. I have met plenty of high-IQ people who were insensitive.

cortic 1 month ago
Having listened to some of the new AI-generated songs on YouTube, it looks like they might be better at being sensitive humans than we are as well.
- gilrain 1 month ago
  
  Where do you imagine they copied those human sensitivities from? The weather?
  

turtlesdown11 1 month ago

Yeah, I certainly associate LLMs with high intelligence. When they provide fake links to fake information, I think, man, this thing is SMART.