Comment by kqr

3 years ago

The very idea underlying statistics (that we can learn something useful from a class of events by ignoring specifically that which makes them different from each other) is a very recent invention -- only a few hundred years old -- and it's still counter-intuitive to a lot of people.

Ask people to predict whether the Joneses have a dog and they'll start asking questions about the specifics of the Joneses, like do they have children, do they live in the suburbs, have they always wanted a dog, etc. Then they try to construct a narrative based on "logical" conclusions.

The right approach might instead be to ask "what's the background rate of dog ownership in the Joneses' country of residence?" But that requires ignoring the specifics, and that doesn't come easily to people. It took several millennia for anyone to realise it could be useful.

smallnamespace 3 years ago

> Ask people to predict whether the Joneses have a dog and they'll start asking questions about the specifics of the Joneses, like do they have children, do they live in the suburbs, have they always wanted a dog, etc. Then they try to construct a narrative based on "logical" conclusions.

This is the core idea of Bayesian statistics.

You fall back to the background rate only if you have no ability to access more pertinent facts.

kqr 3 years ago

When people ask whether the Joneses have children, they do not (most of the time) have an accurate conditional probability of dog ownership given a certain number of children. They are trying to construct a story that seems plausible.

The difference between Bayesian reasoning and the narrative fallacy is a coherent evaluation of joint probabilities -- the bit most people skip over.
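
To make that distinction concrete, here is a minimal Python sketch. The household counts are invented purely for illustration (they are not real survey data), and the names `joint`, `p_dog` and `p_dog_given` are hypothetical. It shows the base rate and the conditional rate both falling out of the same joint table, which is the part a plausible-sounding story does not give you.

```python
# Hypothetical joint counts of households: (children status, pet status) -> count.
# Every number here is made up for illustration only.
joint = {
    ("children", "dog"): 180,
    ("children", "no dog"): 220,
    ("no children", "dog"): 120,
    ("no children", "no dog"): 480,
}

total = sum(joint.values())


def p_dog():
    """Base rate: marginal probability that a household owns a dog."""
    dog_households = sum(n for (_, pet), n in joint.items() if pet == "dog")
    return dog_households / total


def p_dog_given(kids):
    """Conditional probability of dog ownership given the children status."""
    with_dog = joint[(kids, "dog")]
    group_size = with_dog + joint[(kids, "no dog")]
    return with_dog / group_size


base_rate = p_dog()                             # marginal over the whole table
with_children = p_dog_given("children")         # restricted to households with children
without_children = p_dog_given("no children")   # restricted to households without

# Coherence check (law of total probability): the conditional rates, weighted
# by how common each group is, must reproduce the base rate.
p_children = (joint[("children", "dog")] + joint[("children", "no dog")]) / total
recombined = with_children * p_children + without_children * (1 - p_children)

print(base_rate, with_children, without_children, recombined)
```

Asking about children only improves on the base rate if the conditional rates come from counts like these; otherwise the question just feeds the story.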
- smallnamespace 3 years ago
  
  It's doubtful people have access to population-level base rates either, in which case they would also arrive at the wrong answer, no narrative needed.
  But why do you point the finger at narrative instead of a simple lack of data? In the dog example, those questions are absolutely good ones to ask, so long as the data can be found.
  One reason to reject this mental opposition between 'Bayesian reasoning' and 'narrative fallacy' is that narratives are essential to coming up with good conditions on which to split the population in the first place.
  'I would ask whether they have children, because children like dogs and therefore it's plausible that families with children have a higher rate of dog ownership' is a reasonable narrative that justifies collecting an extra column in your dataset.
  Granted, one shouldn't accept that story uncritically without checking against the actual statistics. But narratives themselves are unavoidable even when you're doing statistics correctly!
  