Comment by vajrabum
3 hours ago
p is not p-hacking. p-hacking is when instead of making a hypothesis and then looking for a correlation instead you look at the data and see if there are any correlations to be found. The problem with the second is if there are lots of possibilities it's much more likely that you will find a spurious correlation. That happens because the sample distribution is not the population distribution. And in fact in complicated data sets it's more likely that if you look at everything that you'll find spurious correlations than not. In social sciences these days people often register their hypothesis before running the study to prevent p-hacking and reduce the possibility of spurious correlation.
No comments yet
Contribute on Hacker News ↗