← Back to context

Comment by directevolve

1 day ago

Punishing researchers who make mistakes or get unlucky due to noise in the data is a recipe for disaster, just like in other fields. The ideal amount of fraud and false claims in research is not zero, because the policing effort it would take to accomplish this goal would destroy all other forms of value. I can't emphasize enough how bad an idea blackballing researchers for publishing irreproducible results would be.

We have money to fund direct reproducibility studies (this one is an example), and indirect replication by applying othogonal methods to similar research topics can be more powerful than direct replication.

Completely agree.

Given the way that science and statistics work, completely honest researchers that do everything correct and don't make any mistakes at all will have some research that fails to reproduce. And the flip side of that is that some completely correct work that got the right answer, some proportion of the time, the reproduction attempt will incorrectly fail to reproduce. Type 1 and Type 2 errors are both real and occur without any need for misconduct or mistakes.

Well, don't forgot I also said this!

> With whatever sort of due process is needed to make this reasonable

Is it not reasonable to not continue to fund scientists whose results consistently do not reproduce? And should we not spend the funds to verify that they do (or don't) reproduce (rather than e.g. going down an incredibly expensive goose-chase like recently happened w/ Alzheimer's research)?

Currently there is more or less no reason not to fudge results; your chances of getting caught are slim, and consequences are minimal. And if you don't fudge your results, you'll be at a huge disadvantage when competing against everyone that does!

Hence the replication crises.

So clearly something must be done. If not penalyzing failures to reproduce and funding reproduction efforst, then what?

  • Your way of thinking sounds alien to me. You seem to assume that people mostly just follow the incentives, rather than acting according to their internal values.

    Science is a field with low wages, uncertain careers, and relatively little status. If you respond strongly to incentives, why would you choose science in the first place? People tend to choose science for other reasons. And, as a result, incentives are not a particularly effective tool for managing scientists.

    • Of course people will follow their own internal values in some cases, but we really want to arrange things so that the common and incentived path is the happy path!

      And without the proper systemic arrangements, people with strong internal values will just tend to get pushed out. E.g., an example from today's NY times: https://archive.is/wV4Sn

      I don't mean to seem too cynical about human nature; it's not so much that I don't think people with good motivations won't exist, it's that you need to create a broader ecosystems where those motivations are adaptive. Otherwise they'll just get pushed out.

      By analogy, consider a competitive sport, like bicycling. Imagine if it was just an honor system to not use performance enhancing drugs; even if 99% of cyclists were completely honest, the sport would still be dominated by cheaters, because you simply wouldn't be able compete without cheating.

      The dynamics are similar in science if you allow for bad research to go unchallenged.

      (PS: Being a scientist is very high-status! I can imagine very few things with as much cachet at a dinner-party as saying "I'm a scientist".)

      3 replies →

> The ideal amount of fraud and false claims in research is not zero, because the policing effort it would take to accomplish this goal would destroy all other forms of value.

Surely that just means that we shouldn't spend too much effort achieving small marginal progress towards that ideal, rather than that's not the ideal? I am a scientist (well, a mathematician), and I can maintain my idealism about my discipline in the face of the idea that we can't and shouldn't try to catch and stop all fraud, but I can't maintain it in the face of the idea that we should aim for a small but positive amount of fraud.

  • It's not actually "Ideal" is the point.

    You CANNOT create a system that has zero fraud without rejecting a HUGE amount of legitimate work/requests.

    This is as true for credit card processing as it is for scientific publishing.

    There's no such thing as "Reject 100% of fraud, accept 100% of non-fraud". It wouldn't be "ideal" to make our spaceships with anti-gravity drives, it would be "science fiction".

    The relationship between how hard you prevent fraud and how much legitimate traffic you let through is absurdly non-linear, and super dependent on context. Is there still low hanging fruit on the fraud prevention pipeline for scientific publishing?

    That depends. Scientists claim that having to treat each other as hostile entities would basically destroy scientific progress. I wholeheartedly agree.

    This should be obvious to anyone who has approved a PR from a coworker. Part of our job in code review is to prevent someone from writing code to do hostile things. I'm sure most of us put some effort towards preventing obvious problems, but if you've ever seen https://en.wikipedia.org/wiki/International_Obfuscated_C_Cod... or some of the famous bits of code used to hack nation states then you should recognize that the amount of effort it would take to be VERY SURE that this PR doesn't introduce an attack is insane, and no company could afford it. Instead, we assume that job interviews, coworker vibes, and reputation are enough to dissuade that attack vector, and it works for almost everyone except the juiciest targets.

    Science is a high trust industry. It also has "juicy targets" like "high temp superconductor" or "magic pill to cure cancer", but scientists approach everything with "extreme claims require extreme results" and that seems to do alright. They mostly treated LK-99 with "eh, let's not get hasty" even as most of the internet was convinced it was a new era of materials. I think scientists have a better handle on this than the rest of us.

    • > It's not actually "Ideal" is the point.

      > You CANNOT create a system that has zero fraud without rejecting a HUGE amount of legitimate work/requests.

      I think that we are using different definitions of "ideal." It sounds like your definition is something like "practically achievable," or even just "can exist in the real world," in which case, sure, zero fraud is not ideal in that sense. To check whether I am using the word completely idiosyncratically, I just looked it up in Apple Dictionary, and most of the senses seem to match my conception, but I meant especially "2b. representing an abstract or hypothetical optimum." It seems very clear to me that you would agree with zero fraud being ideal in sense "2a. existing only in the imagination; desirable or perfect but not likely to become a reality," but possibly we can even agree that it also fits sense 2b above.