Comment by istjohn

9 months ago

This study found that an LLM outperformed doctors "on a standardized rubric of diagnostic performance based on differential diagnosis accuracy, appropriateness of supporting and opposing factors, and next diagnostic evaluation steps, validated and graded via blinded expert consensus."

https://jamanetwork.com/journals/jamanetworkopen/fullarticle...

4 comments

istjohn

pingou 9 months ago

This study is about doctors using an LLM and it doesn't seem like it made them significantly more accurate than doctors not using LLM.

roenxi 9 months ago
If you look in the discussion section you'll find that wasn't exactly what the study ended up with. I'm looking at the paragraph starting:
> An unexpected secondary result was that the LLM alone performed significantly better than both groups of humans, similar to a recent study with different LLM technology.
They suspected that the clinicians were not prompting it right since the LLM without humans was observed to be outperforming the LLM with skilled operators.
- fwip 9 months ago
  
  Exactly - if even the doctors/clinicians are not "prompting it right," then what are the odds that the layperson is going to get it to behave and give accurate diagnoses, rather than just confirm their pre-existing biases?
- pingou 9 months ago
  
  Ah right, very interesting, thank you.