Researchers at Harvard University and Harvard Medical School found that an AI system outperformed emergency-room physicians in early diagnosis and triage.
This development suggests a potential shift in how critical conditions are identified in high-pressure medical environments. By accelerating the triage process, AI could improve patient outcomes through faster intervention.
The study utilized OpenAI's o1 model to evaluate diagnostic accuracy in U.S. emergency rooms [1]. According to the findings, the AI model correctly identified or closely matched the diagnosis in 67% of cases [1].
In comparison, attending physicians correctly identified or closely matched the diagnosis in 50% to 55% of cases [1]. The data indicates that the AI system was able to identify critical conditions faster than the human medical staff, a gap that could be vital in life-threatening scenarios [2].
While the AI showed strength in initial triage, other reports indicate the system still lags in complex research reasoning [2]. The Harvard researchers focused specifically on the early stages of patient care, where rapid identification of a condition is the primary goal [1].
Medical professionals have long debated the integration of large language models into clinical workflows. The results from this study highlight the ability of the o1 model to process emergency data and provide a diagnostic starting point that exceeds the baseline performance of attending physicians [1].
“AI model correctly identified or closely matched the diagnosis in 67% of cases”
The disparity between the AI's 67% accuracy and the physicians' 50-55% rate suggests that AI may serve as an effective decision-support tool rather than a replacement. In the chaotic environment of an emergency room, these tools can act as a safety net to catch critical diagnoses that human doctors might overlook during initial triage.




