A peer-reviewed study published in Science this week found that AI emergency room diagnosis using OpenAI’s o1 model outperformed human doctors at identifying patient conditions. Harvard and Beth Israel researchers pitted OpenAI’s o1 model against attending physicians across six clinical experiments, and the AI came out ahead in most of them.
Nigeria’s emergency wards are stretched thin. The diagnostic gap at triage is real and documented. This study asks whether technology can do what the system currently cannot.
What the Harvard Study Actually Found
The most striking experiment used real data from 76 patients who arrived at the Beth Israel emergency room. Two attending physicians and OpenAI’s o1 and 4o models each assessed patients at three distinct moments: initial triage, post-evaluation, and the point where a hospitalization or ICU decision was required.
At initial triage, o1 matched or closely approximated the correct diagnosis in 67% of cases. The two attending physicians hit 55% and 50%, respectively. That is not a marginal gap. As more patient data accumulated, o1’s accuracy climbed to 72.4% and then 81.6%.
The results in treatment planning were even starker. The o1-preview model scored 90% on clinical management tasks. Doctors using GPT-4 as an aid reached only 41%. Doctors referencing standard materials managed 34%.
On a structured medical reasoning test with 80 questions, o1-preview scored full marks on 78. Specialists managed 28. Residents are just 16.
The blind review design strengthens the study’s credibility. Evaluating physicians did not know which diagnoses came from AI and which from humans. “We tested the AI model against virtually every benchmark, and it eclipsed both prior models and our physician baselines,” said Arjun Manrai, who leads an AI lab at Harvard Medical School and is a co-author on the paper.
See Also: Meta's $145B AI Push Is Making Your Gadgets Pricier
What the Researchers Say AI Still Cannot Do
The research team was emphatic: none of this means AI is ready to replace doctors. Every experiment used text-based records and written case descriptions. Emergency medicine involves far more than text.
A doctor reading a patient’s face, assessing pain through body language, listening to breathing, and conducting a physical examination—none of that is captured in an electronic medical record. The researchers stated the study “only evaluated text-based clinical judgment capabilities” and called for prospective clinical trials before any real-world deployment.
Rodman, a Beth Israel physician and co-author, was direct: there is currently “no formal framework for accountability” around AI diagnoses. Patients, he said, still “want humans to guide them through life-or-death decisions.” That instinct is not irrational; it reflects something that accuracy metrics do not measure.
The paper frames its findings as an argument for urgency, not replacement. It calls for prospective trials testing AI alongside human physicians in actual clinical settings.
See Also: Nigeria Orders Banks and Fintechs to Disclose Cyberattacks
The Implication for Nigerian Healthcare
Nigeria faces a severe doctor shortage, a gap documented by the WHO and the Nigerian Medical Association. Emergency departments at Lagos University Teaching Hospital, National Hospital Abuja, and similar institutions regularly operate at capacity, with consultants stretched across more patients than any one person should manage.
A tool that narrows the diagnostic gap at triage has obvious potential here. Time and accuracy matter most at that stage, and that is exactly where o1 outperformed. The question is deployment, accountability, and infrastructure. AI diagnostics require reliable internet, integrated electronic medical records, and a regulatory framework that does not yet exist in Nigeria.
The Harvard study does not hand hospitals a ready product. It gives policymakers and hospital administrators the strongest scientific basis yet to take AI diagnostics seriously. The research was done in Boston. The decision about what happens next should not stay there.
Stay ahead of the technology stories reshaping Africa and the world. Subscribe to the GizPulse newsletter, twice a week, straight to your inbox.



