Microsoft's Diagnostic AI System Outperforms Physicians on Complex Cases
Microsoft's AI Diagnostic Orchestrator, known as MAI-DxO, correctly diagnosed 85.5% of complex medical cases from the New England Journal of Medicine. A group of 21 experienced physicians from the U.S. and U.K., tested on the same cases, achieved a mean accuracy of 20%. The system also reached correct diagnoses at lower cost by ordering fewer virtual diagnostic tests.
The results remain under external peer review and have not yet been published in a scientific journal as of mid-March 2026.
The benchmark marks a shift in how healthcare organizations deploy AI, moving from administrative tasks toward direct involvement in diagnosis and clinical decisions.
From Documentation to Clinical Reasoning
Healthcare AI was once limited to automating notes and referrals. The technology is now taking on clinical reasoning.
A Google Cloud report found that 44% of healthcare executives had AI agents in production as of late 2025. Organizations are reallocating budgets toward systems capable of executing defined clinical decisions under human supervision. The same report found that 90% of healthcare leaders reported positive returns from generative AI, particularly in patient screening, imaging analysis and automated documentation.
MAI-DxO does not rely on a single model. Instead, it simulates a panel of clinicians by coordinating multiple language models that ask questions, order virtual tests and refine reasoning before making a diagnosis. This approach mirrors how physicians collaborate in practice.
Microsoft acknowledged the benchmark's limits. The system used curated case records, not real-time patient interactions. Physicians in the study worked alone, without colleagues or external references, unlike in typical clinical practice.
Consumer-Facing Diagnostic Tools
Microsoft introduced Copilot Health, a consumer platform that extends diagnostic capability into personal healthcare management. The system aggregates personal health records, wearable data from more than 50 devices and lab results from over 50,000 U.S. hospitals, synthesizing that information into personalized insights.
Microsoft positioned Copilot Health as an early step toward what it describes as "medical superintelligence," rather than a tool for delivering clinical diagnoses.
AI as Healthcare's Entry Point
Patients are increasingly turning to AI before visiting a doctor. Microsoft said its platforms, including Bing and Copilot, now handle more than 50 million health-related sessions per day. OpenAI reported that more than 40 million people use ChatGPT daily for health-related queries, with roughly 70% of those interactions occurring outside traditional clinic hours.
In underserved and rural areas, AI tools are meeting demand that existing healthcare infrastructure cannot address. This positions AI systems as an entry point into the healthcare system, influencing how and when patients seek care.
Rather than beginning with a primary care visit, many patients now start with conversational AI, which can shape symptom interpretation, triage decisions and provider selection.
Economic Case for Diagnostic AI
U.S. healthcare spending is approaching 20% of gross domestic product. Microsoft estimates that up to 25% of that spending produces little measurable improvement in patient outcomes.
Systems that reduce diagnostic uncertainty earlier in the care journey could lower costs while improving efficiency. This strengthens the case for broader deployment across healthcare organizations.
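As a rough check on the scale involved, the two figures above can be multiplied together. The short Python sketch below treats both as upper bounds ("approaching 20%" and "up to 25%"); the function name `low_value_share_of_gdp` is hypothetical and the result is an illustrative ceiling, not a figure reported by Microsoft.

```python
def low_value_share_of_gdp(health_share_of_gdp, low_value_fraction):
    """Share of GDP spent on care with little measurable benefit."""
    return health_share_of_gdp * low_value_fraction


# Upper-bound inputs taken from the article: ~20% of GDP, up to 25% low-value.
ceiling = low_value_share_of_gdp(0.20, 0.25)
print(f"Up to about {ceiling:.0%} of GDP")
```

Under those assumptions, low-value health spending would amount to at most roughly 5% of U.S. GDP.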
The Risk of Confident Wrong Answers
Generative AI sometimes delivers confident but incorrect answers, a serious problem in clinical settings where accuracy directly affects patient care.
The question facing healthcare is not whether AI will affect diagnosis, but how fast the technology will be deployed and under which constraints. As performance rises and commercial pressure grows, the path forward will depend on accountability, oversight and trust in healthcare systems.