Publication

Preparing for Pandemics with Large Language Models: An Evaluation of Sensitivity Across COVID-19, Zika, and Monkeypox Case Reports

Nguyen, Dan
Rao, Arya S
Mazumder, Aneesh
Arraiza, Bianca
Aldrich, Alex
Marks, William
Succi, Marc D
Document Type
Journal Article
Publication Date
2026-03-28
Abstract

Large language models (LLMs) have emerged as potential tools for early disease characterization and pandemic preparedness due to their ability to interpret complex textual data. This study evaluated the sensitivity of three LLMs (GPT-5, Claude Sonnet 4, and Gemini 2.5 Pro) on early case reports of COVID-19, Mpox, and Zika. Each case report was modified to remove explicit diagnostic terms, and models were prompted to identify whether the presentation represented a disease of pandemic potential. Claude Sonnet 4 achieved the highest overall sensitivity across the three diseases. GPT-5 demonstrated inconsistent results, performing poorly on Mpox. The findings highlight significant variability in diagnostic reliability across LLMs, emphasizing the need for multimodal integration, dataset refinement, and ethical oversight. Limitations include the small sample size, the use of retrospective English-language case reports, text-only inputs, and evaluation restricted to known diseases.

Clinical Trial Number. Not applicable.

Source

Nguyen D, Rao AS, Mazumder A, Arraiza B, Aldrich A, Marks W, Succi MD. Preparing for Pandemics with Large Language Models: An Evaluation of Sensitivity Across COVID-19, Zika, and Monkeypox Case Reports. J Med Syst. 2026 Mar 28;50(1):40. doi: 10.1007/s10916-026-02367-4. PMID: 41896422.

DOI
10.1007/s10916-026-02367-4
PubMed ID
41896422