We deployed large language model (LLM) decision support using GPT4 for health workers at two outpatient clinics in Nigeria. For each patient, health workers drafted care plans that were optionally revised after LLM feedback. We compared unassisted and assisted plans using (i) blinded randomized assessments by on-site physicians who assessed and treated the same patients and (ii) results from laboratory tests for common conditions. Academic physicians performed blinded retrospective reviews of a subset of notes. Providers reported high satisfaction with LLM feedback, and retrospective academic reviewers rated LLM-assisted plans more favorably. However, on-site physicians observed little to no improvement in diagnostic alignment or treatment decisions. Objective testing showed mixed effects of LLM-assistance, with reduced over testing for malaria but increased over testing for urinary tract infection and anemia. This highlights a gap between chart-based reviews and real-world clinical relevance that may be especially important in evaluating the effectiveness of LLM based interventions.
| Repository name | URI |
|---|---|
| Reproducible Research Repository (World Bank) | https://reproducibility.worldbank.org |
Paper exhibits were reproduced on a computer with the following specifications:
• OS: Windows 11 Enterprise
• Processor: INTEL(R) XEON(R) PLATINUM 8562Y+ 2.80 GHz (4 processors)
• Memory available: 32.0 GB
Run time: ~ 5 minutes
Since all the data is not included, the package includes the results produced by replicators. These files can be used to review the results presented in the paper.
All data is limited-access and has not been included in the reproducibility package. For more details, please refer to the README file
| Author | Affiliation | |
|---|---|---|
| Anja Sautmann | World Bank | asautmann@worldbank.org |
| Jason Abaluck | Yale School of Management, Yale University and the National Bureau of Economic Research | jason.abaluck@yale.edu |
| Robert Pless | Department of Computer Science, George Washington University | pless@gwu.edu |
| Nirmal Ravi | eHealth Africa EHA Clinics Nigeria and Department of Emergency Medicine, George Washington University | nirmal.ravi@ehealthafrica.org |
| Aaron Schwartz | Department of Medical Ethics and Health Policy and Division of General Internal Medicine, University of Pennsylvania Perelman School of Medicine, and Crescenz VA Medical Center | aaron.schwartz@pennmedicine.upenn.edu |
2025-12-17
| Location | Code |
|---|---|
| Nigeria | NGA |
The materials in the reproducibility packages are distributed as they were prepared by the staff of the International Bank for Reconstruction and Development/The World Bank. The findings, interpretations, and conclusions expressed in this event do not necessarily reflect the views of the World Bank, the Executive Directors of the World Bank, or the governments they represent. The World Bank does not guarantee the accuracy of the materials included in the reproducibility package.
| Name | URI |
|---|---|
| Modified BSD3 | https://opensource.org/license/bsd-3-clause/ |
| Name | Affiliation | |
|---|---|---|
| Anja Sautmann | World Bank | asautmann@worldbank.org |
| Reproducibility WBG | World Bank | reproducibility@worldbank.org |
| Name | Abbreviation | Affiliation | Role |
|---|---|---|---|
| Reproducibility WBG | DECDI | World Bank - Development Impact Department | Verification and preparation of metadata |
2025-12-17
1