Reproducible Research Repository
Reproducible Research Repository
  • Home
  • Repository
  • Collections
  • About
    Home / Repository / PRWP / RR_NGA_2025_481
PRWP

Reproducibility package for Can LLMs Improve Healthcare Delivery? Evidence From Physician Review And Objective Testing

2025
Get Reproducibility Package
Reference ID
RR_NGA_2025_481
Author(s)
Anja Sautmann, Jason Abaluck, Robert Pless, Nirmal Ravi, Aaron Schwartz
Collections
World Bank Policy Research Working Papers
Metadata
JSON
Created on
Jan 15, 2026
Last modified
Jan 15, 2026
  • Project Description
  • Downloads
  • Overview
  • Reproducibility Package
  • Description
  • Scope and coverage
  • Disclaimer
  • Access and rights
  • Contacts
  • Information on metadata
  • Overview

    Abstract

    We deployed large language model (LLM) decision support using GPT4 for health workers at two outpatient clinics in Nigeria. For each patient, health workers drafted care plans that were optionally revised after LLM feedback. We compared unassisted and assisted plans using (i) blinded randomized assessments by on-site physicians who assessed and treated the same patients and (ii) results from laboratory tests for common conditions. Academic physicians performed blinded retrospective reviews of a subset of notes. Providers reported high satisfaction with LLM feedback, and retrospective academic reviewers rated LLM-assisted plans more favorably. However, on-site physicians observed little to no improvement in diagnostic alignment or treatment decisions. Objective testing showed mixed effects of LLM-assistance, with reduced over testing for malaria but increased over testing for urinary tract infection and anemia. This highlights a gap between chart-based reviews and real-world clinical relevance that may be especially important in evaluating the effectiveness of LLM based interventions.

    Reproducibility Package

    Scripts
    Readme Get Reproducibility Package
    Link: https://reproducibility.worldbank.org/catalog/442/download/1258/README.pdf
    Reproducibility package for Can LLMs Improve Healthcare Delivery? Evidence From Physician Review And Objective Testing
    File name
    RR_NGA_2025_481
    Zip package
    RR_NGA_2025_481.zip
    Title
    Reproducibility package for Can LLMs Improve Healthcare Delivery? Evidence From Physician Review And Objective Testing
    Date
    2025-12
    Dependencies
    Stata dependencies are listed in the ado folder.
    Instructions
    See README in reproducibility package.
    Notes
    Computational reproducibility verified by Development Impact (DECDI) Analytics team, World Bank.
    Source code repository
    Repository name URI
    Reproducible Research Repository (World Bank) https://reproducibility.worldbank.org
    Software
    Stata
    Name
    Stata
    Version
    18 MP

    Reproducibility

    Technology environment

    Paper exhibits were reproduced on a computer with the following specifications:
    • OS: Windows 11 Enterprise
    • Processor: INTEL(R) XEON(R) PLATINUM 8562Y+ 2.80 GHz (4 processors)
    • Memory available: 32.0 GB

    Technology requirements

    Run time: ~ 5 minutes

    Reproduction instructions
    1. Secure Access to Data: Access the datasets not included in the package. See subsection Datasets and the README for more details.
    2. Download and Place Data: Once the data is accessed, users should place it in the appropriate folder.
    3. Run the Package: After placing the data in the folder, run the files in the order:
      • Update the global in line 7 of the do-file "main" to your folder's location and run the do-file.

    Since all the data is not included, the package includes the results produced by replicators. These files can be used to review the results presented in the paper.

    Data

    Datasets
    Does LLM Assistance Improve Healthcare Delivery? An Evaluation Using On-site Physicians and Laboratory Tests 2025
    Name
    Does LLM Assistance Improve Healthcare Delivery? An Evaluation Using On-site Physicians and Laboratory Tests 2025
    Note
    The analysis uses de-identified survey data, electronic medical records (EMR), patient flow data, clinical reference files, and LLM-generated evaluation outputs. Data were collected between January and October 2025 and include quality-of-care assessments, EMR records with identifiers removed, study flow information, and reference materials compiled by the research team. Raw data files should be placed in build/data/. The data can be accessed by World Bank staff via the internal World Bank Microdata Library. A detailed list of datasets is provided in data_has_report.csv.
    Access policy
    Data access requires purchase or human approval and is not included in the reproducibility package.
    License
    Research microdata with license
    License URL
    https://microdatalib.worldbank.org/index.php/data-access/#research_license
    Data URL
    https://microdatalib.worldbank.org/index.php/catalog/17051/
    Citation
    Abaluck, J., Pless, R., Ravi, N., Sautmann, A., & Schwartz, A. (2025). Does LLM Assistance Improve Healthcare Delivery? An Evaluation Using On-site Physicians and Laboratory Tests 2025 [Data set].
    Data statement

    All data is limited-access and has not been included in the reproducibility package. For more details, please refer to the README file

    Description

    Output
    Can LLMs Improve Healthcare Delivery? Evidence From Physician Review And Objective Testing
    Type
    Working Paper
    Title
    Can LLMs Improve Healthcare Delivery? Evidence From Physician Review And Objective Testing
    Description
    Policy Research Working Papers (PRWP)
    Authors
    Author Affiliation Email
    Anja Sautmann World Bank asautmann@worldbank.org
    Jason Abaluck Yale School of Management, Yale University and the National Bureau of Economic Research jason.abaluck@yale.edu
    Robert Pless Department of Computer Science, George Washington University pless@gwu.edu
    Nirmal Ravi eHealth Africa EHA Clinics Nigeria and Department of Emergency Medicine, George Washington University nirmal.ravi@ehealthafrica.org
    Aaron Schwartz Department of Medical Ethics and Health Policy and Division of General Internal Medicine, University of Pennsylvania Perelman School of Medicine, and Crescenz VA Medical Center aaron.schwartz@pennmedicine.upenn.edu
    Date of production

    2025-12-17

    Scope and coverage

    Geographic locations
    Location Code
    Nigeria NGA

    Disclaimer

    Disclaimer

    The materials in the reproducibility packages are distributed as they were prepared by the staff of the International Bank for Reconstruction and Development/The World Bank. The findings, interpretations, and conclusions expressed in this event do not necessarily reflect the views of the World Bank, the Executive Directors of the World Bank, or the governments they represent. The World Bank does not guarantee the accuracy of the materials included in the reproducibility package.

    Access and rights

    License
    Name URI
    Modified BSD3 https://opensource.org/license/bsd-3-clause/

    Contacts

    Contacts
    Name Affiliation Email
    Anja Sautmann World Bank asautmann@worldbank.org
    Reproducibility WBG World Bank reproducibility@worldbank.org

    Information on metadata

    Producers
    Name Abbreviation Affiliation Role
    Reproducibility WBG DECDI World Bank - Development Impact Department Verification and preparation of metadata
    Date of Production

    2025-12-17

    Document version

    1

    Back to Catalog
    The World Bank Working for a World Free of Poverty
    • IBRD IDA IFC MIGA ICSID

    © The World Bank Group, All Rights Reserved.