{"type":"script","doc_desc":{"producers":[{"name":"Reproducibility WBG","abbr":"DECDI","affiliation":"World Bank - Development Impact Department","role":"Verification and preparation of metadata"}],"prod_date":"2026-05-26","version":"1"},"project_desc":{"authoring_entity":[{"name":"Emily Cook","affiliation":"Texas A&M University","email":"ecook4@tamu.edu"},{"name":"Devaki Ghose","affiliation":"World Bank","email":"dghose@worldbank.org"},{"name":"Ekaterina Khmelnitskaya","affiliation":"University of British Columbia, Sauder School of Business","email":"ekaterina.khmelnitskaya@sauder.ubc.ca"}],"title_statement":{"title":"Reproducibility package for Federal Research Funding And STEM Education","idno":"RR_USA_2026_618"},"data_statement":"Some data is limited-access and has not been included in the reproducibility package. For more details, please refer to the README file.","software":[{"name":"Stata","version":"19.5 MP"}],"scripts":[{"title":"Reproducibility package for Federal Research Funding And STEM Education","date":"2026-05","notes":"Computational reproducibility verified by Development Impact (DECDI) Analytics team, World Bank.","instructions":"See README in reproducibility package.","file_name":"RR_USA_2026_618","zip_package":"RR_USA_2026_618.zip","dependencies":"Stata dependencies are listed in the ado folder."}],"repository_uri":[{"name":"Reproducible Research Repository (World Bank)","uri":"https:\/\/reproducibility.worldbank.org"}],"production_date":"2026-05-26","abstract":"We examine how federal science and engineering research funding\u2014though intended to advance research\u2014affects degree production and programs offered in STEM. Using data from 1971\u20132016, we implement a triple-difference design that exploits variation across colleges, time, and fields of study. We find that federal grants generate 27.4% of doctorates and 14.7% of undergraduate STEM degrees, as well as 6.3% of doctoral programs and 3.7% of undergraduate programs in STEM annually across 200 U.S. research universities. Impacts are concentrated in biology and engineering, aligning with the priorities of major funders such as HHS, NSF, and DOD. These findings suggest that research grants to universities may generate a \u201cdouble dividend,\u201d simultaneously expanding the supply of skilled labor in targeted fields while also advancing scientific discovery.","geographic_units":[{"name":"United States of America","code":"USA"}],"keywords":[{"name":"Federal Research Funding"},{"name":"Higher Education"},{"name":"Stem"},{"name":"Major Choice"},{"name":"Innovation"}],"topics":[{"id":"H52","uri":"https:\/\/www.aeaweb.org\/econlit\/jelCodes.php?view=jel","vocabulary":"Journal of Economic Literature (JEL)","name":"Government Expenditures and Education","parent_id":"H5"},{"id":" I23","uri":"https:\/\/www.aeaweb.org\/econlit\/jelCodes.php?view=jel","vocabulary":"Journal of Economic Literature (JEL)","name":"Higher Education \u2022 Research Institutions","parent_id":"I2"},{"id":" I28","uri":"https:\/\/www.aeaweb.org\/econlit\/jelCodes.php?view=jel","vocabulary":"Journal of Economic Literature (JEL)","name":"Government Policy","parent_id":"I2"},{"id":" O31","uri":"https:\/\/www.aeaweb.org\/econlit\/jelCodes.php?view=jel","vocabulary":"Journal of Economic Literature (JEL)","name":"Innovation and Invention: Processes and Incentives","parent_id":"O3"},{"id":" O38","uri":"https:\/\/www.aeaweb.org\/econlit\/jelCodes.php?view=jel","vocabulary":"Journal of Economic Literature (JEL)","name":"Government Policy","parent_id":"O3"}],"output":[{"type":"Working Paper","description":"Policy Research Working Papers (PRWP)","title":"Federal Research Funding And STEM Education"}],"language":[{"name":"English","code":"EN"}],"disclaimer":"The materials in the reproducibility packages are distributed as they were prepared by the staff of the International Bank for Reconstruction and Development\/The World Bank. The findings, interpretations, and conclusions expressed in this event do not necessarily reflect the views of the World Bank, the Executive Directors of the World Bank, or the governments they represent. The World Bank does not guarantee the accuracy of the materials included in the reproducibility package.","license":[{"name":"MIT License","uri":"https:\/\/opensource.org\/license\/mit"},{"name":"World Bank IGO Rider","uri":"https:\/\/github.com\/worldbank\/metadata-editor\/blob\/main\/WB-IGO-RIDER.md"}],"contacts":[{"name":"Emily Cook","affiliation":"Texas A&M University","email":"ecook4@tamu.edu"},{"name":"Reproducibility WBG","affiliation":"World Bank","email":"reproducibility@worldbank.org"}],"datasets":[{"name":"College Board College Handbook \u2013 Admissions Data","note":"Data accessed from print editions of the College Board's College Handbook (1968 to 2002), covering 34 academic years. Institution-level SAT and ACT score distributions manually digitized by the authors from annual print volumes. Each sheet in the Excel file corresponds to one year. File location: Data\/Raw\/All Admissions Data.xlsx (Sheets 1\u201326).","access_type":"Data is publicly available and included in the reproducibility package.","citation":"College Board. Various years. \"The College Handbook\" [dataset]. New York: College Board. Data digitized by the authors from print volumes."},{"name":"Crosswalk Between CIP and HEGIS Taxonomy, 1981","note":"Data accessed in May 2017. Fixed-width text file downloaded from ICPSR mapping HEGIS taxonomy codes to CIP codes and program titles. Used to construct a crosswalk between HEGIS subject codes and CIP codes for the 1983 and 1984 transitional survey years. Data is freely available to data users at ICPSR member institutions. File location: Data\/Raw\/HEGIS_Degrees\/CIP_HEGIS_Crosswalk_1981\/03135-0001-Data.txt.","access_type":"Data access requires purchase or human approval and is not included in the reproducibility package.","license_uri":"https:\/\/www.icpsr.umich.edu\/sites\/icpsr\/about\/policies\/terms-of-use","uri":"https:\/\/www.icpsr.umich.edu\/web\/ICPSR\/studies\/3135","citation":"United States Department of Education. National Center for Education Statistics. Crosswalk Between CIP and HEGIS Taxonomy, 1981. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2003-12-02. https:\/\/doi.org\/10.3886\/ICPSR03135.v1"},{"name":"Higher Education General Information Survey (HEGIS)","note":"Data accessed in May\u2013June 2017. Annual HEGIS microdata files downloaded from ICPSR. Includes three components: (1) Degrees Conferred (1965\u201366 through 1984\u201385, 17 files): one record per institution\u2013field\u2013degree level combination, reporting degrees conferred by sex across bachelor's, master's, doctoral, and first-professional degrees. File locations: Data\/Raw\/HEGIS_Degrees\/{year}_ICPSR_{dataset no}\/; (2) Financial Statistics (1968\u201369 through 1985\u201386, 18 files): institution-level revenues and expenditures. File locations: Data\/Raw\/HEGIS_Finance\/{year}_Finance_{dataset no}\/; (3) Institutional Characteristics (selected years: 1970, 1972\u20131974, 1977\u20131978, 1983, 7 files): institution-level tuition, fees, room and board, control type, and affiliation. File locations: Data\/Raw\/HEGIS_InstitutionalChars\/{year}_InstChar_ICPSR_{dataset no}\/. Individual ICPSR study URLs for each year are listed in the README, and detailed instructions to download the files are included in the file \"HEGISDataDownloadGuide\". Data is freely available to data users at ICPSR member institutions.","access_type":"Data access requires purchase or human approval and is not included in the reproducibility package.","license_uri":"https:\/\/www.icpsr.umich.edu\/sites\/icpsr\/about\/policies\/terms-of-use","uri":"https:\/\/www.icpsr.umich.edu\/web\/ICPSR\/studies","citation":"United States Department of Education, National Center for Education Statistics. 1965\u20131986. \"Higher Education General Information Survey (HEGIS)\" [dataset]. Inter-university Consortium for Political and Social Research [distributor]. https:\/\/www.icpsr.umich.edu\/web\/ICPSR\/series\/30. Accessed May 2017."},{"name":"Integrated Postsecondary Education Data System (IPEDS) \u2013 Completions, Finance, and Institutional Characteristics Surveys","note":"Data accessed in June 2017. Annual IPEDS microdata files downloaded from the NCES IPEDS Data Center. Includes three components: (1) Completions Survey (1987\u20132020, 34 files): one record per institution\u2013CIP code\u2013degree level combination, reporting degrees conferred by sex. Files named dct_c{year}.dta. File location: Data\/Raw\/IPEDS_Degrees\/; (2) Finance Survey (1987\u20132022): institution-level revenues and expenditures. File location: Data\/Raw\/IPEDS_Finance\/; (3) Institutional Characteristics Survey (1987\u20132023): institution-level tuition, fees, control type, and other institutional attributes. Files are split across multiple parts per year (ranging from two to seven). For years 2014\u20132023, the Admissions component is also included. File location: Data\/Raw\/IPEDS_InstitutionalChars\/. Detailed instructions to download the files can be found in the README. ","access_type":"Data is publicly available and included in the reproducibility package.","uri":"https:\/\/nces.ed.gov\/ipeds\/datacenter\/DataFiles.aspx","citation":"United States Department of Education. National Center for Education Statistics. 1987\u20132023. Integrated Postsecondary Education Data System (IPEDS). U.S. Department of Education [distributor]. https:\/\/nces.ed.gov\/ipeds\/datacenter\/DataFiles.aspx.  Accessed June 2017.","license_uri":"https:\/\/ies.ed.gov\/about\/public-access-research"},{"name":"NCES Classification of Instructional Programs (CIP) \u2013 All CIP 1985\u20132000 Tables","note":"Data accessed in September 2024. Excel file containing all CIP code lists and crosswalks for the 1985, 1990, and 2000 taxonomies. Detailed instructions to download the files can be found in the README.  File location: Data\/Raw\/All CIP.xls.","access_type":"Data is publicly available and included in the reproducibility package.","uri":"https:\/\/nces.ed.gov\/ipeds\/cipcode\/resources.aspx?y=55","citation":"National Center for Education Statistics. Classification of Instructional Programs (CIP): All CIP 1985\u20132000 Tables. U.S. Department of Education. https:\/\/nces.ed.gov\/ipeds\/cipcode\/resources.aspx?y=55. Accessed September 2024.","license_uri":"https:\/\/ies.ed.gov\/about\/public-access-research"},{"name":"Survey of Federal Science and Engineering Support to Universities, Colleges, and Nonprofit Institutions","note":"Data accessed in September 2025. Microdata files containing records of federal research funding obligations by institution and agency (1971\u2013present), obtained directly from National Science Foundation (NSF) staff. Three files: (1) data_obligation.dta \u2014 main funding records; (2) c_agency.dta \u2014 agency codes and labels; (3) c_inst_codebook.dta \u2014 institution identifiers including name, FICE code, state, and institution type. File location: Data\/Raw\/FSS_Support_Survey\/Microdata_stata_export\/.","access_type":"Data access was granted directly to the study authors by the data owners. It allows for redistribution and it is included in the reproducibility package.","citation":"National Science Foundation, National Center for Science and Engineering Statistics. 2025. \"Survey of Federal Science and Engineering Support to Universities, Colleges, and Nonprofit Institutions\" [dataset]. Microdata, 1971\u2013present. Data provided by NSF staff. Accessed September 2025."},{"name":"NSF Federal Support to Universities and Colleges, Fiscal Years 1963\u20131970","note":"Data accessed in July 2024. Author-digitized Excel files containing institution-level total federal research funding data for fiscal years 1963\u20131970, manually transcribed from scanned NSF annual reports available through HathiTrust and ERIC. No digital data version exists; the original print reports are public domain government documents. Five files covering FY 1963\u20131970.  Individual report URLs are listed in the README. File location: Data\/Raw\/FSS_Support_Survey\/FSS_support_preperiod\/.","access_type":"Data is publicly available and included in the reproducibility package.","uri":"https:\/\/catalog.hathitrust.org\/Record","citation":"Authors' compilation. 2024. \"NSF Federal Support to Universities and Colleges, Fiscal Years 1963\u20131970\" [dataset]. Digitized from: National Science Foundation. Federal Support to Universities and Colleges, Fiscal Years 1963\u20131970.","license_uri":"https:\/\/www.hathitrust.org\/the-collection\/search-access\/access-use-policy\/"},{"name":"Carnegie Classification of Institutions of Higher Education, 1994 Edition","note":"Data accessed in July 2021. Institution-level data from the 1994 Carnegie Classification used to identify and filter the analysis sample to doctoral\/research universities. Contains institution identifiers including FICE code, UNITID, institution name, and state. Note: the 1994 version is no longer available online. File locations: Data\/Raw\/Carnegie_1994_edition_data.xls; Data\/Raw\/Carnegie_1994_edition_data.csv.","access_type":"Data is publicly available and included in the reproducibility package.","license":"Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License","uri":"https:\/\/carnegieclassifications.acenet.edu\/","citation":"Carnegie Foundation for the Advancement of Teaching. 1994. \"A Classification of Institutions of Higher Education, 1994 Edition\" [dataset]. Princeton, NJ: Carnegie Foundation for the Advancement of Teaching. Accessed July 2021."},{"name":"Consumer Price Index, 1913\u2013present","note":"Data accessed in February 2020 and May 2024. Annual CPI values from the Federal Reserve Bank of Minneapolis used to construct deflator multipliers to real 2019 and real 2022 dollars. Two files: (1) CPI_long_2019.dta \u2014 deflates to real 2019 dollars; (2) CPI_long_2022.dta \u2014 deflates to real 2022 dollars. File location: Data\/Raw\/.","access_type":"Data is publicly available and included in the reproducibility package.","uri":"https:\/\/www.minneapolisfed.org\/about-us\/monetary-policy\/inflation-calculator\/consumer-price-index-1913-","citation":"Federal Reserve Bank of Minneapolis. Consumer Price Index, 1913\u2013presen. https:\/\/www.minneapolisfed.org\/about-us\/monetary-policy\/inflation-calculator\/consumer-price-index-1913-. Accessed February 2020 and May 2024.","license_uri":"https:\/\/www.minneapolisfed.org\/site-information\/disclaimer"},{"name":"HEGIS Subject Code Lookup Tables","note":"Author-constructed lookup tables mapping HEGIS subject codes (ITEMNUM) to field names, covering three distinct ranges of survey years reflecting changes in HEGIS's subject coding taxonomy over time. Used to label degree records before HEGIS switched to CIP codes in 1983. Three files: (1) lookupfields_65_67.dta (1965\u20131967); (2) lookupfields_68.dta (1968); (3) lookupfields_70_81.dta (1970\u20131981). File location: Data\/Raw\/HEGIS_Degrees\/.","access_type":"Data is publicly available and included in the reproducibility package.","citation":"Authors' compilation. \"HEGIS Subject Code Lookup Tables\" [dataset]. Based on HEGIS subject coding taxonomy documentation."},{"name":"State Identifier Crosswalk Files","note":"Author-constructed crosswalk files mapping state identifier codes to state names, used to standardise geographic identifiers across HEGIS survey years. Two files: (1) states.dta \u2014 maps the OESTATE variable used in 1965\u20131967; (2) states2.dta \u2014 maps the GEOGCODE variable used from 1973 onward. File location: Data\/Raw\/HEGIS_Degrees\/.","access_type":"Data is publicly available and included in the reproducibility package.","citation":"Authors' compilation. \"State Identifier Crosswalk Files\" [dataset]. Based on HEGIS geographic identifier documentation."},{"name":"CIP 2000 to Broad Major Group Crosswalk","note":"Author-constructed crosswalk mapping 2-digit CIP codes (2000 taxonomy) to seven broad major group categories used in the analysis: Arts and Architecture, Business and Communications, Education, Engineering\/Math\/Sciences, Health, Social Science and Humanities, and Vocational\/Other. Used throughout IPEDS completions processing (1987\u20132020). File location: Data\/Raw\/2000aligncip.dta.","access_type":"Data is publicly available and included in the reproducibility package.","citation":"Authors' compilation. \"CIP 2000 to Broad Major Group Crosswalk\" [dataset]. Based on the NCES Classification of Instructional Programs (CIP) 2000 taxonomy."}],"technology_requirements":"Run time: ~ 50 minutes.","technology_environment":"Paper exhibits were reproduced on a computer with the following specifications:\n\u2022 OS: Windows 11 Enterprise\n\u2022 Processor: INTEL(R) XEON(R) PLATINUM 8562Y+ (2.80 GHz) (2 processors)\n\u2022 Memory available: 32 GB","reproduction_instructions":"To reproduce the findings in this paper, a replicator must:\n1. **Secure Access to Data:** Access the datasets not included in the package. See the Datasets section for more details.\n2. **Run the Package:**\n  - Update the working directory in line 9 of the do-file `main`, and run it.\n\nSince all the data is not included, the package includes the results produced by replicators. These files can be used to review the results presented in the paper."},"datacite":{"creators":[{"givenName":"Emily","familyName":"Cook","nameType":"Personal","affiliation":[{"name":"Texas A&M University","affiliationIdentifierScheme":"ROR","schemeUri":"https:\/\/ror.org","affiliationIdentifier":"https:\/\/ror.org\/01f5ytq51"}]},{"givenName":"Devaki","familyName":"Ghose","nameType":"Personal","affiliation":[{"name":"World Bank","affiliationIdentifier":"https:\/\/ror.org\/00ae7jd04","affiliationIdentifierScheme":"ROR","schemeUri":"https:\/\/ror.org"}]},{"givenName":"Ekaterina","familyName":"Khmelnitskaya","nameType":"Personal","affiliation":[{"name":"University of British Columbia, Sauder School of Business","affiliationIdentifierScheme":"ROR","schemeUri":"https:\/\/ror.org","affiliationIdentifier":"https:\/\/ror.org\/03rmrcq20"}]}],"titles":[{"lang":"en","title":"Reproducibility package for Federal Research Funding And STEM Education"},{"title":"RR_USA_2026_618","titleType":"Other"}],"publisher":"World Bank","publicationYear":"2026","types":{"resourceType":"Reproducibility package","resourceTypeGeneral":"Other"},"url":"https:\/\/reproducibility.worldbank.org\/index.php\/catalog\/study\/RR_USA_2026_618","language":"en"},"tags":[{"tag":"DOI"},{"tag":"Limited-access Data"},{"tag":"Open Code"}],"schematype":"script"}