Reproducible Research Repository
Reproducible Research Repository
  • Home
  • Repository
  • Collections
  • About
    Home / Repository / JA / PP_WLD_2025_516
ja

Reproducibility package for Technology Sophistication Across Establishments

2026
Get Reproducibility Package
Reference ID
PP_WLD_2025_516
Author(s)
Marcio Cruz, Xavier Cirera, Diego Comin
Collections
Journal articles
Metadata
JSON
Created on
Feb 09, 2026
Last modified
Feb 23, 2026
  • Project Description
  • Downloads
  • Overview
  • Reproducibility Package
  • Description
  • Scope and coverage
  • Disclaimer
  • Access and rights
  • Contacts
  • Information on metadata
  • Overview

    Abstract

    We study technology sophistication using a novel approach that measures the sophistication of the most advanced (MAX) and the most widely used (MOST) technologies in each of the key business functions within establishments. Using data from over 21,000 establishments across 15 countries, we find that establishments generally underutilize the most sophisticated technologies available within a business function. These MAX-MOST gaps are persistent and strongly associated with productivity both across establishments and countries. At the establishment level, there is substantial variation in both MAX and MOST, with MOST showing a more skewed distribution. MAX and MOST follow different lifecycle patterns in low-income countries and among small establishments, and they exhibit different associations with several establishment characteristics and performance indicators. This evidence underscores the different nature of the technology upgrading processes that drive MAX and MOST.

    Reproducibility Package

    Scripts
    Readme Get Reproducibility Package
    Link: https://reproducibility.worldbank.org/catalog/463/download/1322/README.pdf
    Reproducibility package for Technology Sophistication Across Establishments
    File name
    PP_WLD_2025_516
    Zip package
    PP_WLD_2025_516.zip
    Title
    Reproducibility package for Technology Sophistication Across Establishments
    Date
    2026-01
    Dependencies
    Stata dependencies are listed in the ado folder.
    Instructions
    See README in reproducibility package.
    Notes
    Computational reproducibility verified by Development Impact (DECDI) Analytics team, World Bank.
    Source code repository
    Repository name URI
    Reproducible Research Repository (World Bank) https://reproducibility.worldbank.org
    Software
    Stata
    Name
    Stata
    Version
    18.5 MP

    Reproducibility

    Technology environment

    Paper exhibits were reproduced on a computer with the following specifications:
    • OS: Windows 11 Enterprise
    • Processor: INTEL(R) XEON(R) PLATINUM 8562Y+ 2.80 GHz (2 processors)
    • Memory available: 32.0 GB

    Technology requirements

    Run time: ~ 30 minutes

    Reproduction instructions

    Replication of this package was conducted in part through virtual verification due to data access restrictions. To reproduce the findings in this paper, a replicator must:

    1. Secure Access to Data: Access the datasets not included in the package. See subsection Datasets for more details.
    2. Download and Place Data: Once the data is accessed, users should place it in the appropriate folder.
    3. Run the Package: After placing the data in the folder, run the files in the order:
      • Update the global in line 10 of the do-file "0_master" to your folder's location and run the do-file.

    Since not all underlying data are included in the package, it contains the outputs generated by the replicators. These files allow users to review and compare the results presented in the paper. A subset of exhibits was verified via virtual verification; these are identified in the reproducibility report, and the corresponding verification outputs are included in the folder 3_output/Virtual verification.

    Data

    Datasets
    Firm-level Adoption of Technology (FAT) Dataset
    Name
    Firm-level Adoption of Technology (FAT) Dataset
    Note
    This entry covers the datasets underlying the Firm-level Adoption of Technology (FAT) project, including the multi-country firm-level survey data, the technology sophistication grid rankings and validation data, relative productivity measures used for Q-cardinalization, and ISIC sectoral classifications. The FAT survey is a multi-country, multi-sector representative firm-level survey covering 15 countries and collecting information on technologies used across key business functions. The survey files are located in 1_data/ and include FAT0_raw_data_qje.dta and mgt_z.dta. The technology sophistication grid data consist of expert rankings and validation exercises conducted using ChatGPT. These files are located in 1_data/Figures_A12_A16/ and include sector- and function-level CSV files for agriculture, apparel, food processing, retail, marketing, production planning, quality control, sales, sourcing, and related business functions. The relative productivity dataset, located in 1_data/ as relative_q_productivities.xlsx, contains author-generated productivity estimates used to construct Q-cardinalization indices, based on peer-reviewed academic research, structured expert interviews, industry studies, and commercial sources, following the methodology described in the appendix. The ISIC classification data, located in 1_data/ as isic_data.dta, provide sectoral classifications derived from establishment-level product descriptions in the FAT dataset.
    Access policy
    Data is publicly available but does not allow redistribution and it is not included in the reproducibility package.
    License URL
    https://microdata.worldbank.org/terms-of-use
    Data URL
    https://microdata.worldbank.org/catalog/8209
    Citation
    Cirera, Xavier, Comin, Diego, and Cruz, Marcio (Forthcoming). Technology Sophistication Across Establishments [Dataset]. The Quarterly Journal of Economics.
    Firm-level Adoption of Technology (FAT) - Poland Dataset
    Name
    Firm-level Adoption of Technology (FAT) - Poland Dataset
    Note
    The FAT dataset for Poland is hosted in the National Statistics Office. Some variables (see data dictionary) are only accessible in this center. Access needs to be requested from Statistics Poland https://stat.gov.pl/en/ File location: 1_data/ Files: TAS_naukowiec.dta
    Access policy
    Data access was granted directly to the study authors by the data owners/managers. It was obtained with a custom data license that does not allow for redistribution and it is not included in the reproducibility package.
    Citation
    Cirera, Xavier, Comin, Diego, and Cruz, Marcio (Forthcoming). Technology Sophistication Across Establishments - FAT Poland [Dataset]. The Quarterly Journal of Economics.
    World Development Indicators
    Name
    World Development Indicators
    Note
    File location: 1_data/ Files: country_stats_2080.xlsx
    Access policy
    Data is publicly available but does not allow redistribution and it is not included in the reproducibility package.
    License URL
    https://microdata.worldbank.org/terms-of-use
    Data URL
    https://microdata.worldbank.org/catalog/8209
    Citation
    World Bank. 2024. "World Development Indicators" [Dataset]. Indicator accessed: Log of GDP per capita, PPP. Accessed 2025. https://databank.worldbank.org/source/world-development-indicators
    Brazil Relação Anual de Informações Sociais (RAIS)
    Name
    Brazil Relação Anual de Informações Sociais (RAIS)
    Note
    These datasets were compiled by the authors using two data sources: Brazil Relação Anual de Informações Sociais (RAIS) 2017 and 2018 and Firm-level Adoption of Technology (FAT) dataset for Brazil. The datasets are confidential, as they contain identifiable firm-level information (e.g., tax identifiers). Access to the underlying RAIS microdata must be requested from the Ministério do Trabalho e Emprego (https://www.gov.br/pt-br/servicos/solicitar-vinculos-empregaticios-da-rais). See the README for additional details. File location: Inputs/ Files: RAIS_firm_level_2017.dta; RAIS_firm_level_2018.dta; FAT0_brazil.dta; FAT_treated.dta; Final_merged_RAIS_2017.dta
    Access policy
    Data access was granted directly to the study authors by the data owners/managers. It was obtained with a custom data license that does not allow for redistribution and it is not included in the reproducibility package.
    Citation
    Cirera, Xavier, Comin, Diego, and Cruz, Marcio (Forthcoming). Technology Sophistication Across Establishments - Brazil (RAIS-linked confidential data) [Dataset]. The Quarterly Journal of Economics.
    Firm-level Adoption of Technology (FAT) Dataset - Interviewer Data
    Name
    Firm-level Adoption of Technology (FAT) Dataset - Interviewer Data
    Note
    These datasets include confidential information on interviewers from the Firm-level Adoption of Technology (FAT) dataset, as well as information from the original sampling frames provided by the National Statistical Offices of Brazil, Senegal, and Vietnam. See the README for more details. File location: 1_data Files: replication_C28-C29.dta; replication_tableC25-C27_brazil.dta; replication_tableC25-C27_senegal.dta; replication_tableC25-C27_vietnam.dta
    Access policy
    Data access was granted directly to the study authors by the data owners/managers. It was obtained with a custom data license that does not allow for redistribution and it is not included in the reproducibility package.
    Citation
    Cirera, Xavier, Comin, Diego, and Cruz, Marcio (Forthcoming). Technology Sophistication Across Establishments - Interviewer Data [Dataset]. The Quarterly Journal of Economics.
    Data statement

    Some data is restricted and has not been included in the reproducibility package. For more details, refer to the README file.

    Description

    Output
    Technology Sophistication Across Establishments
    Type
    Journal Article
    Title
    Technology Sophistication Across Establishments
    Description
    Journal Articles
    Authors
    Author Affiliation Email
    Marcio Cruz IFC marciocruz@ifc.org
    Xavier Cirera World Bank xcirera@worldbank.org
    Diego Comin Dartmouth College diego.comin@dartmouth.edu
    Date of production

    2026-01-23

    Scope and coverage

    Geographic locations
    Location Code
    World WLD
    Keywords
    Technology Firms Business Functions
    Topics
    ID Topic Parent topic ID Vocabulary Vocabulary URI
    D22 Firm Behavior: Empirical Analysis D2 Journal of Economic Literature (JEL)
    O14 Industrialization • Manufacturing and Service Industries • Choice of Technology O1 Journal of Economic Literature (JEL)
    O33 Technological Change: Choices and Consequences; Diffusion Processes O3 Journal of Economic Literature (JEL)

    Disclaimer

    Disclaimer

    The materials in the reproducibility packages are distributed as they were prepared by the staff of the International Bank for Reconstruction and Development/The World Bank. The findings, interpretations, and conclusions expressed in this event do not necessarily reflect the views of the World Bank, the Executive Directors of the World Bank, or the governments they represent. The World Bank does not guarantee the accuracy of the materials included in the reproducibility package.

    Access and rights

    License
    Name URI
    Modified BSD3 https://opensource.org/license/bsd-3-clause/

    Contacts

    Contacts
    Name Affiliation Email
    Marcio Cruz IFC marciocruz@ifc.org
    Reproducibility WBG World Bank reproducibility@worldbank.org

    Information on metadata

    Producers
    Name Abbreviation Affiliation Role
    Reproducibility WBG DECDI World Bank - Development Impact Department Verification and preparation of metadata
    Date of Production

    2026-01-23

    Document version

    1

    Back to Catalog
    The World Bank Working for a World Free of Poverty
    • IBRD IDA IFC MIGA ICSID

    © The World Bank Group, All Rights Reserved.