Reproducible Research Repository
Reproducible Research Repository
  • Home
  • Repository
  • Collections
  • About
    Home / Repository / PRWP / RR_WLD_2025_452
PRWP

Reproducibility package for What's In A Name? Implications Of Extensive Margin Measurement In International Trade

2025
Get Reproducibility Package
Reference ID
RR_WLD_2025_452
DOI
https://doi.org/10.60572/k62d-xv91
Author(s)
Ana Fernandes, Devaki Ghose, Alejandro Forero, Piyush Panigrahi
Collections
World Bank Policy Research Working Papers
Metadata
JSON
Created on
Feb 26, 2026
Last modified
Feb 26, 2026
  • Project Description
  • Downloads
  • Overview
  • Reproducibility Package
  • Description
  • Scope and coverage
  • Disclaimer
  • Access and rights
  • Contacts
  • Information on metadata
  • Citation
  • Overview

    Abstract

    Recent years have seen a sharp increase in the availability of micro data at the firm and firm-to-firm level in international trade. Data platform providers often use proprietary algorithms that match reported firm names to assign identifiers to companies engaged in trade. We show that identifiers in one such platform suffer from substantial mismeasurement—for example, the same firm may be assigned different identifiers across transactions. We propose an algorithm to clean firm names and generate more accurate firm identifiers. Using these, we compare key exporter and importer indicators, as well as firm-to-firm trade indicators, with those based on the platform’s identifiers. The resulting biases are stark: using platform IDs shrinks the measured population of exporters and importers, inflates their average size, and overstates the concentration of trade among a few firms. It also artificially inflates firm entry into and exit from international markets. These distortions extend to firm-to-firm trade networks, which appear spuriously denser, less concentrated among top sellers or buyers, and far more volatile. Our findings caution against the growing reliance on readily available proprietary firm identifiers in studies of firms’ trade responses to global shocks, particularly through changes in their buyer–supplier networks.

    Reproducibility Package

    Scripts
    Readme Get Reproducibility Package
    Link: https://reproducibility.worldbank.org/catalog/483/download/1388/README.pdf
    Reproducibility package for What's In A Name? Implications Of Extensive Margin Measurement In International Trade
    File name
    RR_WLD_2025_452
    Zip package
    RR_WLD_2025_452.zip
    Title
    Reproducibility package for What's In A Name? Implications Of Extensive Margin Measurement In International Trade
    Date
    2025-11
    Dependencies
    R dependencies are listed in the file renv.lock.
    Instructions
    See README in reproducibility package.
    Notes
    Computational reproducibility verified by Development Impact (DECDI) Analytics team, World Bank.
    Source code repository
    Repository name URI
    Reproducible Research Repository (World Bank) https://reproducibility.worldbank.org
    Software
    R
    Name
    R
    Version
    4.4.1

    Reproducibility

    Technology environment

    Paper exhibits were reproduced on a computer with the following specifications:
    • OS: Windows 11 Enterprise
    • Processor: Intel(R) Core(TM) i5-1145G7 CPU @ 2.60GHz
    • Memory available: 15.7 GB

    Technology requirements

    Runtime: 10 hours.

    Reproduction instructions

    To reproduce the findings in this paper, a new user should:

    1. Obtain access to the restricted data and place it in the appropriate folder as indicated in the README.
    2. Open the project file (.Rproj).
    3. Restore the environment using renv::restore() or manually install the required packages.
    4. Open !master.R.
    5. Run the code.

    Because some of the data is restricted and not included in the reproducibility package, the results produced by the replicators are provided in the Outputs folder. Interested users can compare these results against those included in the reproducibility package.

    The reproducibility package begins from intermediate datasets directly provided by the authors.
    The full process used to generate these intermediate datasets from the raw data is not included in the replication workflow and was not executed by the replicators.
    For transparency purposes, the authors have included sample code demonstrating how to construct the intermediate dataset for one country (India). This sample code is available in the folder: DataCreation/. The sample code is provided for documentation and reference only and is not required to run the reproducibility package.
    For additional information about the full data creation process, interested users may contact the corresponding author at: afernandes@worldbank.org

    Data

    Datasets
    S&P Panjiva Trade Data Platform
    Name
    S&P Panjiva Trade Data Platform
    Note
    Files location: data/raw/trade and data/raw/allidscorrespondence. Proprietary shipment-level export and import data were obtained from the S&P Panjiva trade data platform for seven countries: India (March 2024), Colombia (August 2024), Mexico (July and August 2024), Peru (January 2024), Sri Lanka (August 2024), Uruguay (December 2023), and Vietnam (November 2023). Data are proprietary and not available for public sharing. The authors obtained permission and legitimate access from S&P Panjiva. A list of all the datasets and countries is included in the reproducibility package file: data_hash_report.csv. For more information on this data, please get in touch with the author Ana M. Fernandes (afernandes@worldbank.org).
    Access policy
    Data is restricted and not included in the reproducibility package
    Data URL
    https://panjiva.com/
    Citation
    S&P Global. S&P Panjiva Trade Data Platform [dataset]. Shipment-level export and import data for India, Mexico, Peru, Sri Lanka, Uruguay, and Vietnam. Available under proprietary license at https://panjiva.com/.
    Exporter Dynamics Database (EDD) 3.0
    Name
    Exporter Dynamics Database (EDD) 3.0
    Note
    Files location: data/raw/support/consolidation_1996_2022.dta and data/raw/support/statistics_exporterimporters_2019.xlsx. Data update is forthcoming at the Development Data Hub. Contact: Ana M. Fernandes (afernandes@worldbank.org).
    Access policy
    Data will be publicly available and is included in the reproducibility package
    License
    Creative Commons Attribution 4.0 International (CC BY 4.0)
    License URL
    https://creativecommons.org/licenses/by/4.0/
    Data URL
    https://datacatalog.worldbank.org/search/dataset/0042326/Exporter-Dynamics-Database
    Citation
    World Bank. Exporter Dynamics Database (EDD) 3.0 [dataset]. Washington, D.C.: World Bank.
    Data statement

    Some data is restricted and has not been included in the reproducibility package. For more details, please refer to the README file.

    Description

    Output
    What's In A Name? Implications Of Extensive Margin Measurement In International Trade
    Type
    Working Paper
    Title
    What's In A Name? Implications Of Extensive Margin Measurement In International Trade
    Description
    Policy Research Working Papers (PRWP)
    Authors
    Author Affiliation Email
    Ana Fernandes World Bank afernandes@worldbank.org
    Devaki Ghose World Bank dghose@worldbank.org
    Alejandro Forero World Bank aforero@worldbank.org
    Piyush Panigrahi International Finance Corporation ppanigrahi@ifc.org
    Date of production

    2025-11-11

    Scope and coverage

    Geographic locations
    Location Code
    World WLD
    Keywords
    Firm-Level Trade Exporter Dynamics Importer Dynamics Extensive Margin Intensive Margin Firm-To-Firm Trade Data Mismeasurement Name Cleaning Or Entity Resolution Algorithm
    Topics
    ID Topic Parent topic ID Vocabulary Vocabulary URI
    F10 General F1 Journal of Economic Literature (JEL)
    F14 Empirical Studies of Trade F1 Journal of Economic Literature (JEL)

    Disclaimer

    Disclaimer

    The materials in the reproducibility packages are distributed as they were prepared by the staff of the International Bank for Reconstruction and Development/The World Bank. The findings, interpretations, and conclusions expressed in this event do not necessarily reflect the views of the World Bank, the Executive Directors of the World Bank, or the governments they represent. The World Bank does not guarantee the accuracy of the materials included in the reproducibility package.

    Access and rights

    License
    Name URI
    Modified BSD3 https://opensource.org/license/bsd-3-clause/

    Contacts

    Contacts
    Name Affiliation Email
    Ana Fernandes World Bank afernandes@worldbank.org
    Reproducibility WBG World Bank reproducibility@worldbank.org

    Information on metadata

    Producers
    Name Abbreviation Affiliation Role
    Reproducibility WBG DECDI World Bank - Development Impact Department Verification and preparation of metadata
    Date of Production

    2025-11-11

    Document version

    1

    Citation

    Citation
    loading, please wait...
    Citation format
    Export citation: RIS | BibTeX | Plain text
    Back to Catalog
    The World Bank Working for a World Free of Poverty
    • IBRD IDA IFC MIGA ICSID

    © The World Bank Group, All Rights Reserved.