{"type":"script","doc_desc":{"producers":[{"name":"Reproducibility WBG","abbr":"DIME","affiliation":"World Bank - Development Impact Department","role":"Verification and preparation of metadata"}],"prod_date":"2024-05","version":"1"},"project_desc":{"authoring_entity":[{"name":"Siddhesh Vishwanath Kaushik","role":"Senior Data Scientist","affiliation":"World Bank","email":"skaushik@worldbank.org"},{"name":"Sonja Mitikj","role":"Research Analyst","affiliation":"World Bank","email":"smitikj@worldbank.org"}],"output":[{"type":"Working paper","title":"Bridging the Gap in Trade Reporting: Insights from the Discrepancy Index","authors":"Sonja Mitikj, Siddhesh Kaushik ","description":"Policy Research Working Paper (PRWP)"}],"datasets":[{"name":"UNSD\u2019s Comtrade annual database","note":"Referenced as dianapublic.unsd_wits_annual. The data was accessed on 18-Oct-23. \nDetailed global bilateral trade data in the Standard International Trade Classification (SITC) and Harmonized System (HS) product classification.","uri":"comtradeplus.un.org","access_type":"The dataset is public but not included in the reproducibility package. The data access is free up to 100,000 rows per download."},{"name":"CEPII Gravity database (Conte et al., 2022) ","note":"Referenced as diana.cepii_gravity_202211. The data was accessed on 11-Nov-22.\nDifferent measures of bilateral distances.","uri":"cepii.fr","access_type":"The dataset is public but not included in the package."},{"name":"World Bank Statistical Performance Indicators","note":"Referenced as default.spi_index. The data was accessed on 7-Nov-23.\nStatistical Performance Indicators that assess the maturity and performance of national statistical systems.","uri":"https:\/\/github.com\/worldbank\/SPI","access_type":"The dataset is public but not included in the package."},{"name":"World Bank Development Indicator","note":"Referenced as diana.wdi_gdp. 
The data was accessed on 7-Nov-23.\nGDP per capita, PPP (constant 2017 international $).","uri":"https:\/\/datatopics.worldbank.org\/world-development-indicators\/","access_type":"The dataset is public but not included in the package."},{"name":"WCO Data","note":"Referenced as diana.productlist. The data was accessed on 3-Feb-24.\nThe product names and structure of the Harmonized System method of classifying traded products.","uri":"wcoomd.org","access_type":"The dataset is public but not included in the package."},{"name":"GNI per capita in current USD","note":"Referenced as diana.income_region_country_list. The data was accessed on 3-Feb-22. \nHistorical data for income classification, based on GNI per capita in current USD, using the Atlas method","uri":"datacatalog.worldbank.org","access_type":"The dataset is public but not included in the package."}],"title_statement":{"idno":"RR_WLD_2024_111","title":"Bridging the Gap in Trade Reporting: Insights from the Discrepancy Index"},"production_date":"2024-05","geographic_units":[{"name":"World","code":"WLD"}],"abstract":"Accurate trade data remains central for empirical investigations of international trade and informed formulation of trade policies. However, discrepancies in trade reporting, stemming from reasons such as logistics all the way to deliberate misclassification, pose challenges to obtaining an accurate representation of trade activities. This study provides a systematic examination of these discrepancies by using the Discrepancy Index (DI), a measure of bilateral asymmetry in trade reporting. First, we propose a rich set of country- and product-level indicators that capture both the frequency of misreporting and its impact on the overall recorded trade value. Second, we demonstrate how the discrepancy index database can aid analysis and resolve data reliability issues in international trade. 
Using this comprehensive dataset, we analyze the general trends in trade data reporting and its reliability, providing empirical insights into the nature and extent of reporting discrepancies. Finally, we demonstrate the practical application of the developed discrepancy database and aggregate indicators through case studies for Senegal, and the Madagascar\u2013France trade relationship, shedding light on reporter-specific instances. This paper seeks to equip trade analysts and researchers with tools and resources to make informed decisions concerning the use of reported trade data and its mirror. In doing so, this study contributes to the broader endeavor of enhancing the reliability of international trade data, thereby contributing to a more accurate empirical investigation of global trade patterns and their policy ramifications.","language":[{"name":"English","code":"EN"}],"data_statement":"The Discrepancy Index and the aggregate indicators were generated using the UN COMTRADE database (https:\/\/comtradeplus.un.org\/). The World Bank has access to this database, and we replicated the data in Databricks using the Bulk API. To obtain UN COMTRADE data, please visit https:\/\/shop.un.org\/databases to check how you can access the data. Please note that, based on UNSD terms, you may have to pay for a subscription to get access to this data. The output is available via the World Bank Data Catalog and the links are provided below. All databases are open and available under the Creative Commons Attribution 4.0 license.\nAll the other datasets are public and available for free. \nAll datasets are publicly available but not included in the package. 
","software":[{"name":"Python","version":"3.10.12"}],"scripts":[{"file_name":"RR_WLD_2024_1110v1.zip","zip_package":"RR_WLD_2024_1110v1.zip","title":"Reproducibility Package for Bridging the Gap in Trade Reporting: Insights from the Discrepancy Index","date":"2024-05","software":"Python","instructions":"See README in reproducibility package","notes":"Computational reproducibility verified by the Development Impact (DIME) Analytics team, World Bank."}],"technology_environment":"Follow the instructions in README.","technology_requirements":"1 hour runtime","reproduction_instructions":"- Databricks Runtime Version: 14.3 LTS (includes Apache Spark 3.5.0, Scala 2.12)\n- Worker Type: Standard_E4ds_v4 (32 GB Memory, 4 cores)","disclaimer":"The materials in the reproducibility packages are distributed as they were prepared by the staff of the International Bank for Reconstruction and Development\/the World Bank. The findings, interpretations, and conclusions expressed in this event do not necessarily reflect the views of the World Bank, the Executive Directors of the World Bank, or the governments they represent. The World Bank does not guarantee the accuracy of the materials included in the reproducibility package.","license":[{"name":"Modified BSD3","uri":"https:\/\/opensource.org\/license\/bsd-3-clause\/"}],"contacts":[{"name":"Siddhesh Vishwanath Kaushik","email":"skaushik@worldbank.org","affiliation":"World Bank"},{"name":"Reproducibility WBG","affiliation":"World Bank","email":"reproducibility@worldbank.org"}],"identifiers":[{"type":"DOI"}]},"tags":[{"tag":"DOI"}],"schematype":"script"}