{"type":"script","doc_desc":{"producers":[{"name":"Reproducibility WBG","abbr":"DIME","affiliation":"World Bank - Development Impact Department","role":"Verification and preparation of metadata"}],"prod_date":"2025-01-27","version":"1"},"project_desc":{"authoring_entity":[{"name":"Guillermo Cruces","affiliation":"University of Nottingham & CONICET-CEDLAS-UNLP","email":"guillermo.cruces@nottingham.ac.uk"},{"name":"Gonzalo Vazquez-Bare","affiliation":"UC Santa Barbara","email":"gvazquez@econ.ucsb.edu"},{"name":"Dario Tortarolo","affiliation":"World Bank","email":"dtortarolo@worldbank.org"}],"title_statement":{"title":"Reproducibility package for Design of Partial Population Experiments with an Application to Spillovers in Tax Compliance","idno":"RR_ARG_2025_258"},"data_statement":"Some data is restricted and has not been included in the reproducibility package. For more details, please refer to the README file.","software":[{"name":"R","version":"4.4.2"},{"name":"Stata","version":"17 MP"}],"scripts":[{"title":"Reproducibility package for Design of Partial Population Experiments with an Application to Spillovers in Tax Compliance","date":"2025-01","notes":"Computational reproducibility verified by Development Impact (DIME) Analytics team, World Bank.","notes.1":"Computational reproducibility verified by Development Impact (DIME) Analytics team, World Bank.","dependencies":"Stata dependencies are listed in the ado folder.\nR dependencies are listed in the file renv.lock.","file_name":"RR_ARG_2025_258","zip_package":"RR_ARG_2025_258.zip"}],"repository_uri":[{"name":"Reproducible Research Repository (World Bank)","uri":"https:\/\/reproducibility.worldbank.org"}],"production_date":"2025-01-27","abstract":"We develop a framework to analyze partial population experiments, a generalization of the cluster experimental design where clusters are assigned to different treatment intensities. Our framework al\u0002lows for heterogeneity in cluster sizes and outcome distributions. We study the large-sample behavior of OLS estimators and cluster-robust variance estimators and show that (i) ignoring cluster hetero\u0002geneity may result in severely underpowered experiments and (ii) the cluster-robust variance estimator may be upward-biased when clusters are heterogeneous. We derive formulas for power, minimum detectable effects, and optimal cluster assignment probabilities. All our results apply to cluster experi\u0002ments, a particular case of our framework. We set up a potential outcomes framework to interpret the OLS estimands as causal effects. We implement our methods in a large-scale experiment to estimate the direct and spillover effects of a communication campaign on property tax compliance. We find an increase in tax compliance among individuals directly targeted with our mailing, as well as compliance spillovers on untreated individuals in clusters with a high proportion of treated taxpayers.\n","geographic_units":[{"name":"Argentina","code":"ARG","type":"Country"}],"keywords":[{"name":"Partial Population Experiments"},{"name":"Spillovers"},{"name":"Randomized Control Trials"},{"name":"Cluster Experiments"},{"name":"Two-stage Designs"},{"name":"Property Tax"},{"name":"Tax Compliance"}],"topics":[{"id":"C01","uri":"https:\/\/www.aeaweb.org\/econlit\/jelCodes.php?view=jel","vocabulary":"Journal of Economic Literature (JEL)","name":"Econometrics","parent_id":"C0"},{"id":"C93","uri":"https:\/\/www.aeaweb.org\/econlit\/jelCodes.php?view=jel","vocabulary":"Journal of Economic Literature (JEL)","name":"Field Experiments","parent_id":"C9"},{"id":"H71","uri":"https:\/\/www.aeaweb.org\/econlit\/jelCodes.php?view=jel","vocabulary":"Journal of Economic Literature (JEL)","name":"State and Local Taxation, Subsidies, and Revenue","parent_id":"H7"},{"id":"H26","uri":"https:\/\/www.aeaweb.org\/econlit\/jelCodes.php?view=jel","vocabulary":"Journal of Economic Literature (JEL)","name":"Tax Evasion and Avoidance","parent_id":"H2"},{"id":"H21","uri":"https:\/\/www.aeaweb.org\/econlit\/jelCodes.php?view=jel","vocabulary":"Journal of Economic Literature (JEL)","name":"Efficiency \u2022 Optimal Taxation","parent_id":"H2"},{"id":"O23","uri":"https:\/\/www.aeaweb.org\/econlit\/jelCodes.php?view=jel","vocabulary":"Journal of Economic Literature (JEL)","name":"Fiscal and Monetary Policy in Development","parent_id":"O2"}],"output":[{"type":"Working Paper","description":"Policy Research Working Papers (PRWP) WPS11059","title":"Design of Partial Population Experiments with an Application to Spillovers in Tax Compliance","authors":"Guillermo Cruces, Dario Tortarolo, Gonzalo Vazquez-Bare","uri":"http:\/\/documents.worldbank.org\/curated\/en\/099627502062533661","doi":"https:\/\/doi.org\/10.1596\/1813-9450-11059"}],"language":[{"name":"English","code":"EN"}],"technology_requirements":"The code takes approximately 75 minutes to run.","disclaimer":"The materials in the reproducibility packages are distributed as they were prepared by the staff of the International Bank for Reconstruction and Development\/The World Bank. The findings, interpretations, and conclusions expressed in this event do not necessarily reflect the views of the World Bank, the Executive Directors of the World Bank, or the governments they represent. The World Bank does not guarantee the accuracy of the materials included in the reproducibility package.","license":[{"name":"Modified BSD3","uri":"https:\/\/opensource.org\/license\/bsd-3-clause\/"}],"contacts":[{"name":"Guillermo Cruces","affiliation":"U. of Nottingham & CONICET-CEDLAS-UNLP","email":"guillermo.cruces@nottingham.ac.uk"},{"name":"Reproducibility WBG","affiliation":"World Bank","email":"reproducibility@worldbank.org"}],"datasets":[{"name":"Tax Microdata from the Tres de Febrero Municipality, Argentina","note":"Source:  Finance department, Tres de Febrero Municipality, Argentina. \nDatasets: Base 1 Cuenta Corriente por Cuenta 2019.dta, data_baseline.dta , randomization_final.dta, Base total con datos imprenta.dta, BOLETAS DIGITALES (3).xlsx, suscripciones_20210308.xlsx, Base 1 Cuenta Corriente por Cuenta al 10Mar.dta, cuadras_buffers.dta.\nTo access the data, please contact the following email Jcurrao@tresdefebrero.gov.ar or finanzas@tresdefebrero.gov.ar. \nSee more details on the README file. ","access_type":"Data is confidential and not included in the package. "},{"note":"Source: Bruno Cr\u00e9pon, Esther Duflo, Marc Gurgand, Roland Rathelot, Philippe Zamora, Do Labor Market Policies have Displacement Effects? Evidence from a Clustered Randomized Experiment , The Quarterly Journal of Economics, Volume 128, Issue 2, May 2013, Pages 531\u2013580, https:\/\/doi.org\/10.1093\/qje\/qjt001. \nThe data was provided directly by the authors of the paper to the authors of this reproducibility package. To get the original data please contact eduflo@mit.edu. For more information regarding these data, replicators can contact Dario Tortarolo (dtortarolo@worldbank.org). \nDatasets should be placed in the corresponding folder in the Bases\\Data_other folder.","name":"Replication data for Do Labor Market Policies have Displacement Effects? Evidence from a Clustered Randomized Experiment","access_type":"Data is confidential and not included in the package. "},{"uri":"https:\/\/www.openicpsr.org\/openicpsr\/project\/113595\/version\/V1\/view","name":"Replication data for Together We Will: Experimental Evidence on Female Voting Behavior in Pakistan","note":"Source: Gin\u00e9, Xavier, and Ghazala Mansuri. \u201cTogether We Will: Experimental Evidence on Female Voting Behavior in Pakistan.\u201d American Economic Journal: Applied Economics 10, no. 1 (January 2018): 207\u201335. https:\/\/doi.org\/10.1257\/app.20130480.\nDatasets should be placed in the corresponding folder in the Bases\\Data_other folder.","license":"Creative Commons Attribution 4.0 International Public License","license_uri":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/","access_type":"The data is publicly available but not included in the package. Data can be downloaded in the Data URL."},{"note":"Source: Ichino, N. and Sch\u00fcndeln, M., 2012. Deterring or displacing electoral irregularities? Spillover effects of observers in a randomized field experiment in Ghana. The Journal of Politics, 74(1), pp.292-307. https:\/\/doi.org\/10.1017\/S0022381611001368. \nDatasets should be placed in the corresponding folder in the Bases\\Data_other folder.","name":"Replication data for Deterring or displacing electoral irregularities? Spillover effects of observers in a randomized field experiment in Ghana. ","access_type":"The data is publicly available but cannot be redistributed. Data can be downloaded in the Data URL.","uri":"https:\/\/dataverse.harvard.edu\/dataset.xhtml?persistentId=doi:10.7910\/DVN\/JRPXPK"},{"name":"Replication data for The Short-term Impact of Unconditional Cash Transfers to the Poor: Experimental Evidence from Kenya","note":"Source: Johannes Haushofer, Jeremy Shapiro, The Short-term Impact of Unconditional Cash Transfers to the Poor: Experimental Evidence from Kenya, The Quarterly Journal of Economics, Volume 131, Issue 4, November 2016, Pages 1973\u20132042, https:\/\/doi.org\/10.1093\/qje\/qjw025\nDatasets should be placed in the corresponding folder in the Bases\\Data_other folder.","access_type":"The data is publicly available but cannot be redistributed. Data can be downloaded in the Data URL.","uri":"https:\/\/johanneshaushofer.com\/research"},{"note":"Source: Imai, K., Jiang, Z., & Malani, A. (2020). Causal Inference With Interference and Noncompliance in Two-Stage Randomized Experiments. Journal of the American Statistical Association, 116(534), 632\u2013644. https:\/\/doi.org\/10.1080\/01621459.2020.1775612\nDatasets should be placed in the corresponding folder in the Bases\\Data_other folder.","name":"Replication data for Causal Inference With Interference and Noncompliance in Two-Stage Randomized Experiments.","access_type":"The data is publicly available but cannot be redistributed. Data can be downloaded in the Data URL.","uri":"https:\/\/dataverse.harvard.edu\/dataset.xhtml?persistentId=doi:10.7910\/DVN\/N7D9LS"}],"technology_environment":"\u2013 OS: Windows 10 Enterprise 22H2\n\u2013 Processor: Intel(R) Xeon(R) CPU E7- 4860 @ 2.27GHz 2.26 GHz (2 processors)\n\u2013 Memory available: 16 GB\n\u2013 Software version: Stata 17 MP Parallel Edition for Windows (64-bit x86-64), R version 4.4.2\n","reproduction_instructions":"1. *Secure Access to Data:* Some of the data required for replication is not publicly available.\n2. *Download and Place Data:* Once the data is obtained, users should download it and place it in the appropriate folder.\n3. *Run the Package:*\n    * Edit directory paths in the 0_master.do file.\n    * Edit the csv file path in the 1_Table_1.R script.\n\nSince not all the data is publicly available, the package includes the code outputs in the folder \"Results\", which contains the results produced by replicators. These files can be used to review the results presented in the paper."},"tags":[{"tag":"Code Access"},{"tag":"Data Access"},{"tag":"DOI"}],"schematype":"script"}