{"type":"script","doc_desc":{"producers":[{"name":"Reproducibility WBG","abbr":"DECDI","affiliation":"World Bank - Development Impact Department","role":"Verification and preparation of metadata"}],"prod_date":"2026-04-21","version":"1"},"project_desc":{"authoring_entity":[{"name":"David Newhouse","affiliation":"World Bank ","email":"dnewhouse@worldbank.org"},{"name":"Hai-Anh Dang","affiliation":"World Bank","email":"hdang1@worldbank.org"},{"name":"Minh Do","affiliation":"World Bank","email":"minh.nn.do@gmail.com"},{"name":"Partha Lahiri","affiliation":"University of Maryland College Park","email":"plahiri@umd.edu"},{"name":"Melany Gualavisi","affiliation":"University of Illinois ","email":"melanyg2@illinois.edu"},{"name":"Talip Kilic","affiliation":"World Bank ","email":"tkilic@worldbank.org"},{"name":"Peter Lanjouw","affiliation":"Vrije University Amsterdam","email":"p.f.lanjouw@vu.nl"},{"name":"Roy Van der Weide","affiliation":"World Bank ","email":"rvanderweide@worldbank.org"}],"title_statement":{"title":"Reproducibility package for Evaluating Alternative Approaches To Small Area Estimation Of Poverty With Survey And Census Data","idno":"RR_LAC_2026_608"},"data_statement":"All data sources are publicly available but not included in the reproducibility package. Only the intermediate data is included in the package.","software":[{"name":"Stata","version":"19.5 MP"}],"scripts":[{"title":"Reproducibility package for Evaluating Alternative Approaches To Small Area Estimation Of Poverty With Survey And Census Data","date":"2026-04","notes":"Computational reproducibility verified by Development Impact (DECDI) Analytics team, World Bank.","instructions":"See README in reproducibility package.","file_name":"RR_LAC_2026_608","zip_package":"RR_LAC_2026_608.zip","dependencies":"Stata dependencies are listed in the ado folder."}],"repository_uri":[{"name":"Reproducible Research Repository (World Bank)","uri":"https:\/\/reproducibility.worldbank.org"}],"production_date":"2026-04-21","abstract":"This paper uses five rounds of Mexican and Brazilian census extracts to evaluate the accuracy of different model specifications and estimation methods that use survey and census data to generate small area estimates of poverty. Models that utilize more granular data for prediction (i.e., household- and\/or village-level predictors) tend to produce more accurate estimates of poverty than models estimated only using area-level predictors. Differences in accuracy across models and methods that utilize household or village level predictors are minor. Models that omit household-level predictors tend to be more robust than unit-level models to the use of old census data and classical measurement error in survey predictors. The performance of the Fay-Herriot area-level model falls in the presence of sample selection bias and small sample sizes. Rescaling sample weights is important in Mexico, where the sample is informative within areas. Applying raw sample weights without rescaling in this case greatly reduces the accuracy of estimates from linear models and distorts methodological comparisons. Overall, no one approach dominates across all contexts, but when sample weights are rescaled there is no downside to using more granular data for prediction.","geographic_units":[{"name":"Latin America","code":"LAC"}],"keywords":[{"name":"Small Area Estimation"},{"name":"Poverty Mapping"}],"topics":[{"id":"C51","uri":"https:\/\/www.aeaweb.org\/econlit\/jelCodes.php?view=jel","vocabulary":"Journal of Economic Literature (JEL)","name":"Model Construction and Estimation","parent_id":"C5"},{"id":" C52","uri":"https:\/\/www.aeaweb.org\/econlit\/jelCodes.php?view=jel","vocabulary":"Journal of Economic Literature (JEL)","name":"Model Evaluation, Validation, and Selection","parent_id":"C5"},{"id":" I32","uri":"https:\/\/www.aeaweb.org\/econlit\/jelCodes.php?view=jel","vocabulary":"Journal of Economic Literature (JEL)","name":"Measurement and Analysis of Poverty","parent_id":"I3"}],"output":[{"type":"Working Paper","description":"Policy Research Working Papers (PRWP)","title":"Evaluating Alternative Approaches To Small Area Estimation Of Poverty With Survey And Census Data"}],"language":[{"name":"English","code":"EN"}],"disclaimer":"The materials in the reproducibility packages are distributed as they were prepared by the staff of the International Bank for Reconstruction and Development\/The World Bank. The findings, interpretations, and conclusions expressed in this event do not necessarily reflect the views of the World Bank, the Executive Directors of the World Bank, or the governments they represent. The World Bank does not guarantee the accuracy of the materials included in the reproducibility package.","license":[{"name":"Modified BSD3","uri":"https:\/\/opensource.org\/license\/bsd-3-clause\/"}],"contacts":[{"name":"David Newhouse","affiliation":"World Bank","email":"dnewhouse@worldbank.org"},{"name":"Reproducibility WBG","affiliation":"World Bank","email":"reproducibility@worldbank.org"}],"datasets":[{"name":"Census of Population and Housing (CPV) 2010","note":"Source: INEGI (Instituto Nacional de Estad\u00edstica y Geograf\u00eda)","citation":"INEGI. 2010. \"Census of Population and Housing (CPV)\" [dataset]. https:\/\/en.www.inegi.org.mx\/programas\/ccpv\/2010\/#microdata. Accessed October, 2023.","uri":"https:\/\/en.www.inegi.org.mx\/programas\/ccpv\/2010\/#microdata","license_uri":"https:\/\/en.www.inegi.org.mx\/inegi\/terminos.html","access_type":"Data is publicly available but not included in the reproducibility package due to file size constraints."},{"name":"Intercensal Survey (EIC) 2015 ","uri":"https:\/\/en.www.inegi.org.mx\/programas\/intercensal\/2015\/","license_uri":"https:\/\/en.www.inegi.org.mx\/inegi\/terminos.html","access_type":"Data is publicly available but not included in the reproducibility package due to file size constraints.","note":"Source: INEGI (Instituto Nacional de Estad\u00edstica y Geograf\u00eda)","citation":"INEGI. 2015. \"Intercensal Survey (EIC) 2015\" [dataset]. https:\/\/en.www.inegi.org.mx\/programas\/intercensal\/2015\/"},{"name":"Census of Population and Housing (2020) ","note":"Source: INEGI (Instituto Nacional de Estad\u00edstica y Geograf\u00eda)","access_type":"Data is publicly available but not included in the reproducibility package due to file size constraints.","citation":"INEGI. 2020. \"Census of Population and Housing (2020)\" [dataset]. https:\/\/en.www.inegi.org.mx\/programas\/ccpv\/2020\/.","uri":"https:\/\/en.www.inegi.org.mx\/programas\/ccpv\/2020\/","license_uri":"https:\/\/en.www.inegi.org.mx\/inegi\/terminos.html"},{"name":"2000 Population Census ","citation":"IBGE. 2000. \"2000 Population Census\" [dataset]. https:\/\/www.ibge.gov.br\/en\/statistics\/social\/population\/18521-2000-population-census.html?edicao=18553.","note":"Source: IBGE (nstituto Brasileiro de Geografia e Estat\u00edstica)","license_uri":"https:\/\/www.planalto.gov.br\/ccivil_03\/Portaria\/P130-21-ccivil.htm#art5","uri":"https:\/\/www.ibge.gov.br\/en\/statistics\/social\/population\/18521-2000-population-census.html?edicao=18553","access_type":"Data is publicly available but not included in the reproducibility package due to file size constraints."},{"name":"2010 Population Census ","note":"Source: IBGE (nstituto Brasileiro de Geografia e Estat\u00edstica)","access_type":"Data is publicly available but not included in the reproducibility package due to file size constraints.","uri":"https:\/\/www.ibge.gov.br\/en\/statistics\/social\/health\/18391-2010-population-census.html.","license_uri":"https:\/\/www.planalto.gov.br\/ccivil_03\/Portaria\/P130-21-ccivil.htm#art5","citation":"IBGE. 2010. \"2010 Population Census\" [dataset]. https:\/\/www.ibge.gov.br\/en\/statistics\/social\/health\/18391-2010-population-census.html."}],"reproduction_instructions":"The package uses intermediate data. The code used to process the raw data into intermediate data is included in the `data construction` folder for transparency. However, we did not verify the code that generates the intermediate data, as it takes over a month to run due to the large size of the datasets. Instead, reviewers verified the outputs generated from the intermediate data included in the package.\nTo reproduce the exhibits in this paper, a new user should follow these steps:\n1. Update the working directory in the `master.do` file and run the code.\n2. All outputs will be generated in the Excel file `tables.xlsx`. Some figures are exported as values, and the graphs are created manually in the Excel file.\n\nPlease note while the data construction code is included in the package, users will only be able to run it if they obtain access to the raw data. See the Datasets section for more details.\n","technology_requirements":"Runtime: 6 minutes","technology_environment":"Paper exhibits were reproduced on a computer with the following specifications:\n\u2022 OS: Windows 11 Enterprise\n\u2022 Processor: Intel(R) Xeon(R) Gold 5218 CPU @ 2.30GHz, 2300 Mhz, 4 Core(s), 4 Logical Processor(s)\n\u2022 Memory available: 8.15 GB\n\u2022 Software version: Stata 19.5 MP"},"datacite":{"creators":[{"givenName":"David","familyName":"Newhouse","nameType":"Personal","affiliation":[{"name":"World Bank Group"}]},{"givenName":"Hai-Anh","familyName":"Dang","nameType":"Personal","affiliation":[{"name":"World Bank"}]},{"givenName":"Minh","familyName":"Do","nameType":"Personal","affiliation":[{"name":"World Bank"}]},{"givenName":"Partha","familyName":"Lahiri","nameType":"Personal","affiliation":[{"name":"University of Maryland College Park"}]},{"givenName":"Melany","familyName":"Gualavisi","nameType":"Personal","affiliation":[{"name":"Amazon.com"}]},{"givenName":"Talip","familyName":"Kilic","nameType":"Personal","affiliation":[{"name":"World Bank Group"}]},{"givenName":"Peter","familyName":"Lanjouw","nameType":"Personal","affiliation":[{"name":"Vrije University Amsterdam"}]},{"givenName":"Roy Van der","familyName":"Weide","nameType":"Personal","affiliation":[{"name":"World Bank Group"}]}],"titles":[{"lang":"en","title":"Reproducibility package for Evaluating Alternative Approaches To Small Area Estimation Of Poverty With Survey And Census Data"},{"title":"RR_LAC_2026_608","titleType":"Other"}],"publisher":"World Bank","publicationYear":"2026","types":{"resourceType":"Reproducibility package","resourceTypeGeneral":"Other"},"url":"https:\/\/reproducibility.worldbank.org\/index.php\/catalog\/study\/RR_LAC_2026_608","language":"en"},"tags":[{"tag":"DOI"},{"tag":"Open Code"},{"tag":"Open Data"}],"schematype":"script"}