This paper uses five rounds of Mexican and Brazilian census extracts to evaluate the accuracy of different model specifications and estimation methods that use survey and census data to generate small area estimates of poverty. Models that utilize more granular data for prediction (i.e., household- and/or village-level predictors) tend to produce more accurate estimates of poverty than models estimated only using area-level predictors. Differences in accuracy across models and methods that utilize household or village level predictors are minor. Models that omit household-level predictors tend to be more robust than unit-level models to the use of old census data and classical measurement error in survey predictors. The performance of the Fay-Herriot area-level model falls in the presence of sample selection bias and small sample sizes. Rescaling sample weights is important in Mexico, where the sample is informative within areas. Applying raw sample weights without rescaling in this case greatly reduces the accuracy of estimates from linear models and distorts methodological comparisons. Overall, no one approach dominates across all contexts, but when sample weights are rescaled there is no downside to using more granular data for prediction.
| Repository name | URI |
|---|---|
| Reproducible Research Repository (World Bank) | https://reproducibility.worldbank.org |
Paper exhibits were reproduced on a computer with the following specifications:
• OS: Windows 11 Enterprise
• Processor: Intel(R) Xeon(R) Gold 5218 CPU @ 2.30GHz, 2300 Mhz, 4 Core(s), 4 Logical Processor(s)
• Memory available: 8.15 GB
• Software version: Stata 19.5 MP
Runtime: 6 minutes
The package uses intermediate data. The code used to process the raw data into intermediate data is included in the data construction folder for transparency. However, we did not verify the code that generates the intermediate data, as it takes over a month to run due to the large size of the datasets. Instead, reviewers verified the outputs generated from the intermediate data included in the package.
To reproduce the exhibits in this paper, a new user should follow these steps:
master.do file and run the code.tables.xlsx. Some figures are exported as values, and the graphs are created manually in the Excel file.Please note while the data construction code is included in the package, users will only be able to run it if they obtain access to the raw data. See the Datasets section for more details.
All data sources are publicly available but not included in the reproducibility package. Only the intermediate data is included in the package.
| Author | Affiliation | |
|---|---|---|
| David Newhouse | World Bank | dnewhouse@worldbank.org |
| Hai-Anh Dang | World Bank | hdang1@worldbank.org |
| Minh Do | World Bank | minh.nn.do@gmail.com |
| Partha Lahiri | University of Maryland College Park | plahiri@umd.edu |
| Melany Gualavisi | University of Illinois | melanyg2@illinois.edu |
| Talip Kilic | World Bank | tkilic@worldbank.org |
| Peter Lanjouw | Vrije University Amsterdam | p.f.lanjouw@vu.nl |
| Roy Van der Weide | World Bank | rvanderweide@worldbank.org |
2026-04-21
| Location | Code |
|---|---|
| Latin America | LAC |
The materials in the reproducibility packages are distributed as they were prepared by the staff of the International Bank for Reconstruction and Development/The World Bank. The findings, interpretations, and conclusions expressed in this event do not necessarily reflect the views of the World Bank, the Executive Directors of the World Bank, or the governments they represent. The World Bank does not guarantee the accuracy of the materials included in the reproducibility package.
| Name | URI |
|---|---|
| Modified BSD3 | https://opensource.org/license/bsd-3-clause/ |
| Name | Affiliation | |
|---|---|---|
| David Newhouse | World Bank | dnewhouse@worldbank.org |
| Reproducibility WBG | World Bank | reproducibility@worldbank.org |
| Name | Abbreviation | Affiliation | Role |
|---|---|---|---|
| Reproducibility WBG | DECDI | World Bank - Development Impact Department | Verification and preparation of metadata |
2026-04-21
1