Reproducible Research Repository
PRWP

Reproducibility package for Evaluating Alternative Approaches To Small Area Estimation Of Poverty With Survey And Census Data

2026
Reference ID
RR_LAC_2026_608
DOI
https://doi.org/10.60572/24rv-b560
Author(s)
David Newhouse, Hai-Anh Dang, Minh Do, Partha Lahiri, Melany Gualavisi, Talip Kilic, Peter Lanjouw, Roy Van der Weide
Collections
World Bank Policy Research Working Papers
Created on
Apr 21, 2026
Last modified
May 07, 2026
    Overview

    Abstract

    This paper uses five rounds of Mexican and Brazilian census extracts to evaluate the accuracy of different model specifications and estimation methods that use survey and census data to generate small area estimates of poverty. Models that utilize more granular data for prediction (i.e., household- and/or village-level predictors) tend to produce more accurate estimates of poverty than models estimated only using area-level predictors. Differences in accuracy across models and methods that utilize household- or village-level predictors are minor. Models that omit household-level predictors tend to be more robust than unit-level models to the use of old census data and classical measurement error in survey predictors. The performance of the Fay-Herriot area-level model falls in the presence of sample selection bias and small sample sizes. Rescaling sample weights is important in Mexico, where the sample is informative within areas. Applying raw sample weights without rescaling in this case greatly reduces the accuracy of estimates from linear models and distorts methodological comparisons. Overall, no one approach dominates across all contexts, but when sample weights are rescaled there is no downside to using more granular data for prediction.

    Reproducibility Package

    Scripts
    Readme
    Link: https://reproducibility.worldbank.org/catalog/538/download/1589/README.pdf
    Reproducibility package for Evaluating Alternative Approaches To Small Area Estimation Of Poverty With Survey And Census Data
    File name
    RR_LAC_2026_608
    Zip package
    RR_LAC_2026_608.zip
    Title
    Reproducibility package for Evaluating Alternative Approaches To Small Area Estimation Of Poverty With Survey And Census Data
    Date
    2026-04
    Dependencies
    Stata dependencies are listed in the ado folder.
    Instructions
    See README in reproducibility package.
    Notes
    Computational reproducibility verified by Development Impact (DECDI) Analytics team, World Bank.
    Source code repository
    Repository name                                URI
    Reproducible Research Repository (World Bank)  https://reproducibility.worldbank.org
    Software
    Stata
    Name
    Stata
    Version
    19.5 MP

    Reproducibility

    Technology environment

    Paper exhibits were reproduced on a computer with the following specifications:
    • OS: Windows 11 Enterprise
    • Processor: Intel(R) Xeon(R) Gold 5218 CPU @ 2.30GHz, 2300 MHz, 4 Core(s), 4 Logical Processor(s)
    • Memory available: 8.15 GB
    • Software version: Stata 19.5 MP

    Technology requirements

    Runtime: 6 minutes

    Reproduction instructions

    The package uses intermediate data. The code used to process the raw data into intermediate data is included in the data construction folder for transparency. However, we did not verify the code that generates the intermediate data, as it takes over a month to run due to the large size of the datasets. Instead, reviewers verified the outputs generated from the intermediate data included in the package.
    To reproduce the exhibits in this paper, a new user should follow these steps:

    1. Update the working directory in the master.do file and run the code.
    2. All outputs will be generated in the Excel file tables.xlsx. Some figures are exported as values, and the graphs are created manually in the Excel file.
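
    After the run completes, a quick check that the output workbook was actually rewritten can help catch a stale file left over from an earlier run. The following is a minimal sketch in Python, not part of the reproducibility package: the function name `output_was_regenerated` is hypothetical, and the location of tables.xlsx relative to the working directory is an assumption that the README should confirm. It relies only on the fact that .xlsx files are ZIP containers.

    ```python
    from pathlib import Path
    import time
    import zipfile


    def output_was_regenerated(output_path, started_at):
        """Return True if output_path exists, is a ZIP container (as every
        .xlsx file is), and was last modified at or after started_at,
        given in seconds since the epoch."""
        path = Path(output_path)
        if not path.is_file():
            # The run produced no file at the expected location.
            return False
        if not zipfile.is_zipfile(path):
            # The file exists but is not a valid .xlsx container.
            return False
        # Compare the file's modification time with the recorded start time.
        return path.stat().st_mtime >= started_at
    ```

    One possible use is to record `started_at = time.time()` before launching master.do in Stata and to call `output_was_regenerated("tables.xlsx", started_at)` once Stata finishes; a result of False suggests the workbook is missing, unreadable, or older than the run.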

    Please note that while the data construction code is included in the package, users can run it only if they obtain access to the raw data. See the Datasets section for more details.

    Data

    Datasets
    Census of Population and Housing (CPV) 2010
    Name
    Census of Population and Housing (CPV) 2010
    Note
    Source: INEGI (Instituto Nacional de Estadística y Geografía)
    Access policy
    Data is publicly available but not included in the reproducibility package due to file size constraints.
    License URL
    https://en.www.inegi.org.mx/inegi/terminos.html
    Data URL
    https://en.www.inegi.org.mx/programas/ccpv/2010/#microdata
    Citation
    INEGI. 2010. "Census of Population and Housing (CPV)" [dataset]. https://en.www.inegi.org.mx/programas/ccpv/2010/#microdata. Accessed October 2023.
    Intercensal Survey (EIC) 2015
    Name
    Intercensal Survey (EIC) 2015
    Note
    Source: INEGI (Instituto Nacional de Estadística y Geografía)
    Access policy
    Data is publicly available but not included in the reproducibility package due to file size constraints.
    License URL
    https://en.www.inegi.org.mx/inegi/terminos.html
    Data URL
    https://en.www.inegi.org.mx/programas/intercensal/2015/
    Citation
    INEGI. 2015. "Intercensal Survey (EIC) 2015" [dataset]. https://en.www.inegi.org.mx/programas/intercensal/2015/.
    Census of Population and Housing (2020)
    Name
    Census of Population and Housing (2020)
    Note
    Source: INEGI (Instituto Nacional de Estadística y Geografía)
    Access policy
    Data is publicly available but not included in the reproducibility package due to file size constraints.
    License URL
    https://en.www.inegi.org.mx/inegi/terminos.html
    Data URL
    https://en.www.inegi.org.mx/programas/ccpv/2020/
    Citation
    INEGI. 2020. "Census of Population and Housing (2020)" [dataset]. https://en.www.inegi.org.mx/programas/ccpv/2020/.
    2000 Population Census
    Name
    2000 Population Census
    Note
    Source: IBGE (Instituto Brasileiro de Geografia e Estatística)
    Access policy
    Data is publicly available but not included in the reproducibility package due to file size constraints.
    License URL
    https://www.planalto.gov.br/ccivil_03/Portaria/P130-21-ccivil.htm#art5
    Data URL
    https://www.ibge.gov.br/en/statistics/social/population/18521-2000-population-census.html?edicao=18553
    Citation
    IBGE. 2000. "2000 Population Census" [dataset]. https://www.ibge.gov.br/en/statistics/social/population/18521-2000-population-census.html?edicao=18553.
    2010 Population Census
    Name
    2010 Population Census
    Note
    Source: IBGE (Instituto Brasileiro de Geografia e Estatística)
    Access policy
    Data is publicly available but not included in the reproducibility package due to file size constraints.
    License URL
    https://www.planalto.gov.br/ccivil_03/Portaria/P130-21-ccivil.htm#art5
    Data URL
    https://www.ibge.gov.br/en/statistics/social/health/18391-2010-population-census.html
    Citation
    IBGE. 2010. "2010 Population Census" [dataset]. https://www.ibge.gov.br/en/statistics/social/health/18391-2010-population-census.html.
    Data statement

    All data sources are publicly available but not included in the reproducibility package. Only the intermediate data is included in the package.

    Description

    Output
    Evaluating Alternative Approaches To Small Area Estimation Of Poverty With Survey And Census Data
    Type
    Working Paper
    Title
    Evaluating Alternative Approaches To Small Area Estimation Of Poverty With Survey And Census Data
    Description
    Policy Research Working Papers (PRWP)
    Authors
    Author             Affiliation                          Email
    David Newhouse     World Bank                           dnewhouse@worldbank.org
    Hai-Anh Dang       World Bank                           hdang1@worldbank.org
    Minh Do            World Bank                           minh.nn.do@gmail.com
    Partha Lahiri      University of Maryland College Park  plahiri@umd.edu
    Melany Gualavisi   University of Illinois               melanyg2@illinois.edu
    Talip Kilic        World Bank                           tkilic@worldbank.org
    Peter Lanjouw      Vrije Universiteit Amsterdam         p.f.lanjouw@vu.nl
    Roy Van der Weide  World Bank                           rvanderweide@worldbank.org
    Date of production

    2026-04-21

    Scope and coverage

    Geographic locations
    Location       Code
    Latin America  LAC
    Keywords
    Small Area Estimation, Poverty Mapping
    Topics
    ID   Topic                                        Parent topic ID  Vocabulary                            Vocabulary URI
    C51  Model Construction and Estimation            C5               Journal of Economic Literature (JEL)
    C52  Model Evaluation, Validation, and Selection  C5               Journal of Economic Literature (JEL)
    I32  Measurement and Analysis of Poverty          I3               Journal of Economic Literature (JEL)

    Disclaimer

    Disclaimer

    The materials in the reproducibility packages are distributed as they were prepared by the staff of the International Bank for Reconstruction and Development/The World Bank. The findings, interpretations, and conclusions expressed in these materials do not necessarily reflect the views of the World Bank, the Executive Directors of the World Bank, or the governments they represent. The World Bank does not guarantee the accuracy of the materials included in the reproducibility package.

    Access and rights

    License
    Name           URI
    Modified BSD3  https://opensource.org/license/bsd-3-clause/

    Contacts

    Contacts
    Name                 Affiliation  Email
    David Newhouse       World Bank   dnewhouse@worldbank.org
    Reproducibility WBG  World Bank   reproducibility@worldbank.org

    Information on metadata

    Producers
    Name                 Abbreviation  Affiliation                                 Role
    Reproducibility WBG  DECDI         World Bank - Development Impact Department  Verification and preparation of metadata
    Date of Production

    2026-04-21

    Document version

    1

    © The World Bank Group, All Rights Reserved.