Reproducible Research Repository
Reproducible Research Repository
  • Home
  • Repository
  • Collections
  • About
    Home / Repository / JA / PP_IND_2024_81
ja

Reproducibility package for Wealth, Marriage, and Sex Selection

2025
Get Reproducibility Package
Reference ID
PP_IND_2024_81
DOI
https://doi.org/10.60572/v977-sw59
Author(s)
Girija Borker, Jan Eeckhout, Nancy Luke, Shantidani Minz, Kaivan Munshi, Soumya Swaminathan
Collections
Journal articles
Metadata
JSON
Created on
Feb 09, 2024
Last modified
May 06, 2025
  • Project Description
  • Downloads
  • Overview
  • Reproducibility Package
  • Description
  • Scope and coverage
  • Disclaimer
  • Access and rights
  • Contacts
  • Information on metadata
  • Citation
  • Overview

    Abstract

    Two mechanisms have been proposed to explain sex selection in India: son preference in which parents desire a male heir and daughter aversion in which dowry payments make parents worse off with girls. Our model incorporates both mechanisms, providing micro-foundations, based on the organization of the marriage institution, for daughter aversion. Marital matching, sex selection, and dowries are jointly determined in the model, whose implications are tested on a representative sample of rural households. Simulations of the model indicate that existing policies targeting daughter aversion might exacerbate the problem, while identifying other policies that could be effective.

    Reproducibility Package

    Scripts
    Readme Get Reproducibility Package
    Link: https://reproducibility.worldbank.org/index.php/catalog/105/download/804/README.pdf
    Reproducibility package for Wealth, Marriage, and Sex Selection
    Title
    Reproducibility package for Wealth, Marriage, and Sex Selection
    Date
    2025-04
    Dependencies
    Stata dependencies are listed in the ado folder. Matlab: global optimization toolbox
    Instructions
    See README in the reproducibility package.
    Notes
    Computational reproducibility verified by the Development Impact (DIME) Analytics team, World Bank.
    Source code repository
    Repository name URI
    Reproducible Research Repository (World Bank) https://reproducibility.worldbank.org
    Software
    Matlab
    Name
    Matlab
    Version
    2024b
    Stata
    Name
    Stata
    Version
    18 MP

    Reproducibility

    Technology environment

    Paper exhibits were reproduced in two computers with the following specifications:

    Matlab code:

    • OS: Windows 10 Enterprise
    • Processor: Intel(R) Xeon(R) Gold 6132 CPU @ 2.60GHz, 2600 Mhz, 16 Core(s), 16 Logical Processor(s)
    • Memory available: 128 GB
    • Software version: Matlab 2024b

    Stata code:

    • OS: Windows Server 2019 Standard
    • Processor: Intel(R) Xeon(R) CPU E7-4850 v4 @ 2.10GHz 2.10GHz (16 processors)
    • Memory available: 64 GB
    • Software version: Stata 18.0 MP
    Technology requirements

    Runtime: 3 weeks

    Reproduction instructions
    1. Secure Access to Data: Download the datasets not included in the package. Some data required for replication is not publicly available and needs to be purchased. See subsection Datasets and the README for more details
    2. Download and Place Data: Once the data is downloaded, users should place it in the appropriate folder.
    3. Run the Package: After placing the data in the folder:
      • Update the directory in the scripts 1_master, 2_master, and 3_master.
      • Run the code in the order specified in the README.

    Since all the data is not included, the package includes the results produced by replicators in the Results folder. These files can be used to review the results presented in the paper.

    Data

    Datasets
    South India Community Health Study (SICHS)
    Name
    South India Community Health Study (SICHS)
    Note
    Source: The World Bank Microdata Library Notes: The datasets available on the Microdata Library differ slightly from those used by the reproducibility team during verification, as the public versions exclude variables containing personally identifiable information (PII). As a result, some code files may not run in full. Specifically, the following do-files are affected: code/stata/subcode/1_1_ind_cleaning_akhil.do; code/stata/7_bootstrap_breg_ar.do; code/stata/15_marriage_analysis_ar1.do. All datasets listed below are available on the Microdata Library. Datasets marked with an asterisk (*) differ from those used in the verification process due to the exclusion of PII variables: (1) Located at data/raw/FPR. File names: FPR_SEC_C_P1, FPR_SEC_E_P2. (2) Located at data/raw/MPR. File names: MPR_SEC_A_edit, MPR_SEC_A_PGI, MPR_SEC_E_comb, MPR_SEC_E_P2, MPR_SEC_S_comb. (3) Located at *data/raw/ROSTER. File names: HOUSEROSTER_P2, HOUSEROSTER_P2_edit, HOUSEROSTER_P2forE3_edit, HOUSEROSTER_P2forE10_edit. (4) Located at data/raw/SPO. File name: SPO_SEC_B. (5) Located at *data/raw/2017-11-23_married_couples_pred_income_1871.dta. (6) Located at *data/raw/2017-12-07-Dowry_income.dta. (7) Located at *data/raw/CompleteData_Rough_10Oct2015.dta.
    Access policy
    The datasets are available as licensed files.
    License URL
    https://microdata.worldbank.org/index.php/terms-of-use
    Data URL
    https://microdata.worldbank.org/index.php/catalog/6652/study-description#doc_desc.title_statement
    Madras District Village Census, 1871 (North Arcot, South Arcot, Chingleput, Salem)
    Name
    Madras District Village Census, 1871 (North Arcot, South Arcot, Chingleput, Salem)
    Note
    Source: Data compiled by the authors in 2013 using the books listed below, accessed at the British Library in London. Books: Madras Districts: Census statement of population of 1871 in each village of the North Arcot District arranged according to area, caste and occupation. Madras: Foster Press, 1874. Pages: 1 – 492; Madras Districts: Census statement of population of 1871 in each village of the South Arcot District arranged according to area, caste and occupation. Madras: Adelphi Press, 1874. Pages: 1 – 269; Madras Districts: Census statement of population of 1871 in each village of the Chingleput District arranged according to area, caste and occupation. Madras: Scottish Press, 1874. Pages: 1-199; Madras Districts: Census statement of population of 1871 in each village of the Salem District arranged according to area, caste and occupation. Madras: Lawrence Asylum Press, 1874. Pages: 1 – 432. Located at: data/raw File names: Arcot_area.csv, Arcot_caste.csv, Arcot_occupation.csv, Master_v1.csv, Predicted1871IncomeForSICHS.dta, TamilNadu-Habitations_Original.dta, Tirupattu_area.csv, Tirupattur_caste.csv, Tirupattur_occupation.csv, Wallajah_area.csv, Wallajah_caste.csv, Wallajah_occupation.csv
    Access policy
    The data is not included in the reproducibility package, and no formal access procedure is documented. As the data was compiled directly from physical materials at the British Library, it cannot be redistributed.
    Demographic and Socioeconomic Indicators – South India
    Name
    Demographic and Socioeconomic Indicators – South India
    Note
    Source: Data accessed from Indiastat.com Notes: The table "Age-Group Wise Percentage of Population by Sex and Marital Status (Rural and Urban)" was downloaded for Maharashtra, Karnataka, Andhra Pradesh, and Tamil Nadu for the year 2011, and additionally for Karnataka for 2018. For India, the following tables were used: State-wise Total Population by Residence and Sex (2011 Census); Labour Force Participation Rate (LFPR) by Different Age Groups (in per 1000 persons) and Sex in Rural India (2009–2010); State-wise Religious Population by Residence; State-wise Child Sex Ratio (Age Group 0–6 Years) by Residence; State-wise Sex Ratio (Female per 1000 Males); State-wise Literacy Rate by Residence and Sex. Located at: data/raw File name: tableA1_S1_TN_SICHS comparison.xlsx
    Access policy
    The data is not included in the package. It can be accessed by obtaining a paid subscription to Indiastat and following the access instructions provided in the README.
    Unweighted Random Samples of 12 Caste Identifiers
    Name
    Unweighted Random Samples of 12 Caste Identifiers
    Note
    Source: Data complied by the authors. Notes: Instructions to create the dataset are included in the README. Located at: data/raw File name: unweighted_samples_12_castes.csv
    Access policy
    Included in the package.
    Demographic and Health Surveys (DHS)
    Name
    Demographic and Health Surveys (DHS)
    Note
    Source: India: Standard DHS, 2005–06 Dataset; India: Standard DHS, 2015–16 Dataset. Notes: To download the datasets, please register on the website, request access to the India data, and download the Individual Recode Stata datasets for 2005–06 and 2015–16. Located at: data/raw File names: IAIR52FL.dta and IAIR74FL.dta
    Access policy
    The dataset is not included in the package. Currently, the data cannot be accessed, as new user registration on The DHS Program website is temporarily unavailable due to an ongoing review of U.S. foreign assistance programs.
    License URL
    https://dhsprogram.com/data/Terms-of-Use.cfm
    Data statement

    Some data is restricted and has not been included in the reproducibility package. For more details, please refer to the README file.

    Description

    Output
    Wealth, Marriage, and Sex Selection
    Type
    Journal article
    Title
    Wealth, Marriage, and Sex Selection
    Authors
    Girija Borker, Jan Eeckhout, Nancy Luke, Shantidani Minz, Kaivan Munshi, Soumya Swaminathan
    Authors
    Author Affiliation Email
    Girija Borker World Bank and IZA gborker@workdbank.org
    Jan Eeckhout UPF Barcelona (ICREA-BSE-CREi) jan.eeckhout@upf.edu
    Nancy Luke Pennsylvania State University nkl10@psu.edu
    Shantidani Minz Christian Medical College, Vellore shantidanim@cmcvellore.ac.in
    Kaivan Munshi Yale University and Toulouse School of Economics kaivan.munshi@yale.edu
    Soumya Swaminathan World Health Organization swaminathans@who.int
    Date of production

    2025-05

    Scope and coverage

    Geographic locations
    Location Code
    India IND
    Keywords
    Family Economics Social Norms Marriage Market Sex Selection Caste Assortative Matching Wealth Distribution Inequality Control Function
    Topics
    ID Topic Parent topic ID Vocabulary Vocabulary URI
    J12 Marriage • Marital Dissolution • Family Structure • Domestic Abuse J1 Journal of Economic Literature (JEL)
    J16 Economics of Gender • Non-labor Discrimination J1 Journal of Economic Literature (JEL)
    D31 Personal Income, Wealth, and Their Distributions D3 Journal of Economic Literature (JEL)
    I3 Welfare, Well-Being, and Poverty I3 Journal of Economic Literature (JEL)

    Disclaimer

    Disclaimer

    The materials in the reproducibility packages are distributed as they were prepared by the staff of the International Bank for Reconstruction and Development/The World Bank. The findings, interpretations, and conclusions expressed in this event do not necessarily reflect the views of the World Bank, the Executive Directors of the World Bank, or the governments they represent. The World Bank does not guarantee the accuracy of the materials included in the reproducibility package.

    Access and rights

    License
    Name URI
    Modified BSD3 https://opensource.org/license/bsd-3-clause/

    Contacts

    Contacts
    Name Affiliation Email
    Girija Borker World Bank gborker@worldbank.org
    Reproducibility WB World Bank reproducibility@worldbank.org

    Information on metadata

    Producers
    Name Abbreviation Affiliation Role
    Reproducibility WBG DIME World Bank - Development Impact Department Verification and preparation of metadata
    Date of Production

    2025-05-05

    Document version

    1

    Citation

    Citation
    loading, please wait...
    Citation format
    Export citation: RIS | BibTeX | Plain text
    Back to Catalog
    The World Bank Working for a World Free of Poverty
    • IBRD IDA IFC MIGA ICSID

    © The World Bank Group, All Rights Reserved.