Bangladesh's fertilizer subsidy costs $2.5 billion annually and accounts for nearly two-thirds of agricultural spending, yet its distributional impact remains unknown due to data limitations. This impedes reform of a policy that may favor larger farmers while crowding out investment in public goods, the real engines for long-term productivity growth. The study develops a survey-to-survey imputation method to address this gap: the Household Income and Expenditure Survey (HIES) 2022 records total fertilizer expenditure but not subsidized types, preventing accurate incidence analysis. The method combines cross-validated LASSO regression with randomized hot-deck matching to transfer type-specific fertilizer patterns from the Bangladesh Integrated Household Survey (BIHS) 2018-2019 to HIES 2022. Our procedure predicts household urea shares using 42 harmonized predictors, then assigns complete fertilizer compositions through nearest-neighbor matching within welfare-by-agro-ecological strata. The method achieves strong predictive accuracy (test RMSE = 0.169) and preserves distributional properties. Imputed shares replicate donor patterns closely: mean urea share is 51.5 percent versus 51.1 percent in BIHS, with overlapping confidence intervals across fertilizer types and regions. The enriched dataset provides the foundation for assessing whether subsidy benefits are concentrated among larger, wealthier farmers or distributed more equitably across farm households—a question previously unanswerable with existing data. More broadly, the study demonstrates a scalable framework for integrating complementary surveys in data-constrained settings.
| Repository name | URI |
|---|---|
| Reproducible Research Repository (World Bank) | https://reproducibility.worldbank.org |
Paper exhibits were reproduced on a computer with the following specifications:
• OS: Windows 11 Enterprise
• Processor: Intel(R) Core(TM) i5-1145G7 CPU @ 2.60GHz
• Memory available: 15.7 GB
Runtime: 30 minutes
To reproduce the exhibits in this paper, follow these steps:
00_master_stage1_harmonization, update the working directory, and run the code.fertilizer_s2s_reproducibility.Rproj.renv::restore(), or manually install the packages listed in renv.lock.03_figure1_kernel_density, update the working directory, and run the script.05_s2s_imputation, update the working directory, and run the script.00_master_stage3, update the working directory, install the required packages, and run the code.Since the data is not included in the package, the outputs produced by the verification team are included so that users can compare them with the published manuscript.
Some data is restricted and has not been included in the reproducibility package. For more details, please refer to the README file.
| Author | Affiliation | |
|---|---|---|
| FNU Jonaed | World Bank | fjonaed@worldbank.org |
| Ivan Gachet | World Bank | igachet@worldbank.org |
| Leopoldo Tornarolli | World Bank | tornarolli@gmail.com |
2026-04-17
| Location | Code |
|---|---|
| Bangladesh | BGD |
The materials in the reproducibility packages are distributed as they were prepared by the staff of the International Bank for Reconstruction and Development/The World Bank. The findings, interpretations, and conclusions expressed in this event do not necessarily reflect the views of the World Bank, the Executive Directors of the World Bank, or the governments they represent. The World Bank does not guarantee the accuracy of the materials included in the reproducibility package.
| Name | URI |
|---|---|
| Modified BSD3 | https://opensource.org/license/bsd-3-clause/ |
| Name | Affiliation | |
|---|---|---|
| FNU Jonaed | World Bank | fjonaed@worldbank.org |
| Reproducibility WBG | World Bank | reproducibility@worldbank.org |
| Name | Abbreviation | Affiliation | Role |
|---|---|---|---|
| Reproducibility WBG | DECDI | World Bank - Development Impact Department | Verification and preparation of metadata |
2026-04-17
1