Dummy Endogenous Treatment Effect Estimation Using High-Dimensional Instrumental Variables
The Canadian Journal of Statistics
26 Pages Posted: 12 Jun 2020 Last revised: 30 Mar 2021
Date Written: February 1, 2020
Abstract
In this paper, we develop a two-stage approach to estimate the treatment effects of dummy endogenous variables using high-dimensional instrumental variables (IVs). In the first stage, instead of using a conventional linear reduced-form regression to approximate the optimal instrument, we propose a penalized logistic reduced-form model to accommodate both the binary nature of the endogenous treatment variable and the high dimensionality of the instrumental variables. In the second stage, we replace the original treatment variable with its estimated propensity score and run a least-squares regression to obtain a penalized Logistic-regression Instrumental Variables Estimator (LIVE). We show theoretically that the proposed LIVE is root-n consistent with the true treatment effect and asymptotically normal. Monte Carlo simulations demonstrate that the LIVE is more efficient than existing IV estimators for endogenous treatment effects. In applications, we use the LIVE to investigate whether the Olympic Games facilitate the host nation's economic growth and whether home visits from teachers enhance students' academic performance. In addition, the R functions for the proposed algorithms have been developed in an R package, naivereg.
Keywords: Dummy variable, endogeneity, instrumental variable, logistic regression, treatment effect, variable selection
JEL Classification: C21, C25, C26, C55
Suggested Citation: Suggested Citation