TY - JOUR
T1 - Debiased Inference on Treatment Effect in a High-Dimensional Model
AU - Wang, Jingshen
AU - He, Xuming
AU - Xu, Gongjun
N1 - Publisher Copyright:
© 2019, © 2019 American Statistical Association.
PY - 2020/1/2
Y1 - 2020/1/2
N2 - This article concerns the potential bias in statistical inference on treatment effects when a large number of covariates are present in a linear or partially linear model. While the estimation bias in an under-fitted model is well understood, we address a lesser-known bias that arises from an over-fitted model. The over-fitting bias can be eliminated through data splitting at the cost of statistical efficiency, and we show that smoothing over random data splits can be pursued to mitigate the efficiency loss. We also discuss some of the existing methods for debiased inference and provide insights into their intrinsic bias-variance trade-off, which leads to an improvement in bias controls. Under appropriate conditions, we show that the proposed estimators for the treatment effects are asymptotically normal and their variances can be well estimated. We discuss the pros and cons of various methods both theoretically and empirically, and show that the proposed methods are valuable options in post-selection inference. Supplementary materials for this article are available online.
AB - This article concerns the potential bias in statistical inference on treatment effects when a large number of covariates are present in a linear or partially linear model. While the estimation bias in an under-fitted model is well understood, we address a lesser-known bias that arises from an over-fitted model. The over-fitting bias can be eliminated through data splitting at the cost of statistical efficiency, and we show that smoothing over random data splits can be pursued to mitigate the efficiency loss. We also discuss some of the existing methods for debiased inference and provide insights into their intrinsic bias-variance trade-off, which leads to an improvement in bias controls. Under appropriate conditions, we show that the proposed estimators for the treatment effects are asymptotically normal and their variances can be well estimated. We discuss the pros and cons of various methods both theoretically and empirically, and show that the proposed methods are valuable options in post-selection inference. Supplementary materials for this article are available online.
KW - Data splitting
KW - De-sparsified Lasso
KW - Post-selection inference
UR - https://www.scopus.com/pages/publications/85064812050
U2 - 10.1080/01621459.2018.1558062
DO - 10.1080/01621459.2018.1558062
M3 - Article
AN - SCOPUS:85064812050
SN - 0162-1459
VL - 115
SP - 442
EP - 454
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 529
ER -