TY - JOUR
T1 - A new test for high-dimensional regression coefficients in partially linear models
AU - Zhao, Fanrong
AU - Lin, Nan
AU - Zhang, Baoxue
N1 - Publisher Copyright:
© 2021 Statistical Society of Canada.
PY - 2023/3
Y1 - 2023/3
N2 - Partially linear regression models are semiparametric models that contain both linear and nonlinear components. They are extensively used in many scientific fields for their flexibility and convenient interpretability. In such analyses, testing the significance of the regression coefficients in the linear component is typically a key focus. Under the high-dimensional setting, i.e., “large p, small n,” the conventional F-test strategy does not apply because the coefficients need to be estimated through regularization techniques. In this article, we develop a new test using a U-statistic of order two, relying on a pseudo-estimate of the nonlinear component from the classical kernel method. Using the martingale central limit theorem, we prove the asymptotic normality of the proposed test statistic under some regularity conditions. We further demonstrate our proposed test's finite-sample performance by simulation studies and by analyzing some breast cancer gene expression data.
AB - Partially linear regression models are semiparametric models that contain both linear and nonlinear components. They are extensively used in many scientific fields for their flexibility and convenient interpretability. In such analyses, testing the significance of the regression coefficients in the linear component is typically a key focus. Under the high-dimensional setting, i.e., “large p, small n,” the conventional F-test strategy does not apply because the coefficients need to be estimated through regularization techniques. In this article, we develop a new test using a U-statistic of order two, relying on a pseudo-estimate of the nonlinear component from the classical kernel method. Using the martingale central limit theorem, we prove the asymptotic normality of the proposed test statistic under some regularity conditions. We further demonstrate our proposed test's finite-sample performance by simulation studies and by analyzing some breast cancer gene expression data.
KW - Asymptotic normality
KW - Nadaraya–Watson estimator
KW - U-statistic
KW - high-dimensional partially linear model
UR - https://www.scopus.com/pages/publications/85120381810
U2 - 10.1002/cjs.11665
DO - 10.1002/cjs.11665
M3 - Article
AN - SCOPUS:85120381810
SN - 0319-5724
VL - 51
SP - 5
EP - 18
JO - Canadian Journal of Statistics
JF - Canadian Journal of Statistics
IS - 1
ER -