TY - JOUR
T1 - An Efficient Testing Procedure for High-Dimensional Mediators with FDR Control
AU - Bai, Xueyan
AU - Zheng, Yinan
AU - Hou, Lifang
AU - Zheng, Cheng
AU - Liu, Lei
AU - Zhang, Haixiang
N1 - Publisher Copyright:
© The Author(s) under exclusive licence to International Chinese Statistical Association 2024.
PY - 2024
Y1 - 2024
N2 - The field of mediation analysis commonly explores the pathways that connect environmental exposures with health outcomes. With the development of data collection techniques, greater efforts have been dedicated to addressing high-dimensional mediators. In this paper, we present an efficient approach to identify significant mediators while controlling the false discovery rate (FDR). We propose a three-step procedure that incorporates independent screening, variable selection together with refitted partial regression, and divide-aggregate composite-null test (DACT). The simulation includes a comparative analysis of our proposed method in comparison to eight competing approaches, demonstrating that our procedure has significant advantages over other methods. The proposed procedure is applied to investigate the mediation mechanisms of DNA methylation in the relationship between smoking and lung function. Three specific methylation sites (cg26331243, cg19862839, and cg12616487) are identified as potential epigenetic markers involved in mediating this relationship. Our proposed method is available with the R package HIMA at https://cran.r-project.org/web/packages/HIMA/.
AB - The field of mediation analysis commonly explores the pathways that connect environmental exposures with health outcomes. With the development of data collection techniques, greater efforts have been dedicated to addressing high-dimensional mediators. In this paper, we present an efficient approach to identify significant mediators while controlling the false discovery rate (FDR). We propose a three-step procedure that incorporates independent screening, variable selection together with refitted partial regression, and divide-aggregate composite-null test (DACT). The simulation includes a comparative analysis of our proposed method in comparison to eight competing approaches, demonstrating that our procedure has significant advantages over other methods. The proposed procedure is applied to investigate the mediation mechanisms of DNA methylation in the relationship between smoking and lung function. Three specific methylation sites (cg26331243, cg19862839, and cg12616487) are identified as potential epigenetic markers involved in mediating this relationship. Our proposed method is available with the R package HIMA at https://cran.r-project.org/web/packages/HIMA/.
KW - FDR control
KW - High-dimentional mediation analysis
KW - Multiple testing
KW - Variable selection
UR - http://www.scopus.com/inward/record.url?scp=85197799586&partnerID=8YFLogxK
U2 - 10.1007/s12561-024-09447-4
DO - 10.1007/s12561-024-09447-4
M3 - Article
AN - SCOPUS:85197799586
SN - 1867-1764
JO - Statistics in Biosciences
JF - Statistics in Biosciences
ER -