TY - JOUR
T1 - Canonical variate regression
AU - Luo, Chongliang
AU - Liu, Jin
AU - Dey, Dipak K.
AU - Chen, Kun
N1 - Funding Information:
U.S. National Institutes of Health grant U01-HL114494 and Simons
Publisher Copyright:
© 2016 The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
PY - 2016/7/1
Y1 - 2016/7/1
N2 - In many fields, multi-view datasets, measuring multiple distinct but interrelated sets of characteristics on the same set of subjects, together with data on certain outcomes or phenotypes, are routinely collected. The objective in such a problem is often two-fold: both to explore the association structures of multiple sets of measurements and to develop a parsimonious model for predicting the future outcomes. We study a unified canonical variate regression framework to tackle the two problems simultaneously. The proposed criterion integrates multiple canonical correlation analysis with predictive modeling, balancing between the association strength of the canonical variates and their joint predictive power on the outcomes. Moreover, the proposed criterion seeks multiple sets of canonical variates simultaneously to enable the examination of their joint effects on the outcomes, and is able to handle multivariate and non-Gaussian outcomes. An efficient algorithm based on variable splitting and Lagrangian multipliers is proposed. Simulation studies show the superior performance of the proposed approach. We demonstrate the effectiveness of the proposed approach in an F-2 intercross mice study and an alcohol dependence study.
AB - In many fields, multi-view datasets, measuring multiple distinct but interrelated sets of characteristics on the same set of subjects, together with data on certain outcomes or phenotypes, are routinely collected. The objective in such a problem is often two-fold: both to explore the association structures of multiple sets of measurements and to develop a parsimonious model for predicting the future outcomes. We study a unified canonical variate regression framework to tackle the two problems simultaneously. The proposed criterion integrates multiple canonical correlation analysis with predictive modeling, balancing between the association strength of the canonical variates and their joint predictive power on the outcomes. Moreover, the proposed criterion seeks multiple sets of canonical variates simultaneously to enable the examination of their joint effects on the outcomes, and is able to handle multivariate and non-Gaussian outcomes. An efficient algorithm based on variable splitting and Lagrangian multipliers is proposed. Simulation studies show the superior performance of the proposed approach. We demonstrate the effectiveness of the proposed approach in an F-2 intercross mice study and an alcohol dependence study.
KW - Canonical correlation analysis
KW - Integrative analysis
KW - Reduced-rank regression
KW - Supervised learning
UR - http://www.scopus.com/inward/record.url?scp=84977097839&partnerID=8YFLogxK
U2 - 10.1093/biostatistics/kxw001
DO - 10.1093/biostatistics/kxw001
M3 - Article
C2 - 26861909
AN - SCOPUS:84977097839
SN - 1465-4644
VL - 17
SP - 468
EP - 483
JO - Biostatistics
JF - Biostatistics
IS - 3
ER -