TY - GEN
T1 - Is the reduction of dimensionality to a small number of features always necessary in constructing predictive models for analysis of complex diseases or behaviours?
AU - Zollanvari, Amin
AU - Saccone, Nancy L.
AU - Bierut, Laura J.
AU - Ramoni, Marco F.
AU - Alterovitz, Gil
PY - 2011
Y1 - 2011
N2 - Gene expression and genome wide association data have provided researchers the opportunity to study many complex traits and diseases. When designing prognostic and predictive models capable of phenotypic classification in this area, significant reduction of dimensionality through stringent filtering and/or feature selection is often deemed imperative. Here, this work challenges this presumption through both theoretical and empirical analysis. This work demonstrates that by a proper compromise between structure of the selected model and the number of features, one is able to achieve better performance even in large dimensionality. The inclusion of many genes/variants in the classification rules can help shed new light on the analysis of complex traitstraits that are typically determined by many causal variants with small effect size.
AB - Gene expression and genome wide association data have provided researchers the opportunity to study many complex traits and diseases. When designing prognostic and predictive models capable of phenotypic classification in this area, significant reduction of dimensionality through stringent filtering and/or feature selection is often deemed imperative. Here, this work challenges this presumption through both theoretical and empirical analysis. This work demonstrates that by a proper compromise between structure of the selected model and the number of features, one is able to achieve better performance even in large dimensionality. The inclusion of many genes/variants in the classification rules can help shed new light on the analysis of complex traitstraits that are typically determined by many causal variants with small effect size.
UR - http://www.scopus.com/inward/record.url?scp=84055199163&partnerID=8YFLogxK
U2 - 10.1109/IEMBS.2011.6090596
DO - 10.1109/IEMBS.2011.6090596
M3 - Conference contribution
C2 - 22255111
AN - SCOPUS:84055199163
SN - 9781424441211
T3 - Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
SP - 3573
EP - 3576
BT - 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS 2011
T2 - 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS 2011
Y2 - 30 August 2011 through 3 September 2011
ER -