TY - JOUR
T1 - Integrative analysis of somatic mutations and transcriptomic data to functionally stratify breast cancer patients
AU - Zhang, Jie
AU - Abrams, Zachary
AU - Parvin, Jeffrey D.
AU - Huang, Kun
N1 - Publisher Copyright:
© 2016 The Author(s).
PY - 2016/8/22
Y1 - 2016/8/22
N2 - Background: Somatic mutations can be used as potential biomarkers for subtyping and predicting outcomes for cancer patients. However, cancer patients often carry many somatic mutations, which do not always concentrate on specific genomic loci, suggesting that the mutations may affect common pathways or gene interaction networks instead of common genes. The challenge is thus to identify the functional relationships among the mutations using multi-modal data. We developed a novel approach for integrating patient somatic mutation, transcriptome and clinical data to mine underlying functional gene groups that can be used to stratify cancer patients into groups with different clinical outcomes. Specifically, we use distance correlation metric to mine the correlations between expression profiles of mutated genes from different patients. Results: With this approach, we were able to cluster patients based on the functional relationships between the affected genes using their expression profiles, and to visualize the results using multi-dimensional scaling. Interestingly, we identified a stable subgroup of breast cancer patients that are highly enriched with ER-negative and triple-negative subtypes, and the somatic mutation genes they harbor were capable of acting as potential biomarkers to predict patient survival in several different breast cancer datasets, especially in ER-negative cohorts which has lacked reliable biomarkers. Conclusions: Our method provides a novel and promising approach for integrating genotyping and gene expression data in patient stratification in complex diseases.
AB - Background: Somatic mutations can be used as potential biomarkers for subtyping and predicting outcomes for cancer patients. However, cancer patients often carry many somatic mutations, which do not always concentrate on specific genomic loci, suggesting that the mutations may affect common pathways or gene interaction networks instead of common genes. The challenge is thus to identify the functional relationships among the mutations using multi-modal data. We developed a novel approach for integrating patient somatic mutation, transcriptome and clinical data to mine underlying functional gene groups that can be used to stratify cancer patients into groups with different clinical outcomes. Specifically, we use distance correlation metric to mine the correlations between expression profiles of mutated genes from different patients. Results: With this approach, we were able to cluster patients based on the functional relationships between the affected genes using their expression profiles, and to visualize the results using multi-dimensional scaling. Interestingly, we identified a stable subgroup of breast cancer patients that are highly enriched with ER-negative and triple-negative subtypes, and the somatic mutation genes they harbor were capable of acting as potential biomarkers to predict patient survival in several different breast cancer datasets, especially in ER-negative cohorts which has lacked reliable biomarkers. Conclusions: Our method provides a novel and promising approach for integrating genotyping and gene expression data in patient stratification in complex diseases.
KW - Breast cancer patient stratification
KW - Distance correlation
KW - Functional analysis of somatic mutation
KW - Integrative analysis
UR - http://www.scopus.com/inward/record.url?scp=84983084203&partnerID=8YFLogxK
U2 - 10.1186/s12864-016-2902-0
DO - 10.1186/s12864-016-2902-0
M3 - Article
C2 - 27556157
AN - SCOPUS:84983084203
SN - 1471-2164
VL - 17
JO - BMC genomics
JF - BMC genomics
M1 - 513
ER -