TY - GEN
T1 - Identification and evaluation of functional modules in gene co-expression networks
AU - Ruan, Jianhua
AU - Zhang, Weixiong
PY - 2007
Y1 - 2007
N2 - Identifying gene functional modules is an important step towards elucidating gene functions at a global scale. In this paper, we introduce a simple method to construct gene co-expression networks from microarray data, and then propose an efficient spectral clustering algorithm to identify natural communities, which are relatively densely connected sub-graphs, in the network. To assess the effectiveness of our approach and its advantage over existing methods, we develop a novel method to measure the agreement between the gene communities and the modular structures in other reference networks, including protein-protein interaction networks, transcriptional regulatory networks, and gene networks derived from gene annotations. We evaluate the proposed methods on two large-scale gene expression data in budding yeast and Arabidopsis thaliana. The results show that the clusters identified by our method are functionally more coherent than the clusters from several standard clustering algorithms, such as k-means, self-organizing maps, and spectral clustering, and have high agreement to the modular structures in the reference networks.
AB - Identifying gene functional modules is an important step towards elucidating gene functions at a global scale. In this paper, we introduce a simple method to construct gene co-expression networks from microarray data, and then propose an efficient spectral clustering algorithm to identify natural communities, which are relatively densely connected sub-graphs, in the network. To assess the effectiveness of our approach and its advantage over existing methods, we develop a novel method to measure the agreement between the gene communities and the modular structures in other reference networks, including protein-protein interaction networks, transcriptional regulatory networks, and gene networks derived from gene annotations. We evaluate the proposed methods on two large-scale gene expression data in budding yeast and Arabidopsis thaliana. The results show that the clusters identified by our method are functionally more coherent than the clusters from several standard clustering algorithms, such as k-means, self-organizing maps, and spectral clustering, and have high agreement to the modular structures in the reference networks.
KW - Clustering
KW - Co-expression networks
KW - Community identification
KW - Microarray
UR - http://www.scopus.com/inward/record.url?scp=38049110883&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-73060-6_5
DO - 10.1007/978-3-540-73060-6_5
M3 - Conference contribution
AN - SCOPUS:38049110883
SN - 9783540730590
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 57
EP - 76
BT - Systems Biology and Computational Proteomics - Joint RECOMB 2006 Satellite Workshops on Systems Biology and on Computational Proteomics, Revised Selected Papers
PB - Springer Verlag
T2 - Joint RECOMB 2006 Satellite Workshops on Systems Biology and on Computational Proteomics
Y2 - 1 December 2006 through 3 December 2006
ER -