TY - GEN
T1 - Identifying interacting SNPs with parallel fish-agent based logic regression
AU - Wang, Jiayin
AU - Zhang, Jin
AU - Wu, Yufeng
PY - 2011/4/14
Y1 - 2011/4/14
N2 - Understanding the genotype-phenotype association is a fundamental problem in genetics. A major open problem in mapping complex traits is identifying a set of interacting genetic variants (such as single nucleotide polymorphisms or SNPs) that influence disease susceptibility. Logic regression (LR) is a statistical approach that has been proposed to model interactions of SNPs. Several LR-based association detection approaches have been developed in the past. However, existing LR-based approaches are insufficient in handling noisy and increasingly larger data. In this paper, we first develop a relational clustering approach for handling noisy data, where we reduce noise by filtering out unrelated SNPs. We then propose a parallel fish-agent LR approach to speed up the computation. The basic idea of our approach is using multiple fish-agents that explore the model space independently. At each iteration, agents in the same or different clusters communicate with others to achieve faster convergence to the global optimal solutions. Simulation results show that our approach significantly speeds up the LR computation over existing approaches. Also, our results show that our approach achieves good performance in dealing with noise.
AB - Understanding the genotype-phenotype association is a fundamental problem in genetics. A major open problem in mapping complex traits is identifying a set of interacting genetic variants (such as single nucleotide polymorphisms or SNPs) that influence disease susceptibility. Logic regression (LR) is a statistical approach that has been proposed to model interactions of SNPs. Several LR-based association detection approaches have been developed in the past. However, existing LR-based approaches are insufficient in handling noisy and increasingly larger data. In this paper, we first develop a relational clustering approach for handling noisy data, where we reduce noise by filtering out unrelated SNPs. We then propose a parallel fish-agent LR approach to speed up the computation. The basic idea of our approach is using multiple fish-agents that explore the model space independently. At each iteration, agents in the same or different clusters communicate with others to achieve faster convergence to the global optimal solutions. Simulation results show that our approach significantly speeds up the LR computation over existing approaches. Also, our results show that our approach achieves good performance in dealing with noise.
KW - Clustering
KW - Genotype-phenotype association
KW - Logic regression
KW - Parallel computing
KW - Swarm intelligence method
UR - http://www.scopus.com/inward/record.url?scp=79953820490&partnerID=8YFLogxK
U2 - 10.1109/ICCABS.2011.5729874
DO - 10.1109/ICCABS.2011.5729874
M3 - Conference contribution
AN - SCOPUS:79953820490
SN - 9781612848525
T3 - 2011 IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences, ICCABS 2011
SP - 171
EP - 177
BT - 2011 IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences, ICCABS 2011
T2 - 1st IEEE International Conference on Computational Advances in Bio and Medical Sciences, ICCABS 2011
Y2 - 3 February 2011 through 5 February 2011
ER -