TY - JOUR
T1 - An Iterated loop matching approach to the prediction of RNA secondary structures with pseudoknots
AU - Ruan, Jianhua
AU - Stormo, Gary D.
AU - Zhang, Weixiong
N1 - Funding Information:
We thank Elena Rivas and Sean Eddy for providing the PKNOTS program and results. We also thank the anonymous reviewers for their very useful comments. This research was supported in part by NSF grants IIS-0196057 and ITR/EIA-0113618. G.D.S. was supported by NIH grant HG00249.
PY - 2004/1/1
Y1 - 2004/1/1
N2 - Motivation: Pseudoknots have generally been excluded from the prediction of RNA secondary structures due to its difficulty in modeling. Although, several dynamic programming algorithms exist for the prediction of pseudoknots using thermodynamic approaches, they are neither reliable nor efficient. On the other hand, comparative methods are more reliable, but are often done in an ad hoc manner and require expert intervention. Maximum weighted matching, an algorithm for pseudoknot prediction with comparative analysis, suffers from low-prediction accuracy in many cases. Results: Here we present an algorithm, iterated loop matching, for reliably and efficiently predicting RNA secondary structures including pseudoknots. The method can utilize either thermodynamic or comparative information or both, thus is able to predict pseudoknots for both aligned and individual sequences. We have tested the algorithm on a number of RNA families. Using 8-12 homologous sequences, the algorithm correctly identifies more than 90% of base-pairs for short sequences and 80% overall. It correctly predicts nearly all pseudoknots and produces very few spurious base-pairs for sequences without pseudoknots. Comparisons show that our algorithm is both more sensitive and more specific than the maximum weighted matching method. In addition, our algorithm has high-prediction accuracy on individual sequences, comparable with the PKNOTS algorithm, while using much less computational resources.
AB - Motivation: Pseudoknots have generally been excluded from the prediction of RNA secondary structures due to its difficulty in modeling. Although, several dynamic programming algorithms exist for the prediction of pseudoknots using thermodynamic approaches, they are neither reliable nor efficient. On the other hand, comparative methods are more reliable, but are often done in an ad hoc manner and require expert intervention. Maximum weighted matching, an algorithm for pseudoknot prediction with comparative analysis, suffers from low-prediction accuracy in many cases. Results: Here we present an algorithm, iterated loop matching, for reliably and efficiently predicting RNA secondary structures including pseudoknots. The method can utilize either thermodynamic or comparative information or both, thus is able to predict pseudoknots for both aligned and individual sequences. We have tested the algorithm on a number of RNA families. Using 8-12 homologous sequences, the algorithm correctly identifies more than 90% of base-pairs for short sequences and 80% overall. It correctly predicts nearly all pseudoknots and produces very few spurious base-pairs for sequences without pseudoknots. Comparisons show that our algorithm is both more sensitive and more specific than the maximum weighted matching method. In addition, our algorithm has high-prediction accuracy on individual sequences, comparable with the PKNOTS algorithm, while using much less computational resources.
UR - http://www.scopus.com/inward/record.url?scp=0347724163&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btg373
DO - 10.1093/bioinformatics/btg373
M3 - Article
C2 - 14693809
AN - SCOPUS:0347724163
SN - 1367-4803
VL - 20
SP - 58
EP - 66
JO - Bioinformatics
JF - Bioinformatics
IS - 1
ER -