TY - JOUR
T1 - A hybrid promoter analysis methodology for prokaryotic genomes
AU - Cotik, V.
AU - Romero Zaliz, R.
AU - Zwir, I.
N1 - Funding Information:
This work was partially supported by the Spanish Ministerio de Ciencia y Tecnologia under project TIC2003-00877 (including FEDER fundings).
PY - 2005/5/16
Y1 - 2005/5/16
N2 - One of the big challenges of the post-genomic era is identifying regulatory systems and integrating them into genetic networks. Gene expression is determined by protein-protein interactions among regulatory proteins and with RNA polymerase(s), and protein-DNA interactions of these trans-acting factors with cis-acting DNA sequences in the promoter regions of those regulated genes. Therefore, identifying these protein-DNA interactions, by means of the DNA motifs that characterize the regulatory factors operating in the transcription of a gene, becomes crucial for determining which genes participate in a regulation process, how they behave and how they are connected to build genetic networks. In this paper, we propose a hybrid promoter analysis methodology (HPAM) to discover complex promoter motifs that combines: the neural network efficiency and ability of representing imprecise and incomplete patterns; the flexibility and interpretability of fuzzy models; and the multi-objective evolutionary algorithms capability to identify optimal instances of a model by searching according to multiple criteria. We test our methodology by learning and predicting the RNA polymerase motif in prokaryotic genomes. This constitutes a special challenge due to the multiplicity of the RNA polymerase targets and its connectivity with other transcription factors, which sometimes require multiple functional binding sites even in close located regulatory regions; and the uncertainty of its motif, which allows sites with low specificity (i.e., differing from the best alignment or consensus) to still be functional. HPAM is available for public use in http://soar-tools.wustl.edu.
AB - One of the big challenges of the post-genomic era is identifying regulatory systems and integrating them into genetic networks. Gene expression is determined by protein-protein interactions among regulatory proteins and with RNA polymerase(s), and protein-DNA interactions of these trans-acting factors with cis-acting DNA sequences in the promoter regions of those regulated genes. Therefore, identifying these protein-DNA interactions, by means of the DNA motifs that characterize the regulatory factors operating in the transcription of a gene, becomes crucial for determining which genes participate in a regulation process, how they behave and how they are connected to build genetic networks. In this paper, we propose a hybrid promoter analysis methodology (HPAM) to discover complex promoter motifs that combines: the neural network efficiency and ability of representing imprecise and incomplete patterns; the flexibility and interpretability of fuzzy models; and the multi-objective evolutionary algorithms capability to identify optimal instances of a model by searching according to multiple criteria. We test our methodology by learning and predicting the RNA polymerase motif in prokaryotic genomes. This constitutes a special challenge due to the multiplicity of the RNA polymerase targets and its connectivity with other transcription factors, which sometimes require multiple functional binding sites even in close located regulatory regions; and the uncertainty of its motif, which allows sites with low specificity (i.e., differing from the best alignment or consensus) to still be functional. HPAM is available for public use in http://soar-tools.wustl.edu.
KW - Fuzzy sets
KW - Gene regulation
KW - Multi-objective evolutionary algorithms
KW - Pattern recognition
KW - Prokaryotic promoters
KW - RNA polymerase
KW - Time delay neural networks
UR - http://www.scopus.com/inward/record.url?scp=15344349017&partnerID=8YFLogxK
U2 - 10.1016/j.fss.2004.10.016
DO - 10.1016/j.fss.2004.10.016
M3 - Article
AN - SCOPUS:15344349017
SN - 0165-0114
VL - 152
SP - 83
EP - 102
JO - Fuzzy Sets and Systems
JF - Fuzzy Sets and Systems
IS - 1
ER -