TY - JOUR
T1 - Cis-regulatory element based targeted gene finding
T2 - Genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana
AU - Zhang, Weixiong
AU - Ruan, Jianhua
AU - Ho, Tuan Hua David
AU - You, Youngsook
AU - Yu, Taotao
AU - Quatrano, Ralph S.
N1 - Funding Information:
This research was supported in part by NSF grant EIA-0113618 and a grant from Monsanto Corporation to W.Z. and in part by a grant from Monsanto Corporation to R.S.Q. We thank the other members of W.Z. and R.S.Q.’s groups for the helpful discussions.
PY - 2005/7/15
Y1 - 2005/7/15
N2 - Motivation: A fundamental problem of computational genomics is identifying the genes that respond to certain endogenous cues and environmental stimuli. This problem can be referred to as targeted gene finding. Since gene regulation is mainly determined by the binding of transcription factors and cis-regulatory DNA sequences, most existing gene annotation methods, which exploit the conservation of open reading frames, are not effective in finding target genes. Results: A viable approach to targeted gene finding is to exploit the cis-regulatory elements that are known to be responsible for the transcription of target genes. Given such cis-elements, putative target genes whose promoters contain the elements can be identified. As a case study, we apply the above approach to predict the genes in model plant Arabidopsis thaliana which are inducible by a phytohormone, abscisic acid (ABA), and abiotic stress, such as drought, cold and salinity. We first construct and analyze two ABA specific cis-elements, ABA-responsive element (ABRE) and its coupling element (CE), in A.thaliana, based on their conservation in rice and other cereal plants. We then use the ABRE-CE module to identify putative ABA-responsive genes in A.thaliana. Based on RT-PCR verification and the results from literature, this method has an accuracy rate of 67.5% for the top 40 predictions. The cis-element based targeted gene finding approach is expected to be widely applicable since a large number of cis-elements in many species are available.
AB - Motivation: A fundamental problem of computational genomics is identifying the genes that respond to certain endogenous cues and environmental stimuli. This problem can be referred to as targeted gene finding. Since gene regulation is mainly determined by the binding of transcription factors and cis-regulatory DNA sequences, most existing gene annotation methods, which exploit the conservation of open reading frames, are not effective in finding target genes. Results: A viable approach to targeted gene finding is to exploit the cis-regulatory elements that are known to be responsible for the transcription of target genes. Given such cis-elements, putative target genes whose promoters contain the elements can be identified. As a case study, we apply the above approach to predict the genes in model plant Arabidopsis thaliana which are inducible by a phytohormone, abscisic acid (ABA), and abiotic stress, such as drought, cold and salinity. We first construct and analyze two ABA specific cis-elements, ABA-responsive element (ABRE) and its coupling element (CE), in A.thaliana, based on their conservation in rice and other cereal plants. We then use the ABRE-CE module to identify putative ABA-responsive genes in A.thaliana. Based on RT-PCR verification and the results from literature, this method has an accuracy rate of 67.5% for the top 40 predictions. The cis-element based targeted gene finding approach is expected to be widely applicable since a large number of cis-elements in many species are available.
UR - http://www.scopus.com/inward/record.url?scp=25144457599&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/bti490
DO - 10.1093/bioinformatics/bti490
M3 - Article
C2 - 15890746
AN - SCOPUS:25144457599
SN - 1367-4803
VL - 21
SP - 3074
EP - 3081
JO - Bioinformatics
JF - Bioinformatics
IS - 14
ER -