Novel sequence-based method for identifying transcription factor binding sites in prokaryotic genomes

Gurmukh Sahota, Gary D. Stormo

Research output: Contribution to journal, Article, peer-review

18 Scopus citations

Abstract

Motivation: Computational techniques for microbial genomic sequence analysis are becoming increasingly important. With next-generation sequencing technology and the Human Microbiome Project underway, current sequencing capacity is significantly greater than the speed at which organisms of interest can be studied experimentally. Most related computational work has focused on sequence assembly, gene annotation and metabolic network reconstruction. We have developed a method that primarily uses available sequence data to determine prokaryotic transcription factor (TF) binding specificities.

Results: Specificity-determining residues (critical residues) were identified from crystal structures of DNA-protein complexes, and TFs with the same critical residues were grouped into specificity classes. The putative binding regions for each class were defined as the set of promoters for each TF itself (autoregulatory) and for the immediately upstream and downstream operons. MEME was used to find putative motifs within each separate class. Tests on the LacI and TetR TF families, using RegulonDB annotated sites, showed prediction sensitivities of 86% and 80%, respectively.
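The grouping and evaluation steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the TF protein sequences are already aligned, so that the critical residues sit in the same columns for every TF. The TF names, sequences, column indices and site identifiers are invented for illustration, and the helper names `specificity_classes` and `sensitivity` are hypothetical. Sensitivity is computed in the standard way, as the fraction of annotated sites that are recovered. Matching sites by exact identifier is a simplifying assumption, since the abstract does not state the matching criterion.

```python
from collections import defaultdict


def specificity_classes(aligned_tfs, critical_positions):
    """Group TFs whose residues at the critical alignment columns are identical.

    aligned_tfs: dict mapping TF name -> aligned protein sequence (assumed input).
    critical_positions: 0-based alignment columns of the critical residues.
    """
    classes = defaultdict(list)
    for name, seq in aligned_tfs.items():
        # The class key is the string of residues found at the critical columns.
        key = "".join(seq[i] for i in critical_positions)
        classes[key].append(name)
    return dict(classes)


def sensitivity(predicted_sites, annotated_sites):
    """Fraction of annotated sites recovered: TP / (TP + FN)."""
    annotated = set(annotated_sites)
    if not annotated:
        raise ValueError("no annotated sites to compare against")
    return len(annotated & set(predicted_sites)) / len(annotated)


# Made-up toy data, for illustration only.
aligned_tfs = {
    "tfA": "MKRVTLYDVA",
    "tfB": "MKRVSLYDVA",
    "tfC": "MKQVTLHDVA",
}
critical_positions = [2, 6]  # hypothetical critical columns

classes = specificity_classes(aligned_tfs, critical_positions)
recovered = sensitivity({"s1", "s2", "s4"}, {"s1", "s2", "s3"})
```

In this toy input, tfA and tfB share the same residues at both critical columns and fall into one class, while tfC forms its own. In the workflow the abstract describes, the promoter regions collected for each class would then be passed to a motif finder such as MEME, a step this sketch omits.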

Original language: English
Article number: btq501
Pages (from-to): 2672-2677
Number of pages: 6
Journal: Bioinformatics
Volume: 26
Issue number: 21
State: Published - Nov 2010
