Identifying target sites for cooperatively binding factors

D. GuhaThakurta, G. D. Stormo

Research output: Contribution to journalArticlepeer-review

146 Scopus citations

Abstract

Motivation: Transcriptional activation in eukaryotic organisms normally requires combinatorial interactions of multiple transcription factors. Though several methods exist for identification of individual protein binding site patterns in DNA sequences, there are few methods for discovery of binding site patterns for cooperatively acting factors. Here we present an algorithm, Co-Bind (for COperative BINDing), for discovering DNA target sites for cooperatively acting transcription factors. The method utilizes a Gibbs sampling strategy to model the cooperativity between two transcription factors and defines position weight matrices for the binding sites. Sequences from both the training set and the entire genome are taken into account, in order to discriminate against commonly occurring patterns in the genome, and produce patterns which are significant only in the training set. Results: We have tested Co-Bind on semi-synthetic and real data sets to show it can efficiently identify DNA target site patterns for cooperatively binding transcription factors. In cases where binding site patterns are weak and cannot be identified by other available methods, Co-Bind, by virtue of modeling the cooperativity between factors, can identify those sites efficiently. Though developed to model protein-DNA interactions, the scope of Co-Bind may be extended to combinatorial, sequence specific, interactions in other macromolecules.

Original languageEnglish
Pages (from-to)608-621
Number of pages14
JournalBioinformatics
Volume17
Issue number7
DOIs
StatePublished - 2001

Fingerprint

Dive into the research topics of 'Identifying target sites for cooperatively binding factors'. Together they form a unique fingerprint.

Cite this