TY - JOUR
T1 - Additivity in protein-DNA interactions
T2 - How good an approximation is it?
AU - Benos, Panayiotis V.
AU - Bulyk, Martha L.
AU - Stormo, Gary D.
N1 - Funding Information:
The Optoelectronics Research Centre is an Interdisciplinary Research Centre of the United Kingdom Engineering and Physical Sciences Research Council (EPSRC). D. M. Atkin is the recipient of an EPSRC CASE award with the Defence Research Agency, Malvern.
PY - 2002/10/15
Y1 - 2002/10/15
N2 - Man and Stormo and Bulyk et al. recently presented their results on the study of the DNA binding affinity of proteins. In both of these studies the main conclusion is that the additivity assumption, usually applied in methods to search for binding sites, is not true. In the first study, the analysis of binding affinity data from the Mnt repressor protein bound to all possible DNA (sub)targets at positions 16 and 17 of the binding site, showed that those positions are not independent. In the second study, the authors analysed DNA binding affinity data of the wild-type mouse EGR1 protein and four variants differing on the middle finger. The binding affinity of these proteins was measured to all 64 possible trinucleotide (sub)targets of the middle finger using microarray technology. The analysis of the measurements also showed interdependence among the positions in the DNA target. In the present report, we review the data of both studies and we reanalyse them using various statistical methods, including a comparison with a multiple regression approach. We conclude that despite the fact that the additivity assumption does not fit the data perfectly, in most cases it provides a very good approximation of the true nature of the specific protein-DNA interactions. Therefore, additive models can be very useful for the discovery and prediction of binding sites in genomic DNA.
AB - Man and Stormo and Bulyk et al. recently presented their results on the study of the DNA binding affinity of proteins. In both of these studies the main conclusion is that the additivity assumption, usually applied in methods to search for binding sites, is not true. In the first study, the analysis of binding affinity data from the Mnt repressor protein bound to all possible DNA (sub)targets at positions 16 and 17 of the binding site, showed that those positions are not independent. In the second study, the authors analysed DNA binding affinity data of the wild-type mouse EGR1 protein and four variants differing on the middle finger. The binding affinity of these proteins was measured to all 64 possible trinucleotide (sub)targets of the middle finger using microarray technology. The analysis of the measurements also showed interdependence among the positions in the DNA target. In the present report, we review the data of both studies and we reanalyse them using various statistical methods, including a comparison with a multiple regression approach. We conclude that despite the fact that the additivity assumption does not fit the data perfectly, in most cases it provides a very good approximation of the true nature of the specific protein-DNA interactions. Therefore, additive models can be very useful for the discovery and prediction of binding sites in genomic DNA.
UR - http://www.scopus.com/inward/record.url?scp=0037109192&partnerID=8YFLogxK
U2 - 10.1093/nar/gkf578
DO - 10.1093/nar/gkf578
M3 - Article
C2 - 12384591
AN - SCOPUS:0037109192
SN - 0305-1048
VL - 30
SP - 4442
EP - 4451
JO - Nucleic acids research
JF - Nucleic acids research
IS - 20
ER -