TY - JOUR
T1 - Quantitative analysis of EGR proteins binding to DNA
T2 - Assessing additivity in both the binding site and the protein
AU - Liu, Jiajian
AU - Stormo, Gary D.
PY - 2005/7/13
Y1 - 2005/7/13
N2 - Background: Recognition codes for protein-DNA interactions typically assume that the interacting positions contribute additively to the binding energy. While this is known to not be precisely true, an additive model over the DNA positions can be a good approximation, at least for some proteins. Much less information is available about whether the protein positions contribute additively to the interaction. Results: Using EGR zinc finger proteins, we measure the binding affinity of six different variants of the protein to each of six different variants of the consensus binding site. Both the protein and binding site variants include single and double mutations that allow us to assess how well additive models can account for the data. For each protein and DNA alone we find that additive models are good approximations, but over the combined set of data there are context effects that limit their accuracy. However, a small modification to the purely additive model, with only three additional parameters, improves the fit significantly. Conclusion: The additive model holds very well for every DNA site and every protein included in this study, but clear context dependence in the interactions was detected. A simple modification to the independent model provides a better fit to the complete data.
AB - Background: Recognition codes for protein-DNA interactions typically assume that the interacting positions contribute additively to the binding energy. While this is known to not be precisely true, an additive model over the DNA positions can be a good approximation, at least for some proteins. Much less information is available about whether the protein positions contribute additively to the interaction. Results: Using EGR zinc finger proteins, we measure the binding affinity of six different variants of the protein to each of six different variants of the consensus binding site. Both the protein and binding site variants include single and double mutations that allow us to assess how well additive models can account for the data. For each protein and DNA alone we find that additive models are good approximations, but over the combined set of data there are context effects that limit their accuracy. However, a small modification to the purely additive model, with only three additional parameters, improves the fit significantly. Conclusion: The additive model holds very well for every DNA site and every protein included in this study, but clear context dependence in the interactions was detected. A simple modification to the independent model provides a better fit to the complete data.
UR - http://www.scopus.com/inward/record.url?scp=25444480735&partnerID=8YFLogxK
U2 - 10.1186/1471-2105-6-176
DO - 10.1186/1471-2105-6-176
M3 - Article
C2 - 16014175
AN - SCOPUS:25444480735
SN - 1471-2105
VL - 6
JO - BMC bioinformatics
JF - BMC bioinformatics
M1 - 176
ER -