Abstract
Expectation maximization and Gibbs’ sampling are two statistical approaches used to identify transcription factor binding sites and the motif that represents them. Both take as input unaligned sequences and search for a statistically significant alignment of putative binding sites. Expectation maximization is deterministic so that starting with the same initial parameters will always converge to the same solution, making it wise to start it multiple times from different initial parameters. Gibbs’ sampling is stochastic so that it may arrive at different solutions from the same initial parameters. In both cases multiple runs are advised because comparisons of the solutions after each run can indicate whether a global, optimum solution is likely to have been achieved.
Original language | English |
---|---|
Pages (from-to) | 85-95 |
Number of pages | 11 |
Journal | Methods in Molecular Biology |
Volume | 674 |
DOIs | |
State | Published - 2010 |
Keywords
- Expectation maximization
- Gibbs’ sampling
- Motif discovery
- Motif modeling
- Position frequency matrices
- Position weight matrices
- Regulatory sites
- Transcription factor binding sites