Abstract
Identifying the overrepresented substructures from a set of molecules with similar activity is a common task in chemical informatics. Existing substructure miners are deterministic, requiring the activity of all mined molecules to be known with high confidence. In contrast, we introduce pGraphSig, a probabilistic structure miner, which effectively mines structures from noisy data, where many molecules are labeled with their probability of being active. We benchmark pGraphSig on data from several small-molecule high throughput screens, finding that it can more effectively identify overrepresented structures than a deterministic structure miner.
| Original language | English |
|---|---|
| Pages (from-to) | 809-815 |
| Number of pages | 7 |
| Journal | Molecular Informatics |
| Volume | 30 |
| Issue number | 9 |
| DOIs | |
| State | Published - Sep 2011 |
Fingerprint
Dive into the research topics of 'Probabilistic substructure mining from small-molecule screens'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver