Probabilistic substructure mining from small-molecule screens

Sayan Ranu, Bradley T. Calhoun, Ambuj K. Singh, S. Joshua Swamidass

Research output: Contribution to journalArticle

13 Scopus citations


Identifying the overrepresented substructures from a set of molecules with similar activity is a common task in chemical informatics. Existing substructure miners are deterministic, requiring the activity of all mined molecules to be known with high confidence. In contrast, we introduce pGraphSig, a probabilistic structure miner, which effectively mines structures from noisy data, where many molecules are labeled with their probability of being active. We benchmark pGraphSig on data from several small-molecule high throughput screens, finding that it can more effectively identify overrepresented structures than a deterministic structure miner.

Original languageEnglish
Pages (from-to)809-815
Number of pages7
JournalMolecular Informatics
Issue number9
StatePublished - Sep 1 2011

Fingerprint Dive into the research topics of 'Probabilistic substructure mining from small-molecule screens'. Together they form a unique fingerprint.

  • Cite this