Site-Level Bioactivity of Small-Molecules from Deep-Learned Representations of Quantum Chemistry

Kathryn Sarullo, Matthew K. Matlock, S. Joshua Swamidass

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

Atom- or bond-level chemical properties of interest in medicinal chemistry, such as drug metabolism and electrophilic reactivity, are important to understand and predict across arbitrary new molecules. Deep learning can be used to map molecular structures to their chemical properties, but the data sets for these tasks are relatively small, which can limit accuracy and generalizability. To overcome this limitation, it would be preferable to model these properties on the basis of the underlying quantum chemical characteristics of small molecules. However, it is difficult to learn higher level chemical properties from lower level quantum calculations. To overcome this challenge, we pretrained deep learning models to compute quantum chemical properties and then reused the intermediate representations constructed by the pretrained network. Transfer learning, in this way, substantially outperformed models based on chemical graphs alone or quantum chemical properties alone. This result was robust, observable in five prediction tasks: identifying sites of epoxidation by metabolic enzymes and identifying sites of covalent reactivity with cyanide, glutathione, DNA and protein. We see that this approach may substantially improve the accuracy of deep learning models for specific chemical structures, such as aromatic systems.

Original languageEnglish
Pages (from-to)9194-9202
Number of pages9
JournalJournal of Physical Chemistry A
Volume124
Issue number44
DOIs
StatePublished - Nov 5 2020

Fingerprint

Dive into the research topics of 'Site-Level Bioactivity of Small-Molecules from Deep-Learned Representations of Quantum Chemistry'. Together they form a unique fingerprint.

Cite this