TY - JOUR
T1 - Leveraging knowledge engineering and machine learning for microbial bio-manufacturing
AU - Oyetunde, Tolutola
AU - Bao, Forrest Sheng
AU - Chen, Jiung Wen
AU - Martin, Hector Garcia
AU - Tang, Yinjie J.
N1 - Publisher Copyright:
© 2018 Elsevier Inc.
PY - 2018/7/1
Y1 - 2018/7/1
N2 - Genome scale modeling (GSM) predicts the performance of microbial workhorses and helps identify beneficial gene targets. GSM integrated with intracellular flux dynamics, omics, and thermodynamics have shown remarkable progress in both elucidating complex cellular phenomena and computational strain design (CSD). Nonetheless, these models still show high uncertainty due to a poor understanding of innate pathway regulations, metabolic burdens, and other factors (such as stress tolerance and metabolite channeling). Besides, the engineered hosts may have genetic mutations or non-genetic variations in bioreactor conditions and thus CSD rarely foresees fermentation rate and titer. Metabolic models play important role in design-build-test-learn cycles for strain improvement, and machine learning (ML) may provide a viable complementary approach for driving strain design and deciphering cellular processes. In order to develop quality ML models, knowledge engineering leverages and standardizes the wealth of information in literature (e.g., genomic/phenomic data, synthetic biology strategies, and bioprocess variables). Data driven frameworks can offer new constraints for mechanistic models to describe cellular regulations, to design pathways, to search gene targets, and to estimate fermentation titer/rate/yield under specified growth conditions (e.g., mixing, nutrients, and O2). This review highlights the scope of information collections, database constructions, and machine learning techniques (such as deep learning and transfer learning), which may facilitate “Learn and Design” for strain development.
AB - Genome scale modeling (GSM) predicts the performance of microbial workhorses and helps identify beneficial gene targets. GSM integrated with intracellular flux dynamics, omics, and thermodynamics have shown remarkable progress in both elucidating complex cellular phenomena and computational strain design (CSD). Nonetheless, these models still show high uncertainty due to a poor understanding of innate pathway regulations, metabolic burdens, and other factors (such as stress tolerance and metabolite channeling). Besides, the engineered hosts may have genetic mutations or non-genetic variations in bioreactor conditions and thus CSD rarely foresees fermentation rate and titer. Metabolic models play important role in design-build-test-learn cycles for strain improvement, and machine learning (ML) may provide a viable complementary approach for driving strain design and deciphering cellular processes. In order to develop quality ML models, knowledge engineering leverages and standardizes the wealth of information in literature (e.g., genomic/phenomic data, synthetic biology strategies, and bioprocess variables). Data driven frameworks can offer new constraints for mechanistic models to describe cellular regulations, to design pathways, to search gene targets, and to estimate fermentation titer/rate/yield under specified growth conditions (e.g., mixing, nutrients, and O2). This review highlights the scope of information collections, database constructions, and machine learning techniques (such as deep learning and transfer learning), which may facilitate “Learn and Design” for strain development.
KW - Deep learning
KW - Design-build-test-learn
KW - Genome scale modeling
KW - Metabolic burdens
UR - http://www.scopus.com/inward/record.url?scp=85046668709&partnerID=8YFLogxK
U2 - 10.1016/j.biotechadv.2018.04.008
DO - 10.1016/j.biotechadv.2018.04.008
M3 - Review article
C2 - 29729378
AN - SCOPUS:85046668709
SN - 0734-9750
VL - 36
SP - 1308
EP - 1315
JO - Biotechnology Advances
JF - Biotechnology Advances
IS - 4
ER -