TY - JOUR
T1 - 'Black Box' to 'Conversational' Machine Learning
T2 - Ondansetron Reduces Risk of Hospital-Acquired Venous Thromboembolism
AU - Datta, Arghya
AU - Matlock, Matthew K.
AU - Le Dang, Na
AU - Moulin, Thiago
AU - Woeltje, Keith F.
AU - Yanik, Elizabeth L.
AU - Joshua Swamidass, Sanjay
N1 - Funding Information:
Manuscript received November 21, 2019; revised July 17, 2020 and October 8, 2020; accepted October 16, 2020. Date of publication October 23, 2020; date of current version June 4, 2021. This work was supported by the National Library Of Medicine of the National Institutes of Health under award numbers R01LM012222 and R01LM012482. Computations were performed using the facilities of the Washington University Center for High Performance Computing, which were partially funded by NIH under Grants 1S10RR022984-01A1 and 1S10OD018091-01. (Arghya Datta is first author.) (Corresponding author: Sanjay Joshua Swamidass.) Arghya Datta is with the Department of Computer Science and Engineering, Washington University in Saint Louis, Saint Louis, MO 63130 USA (e-mail: arghya@wustl.edu).
Publisher Copyright:
© 2013 IEEE.
PY - 2021/6
Y1 - 2021/6
N2 - Machine learning, combined with a proliferation of electronic healthcare records (EHR), has the potential to transform medicine by identifying previously unknown interventions that reduce the risk of adverse outcomes. To realize this potential, machine learning must leave the conceptual 'black box' in complex domains to overcome several pitfalls, like the presence of confounding variables. These variables predict outcomes but are not causal, often yielding uninformative models. In this work, we envision a 'conversational' approach to design machine learning models, which couple modeling decisions to domain expertise. We demonstrate this approach via a retrospective cohort study to identify factors which affect the risk of hospital-acquired venous thromboembolism (HA-VTE). Using logistic regression for modeling, we have identified drugs that reduce the risk of HA-VTE. Our analysis reveals that ondansetron, an anti-nausea and anti-emetic medication, commonly used in treating side-effects of chemotherapy and post-general anesthesia period, substantially reduces the risk of HA-VTE when compared to aspirin (11% vs. 15% relative risk reduction or RRR, respectively). The low cost and low morbidity of ondansetron may justify further inquiry into its use as a preventative agent for HA-VTE. This case study highlights the importance of engaging domain expertise while applying machine learning in complex domains.
AB - Machine learning, combined with a proliferation of electronic healthcare records (EHR), has the potential to transform medicine by identifying previously unknown interventions that reduce the risk of adverse outcomes. To realize this potential, machine learning must leave the conceptual 'black box' in complex domains to overcome several pitfalls, like the presence of confounding variables. These variables predict outcomes but are not causal, often yielding uninformative models. In this work, we envision a 'conversational' approach to design machine learning models, which couple modeling decisions to domain expertise. We demonstrate this approach via a retrospective cohort study to identify factors which affect the risk of hospital-acquired venous thromboembolism (HA-VTE). Using logistic regression for modeling, we have identified drugs that reduce the risk of HA-VTE. Our analysis reveals that ondansetron, an anti-nausea and anti-emetic medication, commonly used in treating side-effects of chemotherapy and post-general anesthesia period, substantially reduces the risk of HA-VTE when compared to aspirin (11% vs. 15% relative risk reduction or RRR, respectively). The low cost and low morbidity of ondansetron may justify further inquiry into its use as a preventative agent for HA-VTE. This case study highlights the importance of engaging domain expertise while applying machine learning in complex domains.
KW - Venous thromboembolism
KW - clinical informatics
KW - electronic healthcare records
KW - machine learning
KW - prediction models
UR - http://www.scopus.com/inward/record.url?scp=85105420748&partnerID=8YFLogxK
U2 - 10.1109/JBHI.2020.3033405
DO - 10.1109/JBHI.2020.3033405
M3 - Article
C2 - 33095721
AN - SCOPUS:85105420748
SN - 2168-2194
VL - 25
SP - 2204
EP - 2214
JO - IEEE Journal of Biomedical and Health Informatics
JF - IEEE Journal of Biomedical and Health Informatics
IS - 6
M1 - 9238396
ER -