TY - JOUR
T1 - Prospective and External Validation of an Ensemble Learning Approach to Sensitively Detect Intravenous Fluid Contamination in Basic Metabolic Panels
AU - Spies, Nicholas C.
AU - Militello, Leah
AU - Farnsworth, Christopher W.
AU - El-Khoury, Joe M.
AU - Durant, Thomas J.S.
AU - Zaydman, Mark A.
N1 - Publisher Copyright:
© Association for Diagnostics & Laboratory Medicine 2024. All rights reserved.
PY - 2025/2/1
Y1 - 2025/2/1
N2 - BACKGROUND: Intravenous (IV) fluid contamination within clinical specimens causes an operational burden on the laboratory when detected, and potential patient harm when undetected. Even mild contamination is often sufficient to meaningfully alter results across multiple analytes. A recently reported unsupervised learning approach was more sensitive than routine workflows, but still lacked sensitivity to mild but significant contamination. Here, we leverage ensemble learning to more sensitively detect contaminated results using an approach which is explainable and generalizable across institutions. METHODS: An ensemble-based machine learning pipeline of general and fluid-specific models was trained on real-world and simulated contamination and internally and externally validated. Benchmarks for performance assessment were derived from in silico simulations, in vitro experiments, and expert review. Fluid-specific regression models estimated contamination severity. SHapley Additive exPlanation (SHAP) values were calculated to explain specimen-level predictions, and algorithmic fairness was evaluated by comparing flag rates across demographic and clinical subgroups. RESULTS: The sensitivities, specificities, and Matthews correlation coefficients were 0.858, 0.993, and 0.747 for the internal validation set, and 1.00, 0.980, and 0.387 for the external set. SHAP values provided plausible explanations for dextrose- and ketoacidosis-related hyperglycemia. Flag rates from the pipeline were higher than the current workflow, with improved detection of contamination events expected to exceed allowable limits for measurement error and reference change values. CONCLUSIONS: An accurate, generalizable, and explainable ensemble-based machine learning pipeline was developed and validated for sensitively detecting IV fluid contamination. Implementing this pipeline would help identify errors that are poorly detected by current clinical workflows and a previously described unsupervised machine learning-based method.
AB - BACKGROUND: Intravenous (IV) fluid contamination within clinical specimens causes an operational burden on the laboratory when detected, and potential patient harm when undetected. Even mild contamination is often sufficient to meaningfully alter results across multiple analytes. A recently reported unsupervised learning approach was more sensitive than routine workflows, but still lacked sensitivity to mild but significant contamination. Here, we leverage ensemble learning to more sensitively detect contaminated results using an approach which is explainable and generalizable across institutions. METHODS: An ensemble-based machine learning pipeline of general and fluid-specific models was trained on real-world and simulated contamination and internally and externally validated. Benchmarks for performance assessment were derived from in silico simulations, in vitro experiments, and expert review. Fluid-specific regression models estimated contamination severity. SHapley Additive exPlanation (SHAP) values were calculated to explain specimen-level predictions, and algorithmic fairness was evaluated by comparing flag rates across demographic and clinical subgroups. RESULTS: The sensitivities, specificities, and Matthews correlation coefficients were 0.858, 0.993, and 0.747 for the internal validation set, and 1.00, 0.980, and 0.387 for the external set. SHAP values provided plausible explanations for dextrose- and ketoacidosis-related hyperglycemia. Flag rates from the pipeline were higher than the current workflow, with improved detection of contamination events expected to exceed allowable limits for measurement error and reference change values. CONCLUSIONS: An accurate, generalizable, and explainable ensemble-based machine learning pipeline was developed and validated for sensitively detecting IV fluid contamination. Implementing this pipeline would help identify errors that are poorly detected by current clinical workflows and a previously described unsupervised machine learning-based method.
UR - http://www.scopus.com/inward/record.url?scp=85217152128&partnerID=8YFLogxK
U2 - 10.1093/clinchem/hvae168
DO - 10.1093/clinchem/hvae168
M3 - Article
C2 - 39545815
AN - SCOPUS:85217152128
SN - 0009-9147
VL - 71
SP - 296
EP - 306
JO - Clinical chemistry
JF - Clinical chemistry
IS - 2
ER -