De novo identification and visualization of important cell populations for classic hodgkin lymphoma using flow cytometry and machine learning

  • Paul D. Simonson
  • , Yue Wu
  • , David Wu
  • , Jonathan R. Fromm
  • , Aaron Y. Lee

Research output: Contribution to journalArticlepeer-review

17 Scopus citations

Abstract

Objectives: Automated classification of flow cytometry data has the potential to reduce errors and accelerate flow cytometry interpretation. We desired a machine learning approach that is accurate, is intuitively easy to understand, and highlights the cells that are most important in the algorithm's prediction for a given case. Methods: We developed an ensemble of convolutional neural networks for classification and visualization of impactful cell populations in detecting classic Hodgkin lymphoma using two-dimensional (2D) histograms. Data from 977 and 245 clinical flow cytometry cases were used for training and testing, respectively. Seventy-eight nongated 2D histograms were created per flow cytometry file. Shapley additive explanation (SHAP) values were calculated to determine the most impactful 2D histograms and regions within histograms. SHAP values from all 78 histograms were then projected back to the original cell data for gating and visualization using standard flow cytometry software. Results: The algorithm achieved 67.7% recall (sensitivity), 82.4% precision, and 0.92 area under the receiver operating characteristic. Visualization of the important cell populations for individual predictions demonstrated correlations with known biology. Conclusions: The method presented enables model explainability while highlighting important cell populations in individual flow cytometry specimens, with potential applications in both diagnosis and discovery of previously overlooked key cell populations.

Original languageEnglish
Pages (from-to)1092-1102
Number of pages11
JournalAmerican journal of clinical pathology
Volume156
Issue number6
DOIs
StatePublished - Dec 1 2021

Keywords

  • CNN
  • Convolutional neural network
  • Ensemble classifier
  • Explainability
  • Explainable artificial intelligence
  • Flow cytometry
  • Hodgkin lymphoma
  • Machine learning
  • Random forest
  • SHAP

Fingerprint

Dive into the research topics of 'De novo identification and visualization of important cell populations for classic hodgkin lymphoma using flow cytometry and machine learning'. Together they form a unique fingerprint.

Cite this