TY - GEN
T1 - Near-infrared Raman spectroscopy for in-vivo diagnosis of cervical dysplasia - A probability-based multi-class diagnostic algorithm
AU - Majumder, Shovan K.
AU - Kanter, Elizabeth
AU - Viehoever, Amy Robichaux
AU - Jones, Howard
AU - Mahadevan-Jansen, Anita
PY - 2007
Y1 - 2007
N2 - We report the development of a probability-based multi-class diagnostic algorithm to simultaneously distinguish high-grade dysplasia from low-grade dysplasia, squamous metaplasia as well as normal human cervical tissues using near-infrared Raman spectra acquired in-vivo from the cervix of patients at the Vanderbilt University Medical Center. Extraction of diagnostic features from the Raman spectra uses the recently formulated theory of nonlinear Maximum Representation and Discrimination Feature (MRDF), and classification into respective tissue categories is based on the theory of Sparse Multinomial Logistic Regression (SMLR), a recent Bayesian machine-learning framework of statistical pattern recognition. The algorithm based on MRDF and SMLR was found to provide very good diagnostic performance with a predictive accuracy of ∼90% based on leave-one-out cross validation in classifying the tissue Raman spectra into the four different classes, using histology as the "gold standard". The inherently multi-class nature of the algorithm facilitates a rapid and simultaneous classification of tissue spectra into various tissue categories without the need to train and heuristically combine multiple binary classifiers. Further, the probabilistic framework of the algorithm makes it possible to predict the posterior probability of class membership in discriminating the different tissue types.
AB - We report the development of a probability-based multi-class diagnostic algorithm to simultaneously distinguish high-grade dysplasia from low-grade dysplasia, squamous metaplasia as well as normal human cervical tissues using near-infrared Raman spectra acquired in-vivo from the cervix of patients at the Vanderbilt University Medical Center. Extraction of diagnostic features from the Raman spectra uses the recently formulated theory of nonlinear Maximum Representation and Discrimination Feature (MRDF), and classification into respective tissue categories is based on the theory of Sparse Multinomial Logistic Regression (SMLR), a recent Bayesian machine-learning framework of statistical pattern recognition. The algorithm based on MRDF and SMLR was found to provide very good diagnostic performance with a predictive accuracy of ∼90% based on leave-one-out cross validation in classifying the tissue Raman spectra into the four different classes, using histology as the "gold standard". The inherently multi-class nature of the algorithm facilitates a rapid and simultaneous classification of tissue spectra into various tissue categories without the need to train and heuristically combine multiple binary classifiers. Further, the probabilistic framework of the algorithm makes it possible to predict the posterior probability of class membership in discriminating the different tissue types.
KW - Cervical dysplasia
KW - Maximum Representation and Discrimination Feature (MRDF)
KW - Multi-class diagnostic algorithm
KW - Posterior probability
KW - Raman spectroscopy
KW - Sparse Multinomial Logistic Regression (SMLR)
UR - http://www.scopus.com/inward/record.url?scp=34247354981&partnerID=8YFLogxK
U2 - 10.1117/12.724873
DO - 10.1117/12.724873
M3 - Conference contribution
AN - SCOPUS:34247354981
SN - 0819465437
SN - 9780819465436
T3 - Progress in Biomedical Optics and Imaging - Proceedings of SPIE
BT - Advanced Biomedical and Clinical Diagnostic Systems V
T2 - Advanced Biomedical and Clinical Diagnostic Systems V
Y2 - 21 January 2007 through 27 January 2007
ER -