TY - JOUR
T1 - Extracting seizure frequency from epilepsy clinic notes
T2 - A machine reading approach to natural language processing
AU - Xie, Kevin
AU - Gallagher, Ryan S.
AU - Conrad, Erin C.
AU - Garrick, Chadric O.
AU - Baldassano, Steven N.
AU - Bernabei, John M.
AU - Galer, Peter D.
AU - Ghosn, Nina J.
AU - Greenblatt, Adam S.
AU - Jennings, Tara
AU - Kornspun, Alana
AU - Kulick-Soper, Catherine V.
AU - Panchal, Jal M.
AU - Pattnaik, Akash R.
AU - Scheid, Brittany H.
AU - Wei, Danmeng
AU - Weitzman, Micah
AU - Muthukrishnan, Ramya
AU - Kim, Joongwon
AU - Litt, Brian
AU - Ellis, Colin A.
AU - Roth, Dan
N1 - Publisher Copyright:
© 2022 The Author(s) 2022. Published by Oxford University Press on behalf of the American Medical Informatics Association.
PY - 2022/5/1
Y1 - 2022/5/1
N2 - Objective: Seizure frequency and seizure freedom are among the most important outcome measures for patients with epilepsy. In this study, we aimed to automatically extract this clinical information from unstructured text in clinical notes. If successful, this could improve clinical decision-making in epilepsy patients and allow for rapid, large-scale retrospective research. Materials and Methods: We developed a finetuning pipeline for pretrained neural models to classify patients as being seizure-free and to extract text containing their seizure frequency and date of last seizure from clinical notes. We annotated 1000 notes for use as training and testing data and determined how well 3 pretrained neural models, BERT, RoBERTa, and Bio_ClinicalBERT, could identify and extract the desired information after finetuning. Results: The finetuned models (BERTFT, Bio_ClinicalBERTFT, and RoBERTaFT) achieved near-human performance when classifying patients as seizure free, with BERTFT and Bio_ClinicalBERTFT achieving accuracy scores over 80%. All 3 models also achieved human performance when extracting seizure frequency and date of last seizure, with overall F1 scores over 0.80. The best combination of models was Bio_ClinicalBERTFT for classification, and RoBERTaFT for text extraction. Most of the gains in performance due to finetuning required roughly 70 annotated notes. Discussion and Conclusion: Our novel machine reading approach to extracting important clinical outcomes performed at or near human performance on several tasks. This approach opens new possibilities to support clinical practice and conduct large-scale retrospective clinical research. Future studies can use our finetuning pipeline with minimal training annotations to answer new clinical questions.
AB - Objective: Seizure frequency and seizure freedom are among the most important outcome measures for patients with epilepsy. In this study, we aimed to automatically extract this clinical information from unstructured text in clinical notes. If successful, this could improve clinical decision-making in epilepsy patients and allow for rapid, large-scale retrospective research. Materials and Methods: We developed a finetuning pipeline for pretrained neural models to classify patients as being seizure-free and to extract text containing their seizure frequency and date of last seizure from clinical notes. We annotated 1000 notes for use as training and testing data and determined how well 3 pretrained neural models, BERT, RoBERTa, and Bio_ClinicalBERT, could identify and extract the desired information after finetuning. Results: The finetuned models (BERTFT, Bio_ClinicalBERTFT, and RoBERTaFT) achieved near-human performance when classifying patients as seizure free, with BERTFT and Bio_ClinicalBERTFT achieving accuracy scores over 80%. All 3 models also achieved human performance when extracting seizure frequency and date of last seizure, with overall F1 scores over 0.80. The best combination of models was Bio_ClinicalBERTFT for classification, and RoBERTaFT for text extraction. Most of the gains in performance due to finetuning required roughly 70 annotated notes. Discussion and Conclusion: Our novel machine reading approach to extracting important clinical outcomes performed at or near human performance on several tasks. This approach opens new possibilities to support clinical practice and conduct large-scale retrospective clinical research. Future studies can use our finetuning pipeline with minimal training annotations to answer new clinical questions.
KW - electronic medical record
KW - epilepsy
KW - natural language processing
KW - question-answering
UR - https://www.scopus.com/pages/publications/85128488775
U2 - 10.1093/jamia/ocac018
DO - 10.1093/jamia/ocac018
M3 - Article
C2 - 35190834
AN - SCOPUS:85128488775
SN - 1067-5027
VL - 29
SP - 873
EP - 881
JO - Journal of the American Medical Informatics Association
JF - Journal of the American Medical Informatics Association
IS - 5
ER -