Abstract
Electronic clinical documentation can be useful for activities such as public health surveillance, quality improvement, and research, but existing methods of de-identification may not provide sufficient protection of patient data. The general-purpose natural language processor MedLEE retains medical concepts while excluding the remaining text so, in addition to processing text into structured data, it may be able provide a secondary benefit of de-identification. Without modifying the system, the authors tested the ability of MedLEE to remove protected health information (PHI) by comparing 100 outpatient clinical notes with the corresponding XML-tagged output. Of 809 instances of PHI, 26 (3.2%) were detected in output as a result of processing and identification errors. However, PHI in the output was highly transformed, much appearing as normalized terms for medical concepts, potentially making re-identification more difficult. The MedLEE processor may be a good enhancement to other de-identification systems, both removing PHI and providing coded data from clinical text.
| Original language | English |
|---|---|
| Pages (from-to) | 37-39 |
| Number of pages | 3 |
| Journal | Journal of the American Medical Informatics Association |
| Volume | 16 |
| Issue number | 1 |
| DOIs | |
| State | Published - Jan 2009 |
Fingerprint
Dive into the research topics of 'Repurposing the Clinical Record: Can an Existing Natural Language Processing System De-identify Clinical Notes?'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver