TY - JOUR
T1 - Bias associated with mining electronic health records
AU - Hripcsak, George
AU - Knirsch, Charles
AU - Zhou, Li
AU - Wilcox, Adam
AU - Melton, Genevieve B.
PY - 2011
Y1 - 2011
N2 - Large-scale electronic health record research introduces biases compared to traditional manually curated retrospective research. We used data from a community-acquired pneumonia study for which we had a gold standard to illustrate such biases. The challenges include data inaccuracy, incompleteness, and complexity, and they can produce in distorted results. We found that a nal̈ve approach approximated the gold standard, but errors ona minority of cases shifted mortality substantially. Manual review revealed errors in both selecting and characterizing the cohort, and narrowing the cohort improved the result. Nevertheless, a significantly narrowed cohort might contain its own biases that would be difficult to estimate.
AB - Large-scale electronic health record research introduces biases compared to traditional manually curated retrospective research. We used data from a community-acquired pneumonia study for which we had a gold standard to illustrate such biases. The challenges include data inaccuracy, incompleteness, and complexity, and they can produce in distorted results. We found that a nal̈ve approach approximated the gold standard, but errors ona minority of cases shifted mortality substantially. Manual review revealed errors in both selecting and characterizing the cohort, and narrowing the cohort improved the result. Nevertheless, a significantly narrowed cohort might contain its own biases that would be difficult to estimate.
UR - http://www.scopus.com/inward/record.url?scp=84862855859&partnerID=8YFLogxK
U2 - 10.5210%2Fdisco.v6i0.3634
DO - 10.5210%2Fdisco.v6i0.3634
M3 - Article
AN - SCOPUS:84862855859
SN - 1747-5333
VL - 6
SP - 48
EP - 52
JO - Journal of Biomedical Discovery and Collaboration
JF - Journal of Biomedical Discovery and Collaboration
IS - 1
ER -