1. Comparing methods for identifying pancreatic cancer patients using electronic data sources.
- Author
-
Friedlin J, Overhage M, Al-Haddad MA, Waters JA, Aguilar-Saavedra JJ, Kesterson J, and Schmidt M
- Subjects
- Algorithms, Clinical Coding, Humans, Pancreatic Neoplasms, Sensitivity and Specificity, International Classification of Diseases, Natural Language Processing
- Abstract
We sought to determine the accuracy of two electronic methods of identifying pancreatic cancer in a cohort of pancreatic cyst patients, and to examine the reasons for identification failure. We used the International Classification of Diseases, 9(th) Edition (ICD-9) codes and natural language processing (NLP) technology to identify pancreatic cancer in these patients. We compared both methods to a human-validated gold-standard surgical database. Both ICD-9 codes and NLP technology achieved high sensitivity for identifying pancreatic cancer, but the ICD-9 code method achieved markedly lower specificity and PPV compared to the NLP method. The NLP method required only slightly greater expenditures of time and effort compared to the ICD-9 code method. We identified several variables influencing the accuracy of ICD-9 codes to identify cancer patients including: the identification algorithm, kind of cancer to be identified, presence of other conditions similar to cancer, and presence of conditions that are precancerous.
- Published
- 2010