Back to Search
Start Over
Nonspecific deidentification of date-like text in deidentified clinical notes enables reidentification of dates
- Source :
- Journal of the American Medical Informatics Association. 29:1967-1971
- Publication Year :
- 2022
- Publisher :
- Oxford University Press (OUP), 2022.
-
Abstract
- To facilitate the secondary usage of electronic health record data for research, the University of California, San Francisco (UCSF) recently implemented a clinical data warehouse including, among other data, deidentified clinical notes and reports, which are available to UCSF researchers without Institutional Review Board approval. For deidentification of these notes, most of the Health Insurance Portability and Accountability Act identifiers are redacted, but dates are transformed by shifting all dates for a patient back by the same random number of days. We describe an issue in which nonspecific (ie, excess) transformation of nondate, date-like text by this deidentification process enables reidentification of all dates, including birthdates, for certain patients. This issue undercuts the common assumption that excess deidentification is a safe tradeoff to protect patient privacy. We present this issue as a caution to other institutions that may also be considering releasing deidentified notes for research.
Details
- ISSN :
- 1527974X and 10675027
- Volume :
- 29
- Database :
- OpenAIRE
- Journal :
- Journal of the American Medical Informatics Association
- Accession number :
- edsair.doi.dedup.....2e5a1c57774ae7576acb265ac5042d65