Back to Search Start Over

Nonspecific deidentification of date-like text in deidentified clinical notes enables reidentification of dates

Authors :
Jes, Alexander
Alexis, Beatty
Source :
Journal of the American Medical Informatics Association. 29:1967-1971
Publication Year :
2022
Publisher :
Oxford University Press (OUP), 2022.

Abstract

To facilitate the secondary usage of electronic health record data for research, the University of California, San Francisco (UCSF) recently implemented a clinical data warehouse including, among other data, deidentified clinical notes and reports, which are available to UCSF researchers without Institutional Review Board approval. For deidentification of these notes, most of the Health Insurance Portability and Accountability Act identifiers are redacted, but dates are transformed by shifting all dates for a patient back by the same random number of days. We describe an issue in which nonspecific (ie, excess) transformation of nondate, date-like text by this deidentification process enables reidentification of all dates, including birthdates, for certain patients. This issue undercuts the common assumption that excess deidentification is a safe tradeoff to protect patient privacy. We present this issue as a caution to other institutions that may also be considering releasing deidentified notes for research.

Details

ISSN :
1527974X and 10675027
Volume :
29
Database :
OpenAIRE
Journal :
Journal of the American Medical Informatics Association
Accession number :
edsair.doi.dedup.....2e5a1c57774ae7576acb265ac5042d65