Back to Search Start Over

Protected Health Information Recognition of Unstructured Code-Mixed Electronic Health Records in Taiwan.

Authors :
Lee YQ
Wang BH
Su CH
Chen PT
Lin WQ
Wu CS
Dai HJ
Source :
Studies in health technology and informatics [Stud Health Technol Inform] 2022 Jun 06; Vol. 290, pp. 627-631.
Publication Year :
2022

Abstract

Electronic health records (EHRs) at medical institutions provide valuable sources for research in both clinical and biomedical domains. However, before such records can be used for research purposes, protected health information (PHI) mentioned in the unstructured text must be removed. In Taiwan's EHR systems the unstructured EHR texts are usually represented in the mixing of English and Chinese languages, which brings challenges for de-identification. This paper presented the first study, to the best of our knowledge, of the construction of a code-mixed EHR de-identification corpus and the evaluation of different mature entity recognition methods applied for the code-mixed PHI recognition task.

Details

Language :
English
ISSN :
1879-8365
Volume :
290
Database :
MEDLINE
Journal :
Studies in health technology and informatics
Publication Type :
Academic Journal
Accession number :
35673092
Full Text :
https://doi.org/10.3233/SHTI220153