Back to Search
Start Over
Extracting Medical Information from Paper COVID-19 Assessment Forms
- Source :
- Appl Clin Inform
- Publication Year :
- 2021
- Publisher :
- Georg Thieme Verlag KG, 2021.
-
Abstract
- Objective This study examines the validity of optical mark recognition, a novel user interface, and crowdsourced data validation to rapidly digitize and extract data from paper COVID-19 assessment forms at a large medical center. Methods An optical mark recognition/optical character recognition (OMR/OCR) system was developed to identify fields that were selected on 2,814 paper assessment forms, each with 141 fields which were used to assess potential COVID-19 infections. A novel user interface (UI) displayed mirrored forms showing the scanned assessment forms with OMR results superimposed on the left and an editable web form on the right to improve ease of data validation. Crowdsourced participants validated the results of the OMR system. Overall error rate and time taken to validate were calculated. A subset of forms was validated by multiple participants to calculate agreement between participants. Results The OMR/OCR tools correctly extracted data from scanned forms fields with an average accuracy of 70% and median accuracy of 78% when the OMR/OCR results were compared with the results from crowd validation. Scanned forms were crowd-validated at a mean rate of 157 seconds per document and a volume of approximately 108 documents per day. A randomly selected subset of documents was reviewed by multiple participants, producing an interobserver agreement of 97% for documents when narrative-text fields were included and 98% when only Boolean and multiple-choice fields were considered. Conclusion Due to the COVID-19 pandemic, it may be challenging for health care workers wearing personal protective equipment to interact with electronic health records. The combination of OMR/OCR technology, a novel UI, and crowdsourcing data-validation processes allowed for the efficient extraction of a large volume of paper medical documents produced during the COVID-19 pandemic.
- Subjects :
- Optical mark recognition
Health Information Exchange
Coronavirus disease 2019 (COVID-19)
Computer science
Information Storage and Retrieval
Data validation
Word error rate
Health Informatics
Crowdsourcing
computer.software_genre
01 natural sciences
User-Computer Interface
03 medical and health sciences
0302 clinical medicine
Health Information Management
Physicians
Humans
030212 general & internal medicine
0101 mathematics
Information retrieval
business.industry
010102 general mathematics
Volume (computing)
COVID-19
Optical character recognition
Computer Science Applications
User interface
business
computer
Subjects
Details
- ISSN :
- 18690327
- Volume :
- 12
- Database :
- OpenAIRE
- Journal :
- Applied Clinical Informatics
- Accession number :
- edsair.doi.dedup.....d11c6fedc628d783f55034630e99262b
- Full Text :
- https://doi.org/10.1055/s-0041-1723024