1. Establishing a sorting protocol for healthcare databases
- Author
-
Laurie E. AbiHabib, Myriam Mrad Nakhle, Maher Abboud, Isabella Annesi-Maesano, Wehbeh Farah, Nelly Ziade, Elias Chalhoub, Elie Ghabi, Hibade, Monique, University of Balamand [Liban] (UOB), Université Saint-Joseph de Beyrouth (USJ), Institut Pierre Louis d'Epidémiologie et de Santé Publique (iPLESP), Institut National de la Santé et de la Recherche Médicale (INSERM)-Sorbonne Université (SU), Epidemiology of Allergic and Respiratory Diseases Department [iPlesp] (EPAR), Institut National de la Santé et de la Recherche Médicale (INSERM)-Sorbonne Université (SU)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Sorbonne Université (SU), Institut Desbrest de santé publique (IDESP), and Institut National de la Santé et de la Recherche Médicale (INSERM)-Université de Montpellier (UM)
- Subjects
Sorting algorithm ,Computer science ,State of health ,Process (engineering) ,[SDV]Life Sciences [q-bio] ,010501 environmental sciences ,computer.software_genre ,01 natural sciences ,Article ,03 medical and health sciences ,health record ,0302 clinical medicine ,Health care ,data cleaning ,database ,0105 earth and related environmental sciences ,Protocol (science) ,Database ,business.industry ,lcsh:Public aspects of medicine ,Sorting ,lcsh:RA1-1270 ,[SDV] Life Sciences [q-bio] ,Sorting protocol ,030228 respiratory system ,Categorization ,Data extraction ,business ,computer - Abstract
Background: Health information records in many countries, especially developing countries, are still paper based. Compared to electronic systems, paper-based systems are disadvantageous in terms of data storage and data extraction. Given the importance of health records for epidemiological studies, guidelines for effective data cleaning and sorting are essential. They are, however, largely absent from the literature. The following paper discusses the process by which an algorithm was developed for the cleaning and sorting of a database generated from emergency department records in Lebanon. Design and methods: Demographic and health related information were extracted from the emergency department records of three hospitals in Beirut. Appropriate categories were selected for data categorization. For health information, disease categories and codes were selected according to the International Classification of Disease 10th Edition. Results: A total of 16,537 entries were collected. Demographic information was categorized into groups for future epidemiological studies. Analysis of the health information led to the creation of a sorting algorithm which was then used to categorize and code the health data. Several counts were then performed to represent and visualize the data numerically and graphically. Conclusions: The article describes the current state of health information records in Lebanon and the associated disadvantages of a paper-based system in terms of storage and data extraction. Furthermore, the article describes the algorithm by which health information was sorted and categorized to allow for future data analysis using paper records. Significance for public health In our protocol, we explain the process by which health information collected from paper-based records were analyzed to develop an algorithm to allocate appropriate ICD 10 disease codes and categories to each entry. The algorithm allows for the sequential analysis of the information present and serves to overcome the issue of incomplete records. This is especially relevant in developing countries where paper-based records are highly relied upon, noting that our protocol has already been applied to a study in the Ivory Coast. The protocol allows for a health information database to be cleaned and sorted for research in environmental, social and occupational health. Furthermore, since data cleaning and sorting protocols are sparse, it is crucial to establish a standardized tool to enhance the quality of the conducted research.
- Published
- 2021
- Full Text
- View/download PDF