Back to Search
Start Over
An ontology-based approach for developing a harmonised data-validation tool for European cancer registration
- Source :
- Journal of Biomedical Semantics, r-FISABIO. Repositorio Institucional de Producción Científica, instname, Journal of Biomedical Semantics, Vol 12, Iss 1, Pp 1-15 (2021)
- Publication Year :
- 2021
- Publisher :
- BioMed Central, 2021.
-
Abstract
- Background Population-based cancer registries constitute an important information source in cancer epidemiology. Studies collating and comparing data across regional and national boundaries have proved important for deploying and evaluating effective cancer-control strategies. A critical aspect in correctly comparing cancer indicators across regional and national boundaries lies in ensuring a good and harmonised level of data quality, which is a primary motivator for a centralised collection of pseudonymised data. The recent introduction of the European Union’s general data-protection regulation (GDPR) imposes stricter conditions on the collection, processing, and sharing of personal data. It also considers pseudonymised data as personal data. The new regulation motivates the need to find solutions that allow a continuation of the smooth processes leading to harmonised European cancer-registry data. One element in this regard would be the availability of a data-validation software tool based on a formalised depiction of the harmonised data-validation rules, allowing an eventual devolution of the data-validation process to the local level. Results A semantic data model was derived from the data-validation rules for harmonising cancer-data variables at European level. The data model was encapsulated in an ontology developed using the Web-Ontology Language (OWL) with the data-model entities forming the main OWL classes. The data-validation rules were added as axioms in the ontology. The reasoning function of the resulting ontology demonstrated its ability to trap registry-coding errors and in some instances to be able to correct errors. Conclusions Describing the European cancer-registry core data set in terms of an OWL ontology affords a tool based on a formalised set of axioms for validating a cancer-registry’s data set according to harmonised, supra-national rules. The fact that the data checks are inherently linked to the data model would lead to less maintenance overheads and also allow automatic versioning synchronisation, important for distributed data-quality checking processes.
- Subjects :
- Computer Networks and Communications
Computer science
Population
Data validation
Health Informatics
Ontology (information science)
Semantic data model
lcsh:Computer applications to medicine. Medical informatics
03 medical and health sciences
Data harmonisation
Neoplasms
Humans
media_common.cataloged_instance
Data federation
European union
education
Language
030304 developmental biology
0505 law
computer.programming_language
media_common
0303 health sciences
education.field_of_study
Ontology
05 social sciences
Web Ontology Language
Cancer registry
Data science
Computer Science Applications
Data model
Data quality
050501 criminology
lcsh:R858-859.7
computer
Software
Semantic web
Information Systems
Subjects
Details
- ISSN :
- 20411480
- Database :
- OpenAIRE
- Journal :
- Journal of Biomedical Semantics, r-FISABIO. Repositorio Institucional de Producción Científica, instname, Journal of Biomedical Semantics, Vol 12, Iss 1, Pp 1-15 (2021)
- Accession number :
- edsair.doi.dedup.....55987ee09114a6b24e570937f55e32d9