Back to Search Start Over

SIENA: Semi-automatic semantic enhancement of datasets using concept recognition

Authors :
Michel Dumontier
Amrapali Zaveri
Andreea Grigoriu
Gerhard Weiss
RS: FdR Institute M-EPLI
RS: FdR Research Group Law and Tech Lab
Private Law
Institute of Data Science
Dept. of Advanced Computing Sciences
RS: FSE DACS
RS: FSE Studio Europa Maastricht
RS: FSE DACS IDS
Dep. of Advanced Computing Sciences
Source :
Journal of Biomedical Semantics, Journal of Biomedical Semantics, Vol 12, Iss 1, Pp 1-12 (2021), Journal of biomedical semantics, 12(1):5. BioMed Central Ltd, 27th Conference on Intelligent Systems for Molecular Biology and the 18th European Conference on Computational Biology and the 18th European Conference on Computational Biology (ISMB and ECCB)
Publication Year :
2019

Abstract

Background The amount of available data, which can facilitate answering scientific research questions, is growing. However, the different formats of published data are expanding as well, creating a serious challenge when multiple datasets need to be integrated for answering a question. Results This paper presents a semi-automated framework that provides semantic enhancement of biomedical data, specifically gene datasets. The framework involved a concept recognition task using machine learning, in combination with the BioPortal annotator. Compared to using methods which require only the BioPortal annotator for semantic enhancement, the proposed framework achieves the highest results. Conclusions Using concept recognition combined with machine learning techniques and annotation with a biomedical ontology, the proposed framework can provide datasets to reach their full potential of providing meaningful information, which can answer scientific research questions.

Details

ISSN :
20411480
Volume :
12
Issue :
1
Database :
OpenAIRE
Journal :
Journal of biomedical semantics
Accession number :
edsair.doi.dedup.....8fc28dcb2944a5bee7341a535b9ad9d4