Back to Search
Start Over
The Empusa code generator and its application to GBOL, an extendable ontology for genome annotation
- Source :
- Scientific Data, Scientific Data 6 (2019) 1, Scientific Data, 6(1), Scientific Data, Vol 6, Iss 1, Pp 1-9 (2019)
- Publication Year :
- 2019
-
Abstract
- The RDF data model facilitates integration of diverse data available in structured and semi-structured formats. To obtain a coherent RDF graph the chosen ontology must be consistently applied. However, addition of new diverse data causes the ontology to evolve, which could lead to accumulation of unintended erroneous composites. Thus, there is a need for a gate keeping system that compares the intended content described in the ontology with the actual content of the resource. The Empusa code generator facilitates creation of composite RDF resources from disparate sources. Empusa can convert a schema into an associated application programming interface (API), that can be used to perform data consistency checks and generates Markdown documentation to make persistent URLs resolvable. Using Empusa consistency is ensured within and between the ontology and the content of the resource. As an illustration of the potential of Empusa, we present the Genome Biology Ontology Language (GBOL). GBOL uses and extends current ontologies to provide a formal representation of genomic entities, along with their properties, relations and provenance. The Empusa code generator and its application to GBOL, an extendable ontology for genome annotation
- Subjects :
- Statistics and Probability
Data consistency
Computer science
Library and Information Sciences
Ontology (information science)
computer.software_genre
Article
Education
03 medical and health sciences
0302 clinical medicine
Documentation
Animals
Humans
Life Science
Systems and Synthetic Biology
Code generation
RDF
lcsh:Science
VLAG
030304 developmental biology
Systeem en Synthetische Biologie
0303 health sciences
Genome
Information retrieval
Application programming interface
Ontology
Systembiologi
Molecular Sequence Annotation
computer.file_format
Ontology language
Computer Science Applications
Gene Ontology
Matematikk og naturvitenskap: 400 [VDP]
Mathematics and natural scienses: 400 [VDP]
Ontologi
lcsh:Q
Statistics, Probability and Uncertainty
Systems biology
computer
Software
030217 neurology & neurosurgery
Markdown
Information Systems
Subjects
Details
- Language :
- English
- ISSN :
- 20524463
- Database :
- OpenAIRE
- Journal :
- Scientific Data, Scientific Data 6 (2019) 1, Scientific Data, 6(1), Scientific Data, Vol 6, Iss 1, Pp 1-9 (2019)
- Accession number :
- edsair.doi.dedup.....0cfd8b1a761225038d20a8de9e28aa8a