
The Empusa code generator and its application to GBOL, an extendable ontology for genome annotation

The RDF data model facilitates integration of diverse data available in structured and semi-structured formats. To obtain a coherent RDF graph the chosen ontology must be consistently applied. However, addition of new diverse data causes the ontology to evolve, which could lead to accumulation of un...


Bibliographic Details
Main authors: van Dam, Jesse C. J., Koehorst, Jasper J., Vik, Jon Olav, Martins dos Santos, Vitor A. P., Schaap, Peter J., Suarez-Diez, Maria
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6828702/
https://www.ncbi.nlm.nih.gov/pubmed/31685817
http://dx.doi.org/10.1038/s41597-019-0263-7
author van Dam, Jesse C. J.
Koehorst, Jasper J.
Vik, Jon Olav
Martins dos Santos, Vitor A. P.
Schaap, Peter J.
Suarez-Diez, Maria
collection PubMed
description The RDF data model facilitates integration of diverse data available in structured and semi-structured formats. To obtain a coherent RDF graph, the chosen ontology must be applied consistently. However, adding new and diverse data causes the ontology to evolve, which can lead to an accumulation of unintended, erroneous composites. There is therefore a need for a gatekeeping system that compares the intended content described in the ontology with the actual content of the resource. The Empusa code generator facilitates the creation of composite RDF resources from disparate sources. Empusa can convert a schema into an associated application programming interface (API) that can be used to perform data consistency checks, and it generates Markdown documentation to make persistent URLs resolvable. With Empusa, consistency is ensured both within the ontology and between the ontology and the content of the resource. As an illustration of the potential of Empusa, we present the Genome Biology Ontology Language (GBOL). GBOL uses and extends current ontologies to provide a formal representation of genomic entities, along with their properties, relations and provenance.
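The gatekeeping idea in the description (comparing the content a schema intends with the content a resource actually holds) can be illustrated with a minimal sketch. This is not Empusa's generated API, which this record does not show. The class name `Gene`, the property names `locusTag`, `begin`, `end` and `note`, and the helper `check_resource` are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PropertyRule:
    """One schema rule: the expected value type and whether the property is required."""
    value_type: type
    required: bool = True


# Hypothetical miniature schema (an assumption, not taken from GBOL):
# class name -> {property name: rule}
SCHEMA = {
    "Gene": {
        "locusTag": PropertyRule(str),
        "begin": PropertyRule(int),
        "end": PropertyRule(int),
        "note": PropertyRule(str, required=False),
    },
}


def check_resource(rdf_type, properties, schema=SCHEMA):
    """Return a list of human-readable violations; an empty list means consistent."""
    if rdf_type not in schema:
        return [f"unknown class: {rdf_type}"]
    rules = schema[rdf_type]
    problems = []
    # Actual content that the schema does not intend, or intends with another type.
    for name, value in properties.items():
        rule = rules.get(name)
        if rule is None:
            problems.append(f"{rdf_type}: property not in schema: {name}")
        elif not isinstance(value, rule.value_type):
            problems.append(
                f"{rdf_type}.{name}: expected {rule.value_type.__name__}, "
                f"got {type(value).__name__}"
            )
    # Intended content that is missing from the resource.
    for name, rule in rules.items():
        if rule.required and name not in properties:
            problems.append(f"{rdf_type}: missing required property: {name}")
    return problems


if __name__ == "__main__":
    for problem in check_resource("Gene", {"locusTag": "b0001", "begin": "190"}):
        print(problem)
```

In this sketch the check runs in both directions: properties present in the data but absent from the schema are reported, and so are required properties the data omits. That is one plausible reading of consistency "between the ontology and the content of the resource".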
format Online
Article
Text
id pubmed-6828702
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-6828702 2019-11-07 Sci Data Article Nature Publishing Group UK 2019-11-04 /pmc/articles/PMC6828702/ /pubmed/31685817 http://dx.doi.org/10.1038/s41597-019-0263-7 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
title The Empusa code generator and its application to GBOL, an extendable ontology for genome annotation
topic Article