Cargando…

PAV ontology: provenance, authoring and versioning

BACKGROUND: Provenance is a critical ingredient for establishing trust of published scientific content. This is true whether we are considering a data set, a computational workflow, a peer-reviewed publication or a simple scientific claim with supportive evidence. Existing vocabularies such as Dubli...

Descripción completa

Detalles Bibliográficos
Autores principales: Ciccarese, Paolo, Soiland-Reyes, Stian, Belhajjame, Khalid, Gray, Alasdair JG, Goble, Carole, Clark, Tim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4177195/
https://www.ncbi.nlm.nih.gov/pubmed/24267948
http://dx.doi.org/10.1186/2041-1480-4-37
_version_ 1782336740410785792
author Ciccarese, Paolo
Soiland-Reyes, Stian
Belhajjame, Khalid
Gray, Alasdair JG
Goble, Carole
Clark, Tim
author_facet Ciccarese, Paolo
Soiland-Reyes, Stian
Belhajjame, Khalid
Gray, Alasdair JG
Goble, Carole
Clark, Tim
author_sort Ciccarese, Paolo
collection PubMed
description BACKGROUND: Provenance is a critical ingredient for establishing trust of published scientific content. This is true whether we are considering a data set, a computational workflow, a peer-reviewed publication or a simple scientific claim with supportive evidence. Existing vocabularies such as Dublin Core Terms (DC Terms) and the W3C Provenance Ontology (PROV-O) are domain-independent and general-purpose and they allow and encourage for extensions to cover more specific needs. In particular, to track authoring and versioning information of web resources, PROV-O provides a basic methodology but not any specific classes and properties for identifying or distinguishing between the various roles assumed by agents manipulating digital artifacts, such as author, contributor and curator. RESULTS: We present the Provenance, Authoring and Versioning ontology (PAV, namespace http://purl.org/pav/): a lightweight ontology for capturing “just enough” descriptions essential for tracking the provenance, authoring and versioning of web resources. We argue that such descriptions are essential for digital scientific content. PAV distinguishes between contributors, authors and curators of content and creators of representations in addition to the provenance of originating resources that have been accessed, transformed and consumed. We explore five projects (and communities) that have adopted PAV illustrating their usage through concrete examples. Moreover, we present mappings that show how PAV extends the W3C PROV-O ontology to support broader interoperability. METHOD: The initial design of the PAV ontology was driven by requirements from the AlzSWAN project with further requirements incorporated later from other projects detailed in this paper. The authors strived to keep PAV lightweight and compact by including only those terms that have demonstrated to be pragmatically useful in existing applications, and by recommending terms from existing ontologies when plausible. DISCUSSION: We analyze and compare PAV with related approaches, namely Provenance Vocabulary (PRV), DC Terms and BIBFRAME. We identify similarities and analyze differences between those vocabularies and PAV, outlining strengths and weaknesses of our proposed model. We specify SKOS mappings that align PAV with DC Terms. We conclude the paper with general remarks on the applicability of PAV.
format Online
Article
Text
id pubmed-4177195
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41771952014-09-28 PAV ontology: provenance, authoring and versioning Ciccarese, Paolo Soiland-Reyes, Stian Belhajjame, Khalid Gray, Alasdair JG Goble, Carole Clark, Tim J Biomed Semantics Research BACKGROUND: Provenance is a critical ingredient for establishing trust of published scientific content. This is true whether we are considering a data set, a computational workflow, a peer-reviewed publication or a simple scientific claim with supportive evidence. Existing vocabularies such as Dublin Core Terms (DC Terms) and the W3C Provenance Ontology (PROV-O) are domain-independent and general-purpose and they allow and encourage for extensions to cover more specific needs. In particular, to track authoring and versioning information of web resources, PROV-O provides a basic methodology but not any specific classes and properties for identifying or distinguishing between the various roles assumed by agents manipulating digital artifacts, such as author, contributor and curator. RESULTS: We present the Provenance, Authoring and Versioning ontology (PAV, namespace http://purl.org/pav/): a lightweight ontology for capturing “just enough” descriptions essential for tracking the provenance, authoring and versioning of web resources. We argue that such descriptions are essential for digital scientific content. PAV distinguishes between contributors, authors and curators of content and creators of representations in addition to the provenance of originating resources that have been accessed, transformed and consumed. We explore five projects (and communities) that have adopted PAV illustrating their usage through concrete examples. Moreover, we present mappings that show how PAV extends the W3C PROV-O ontology to support broader interoperability. METHOD: The initial design of the PAV ontology was driven by requirements from the AlzSWAN project with further requirements incorporated later from other projects detailed in this paper. The authors strived to keep PAV lightweight and compact by including only those terms that have demonstrated to be pragmatically useful in existing applications, and by recommending terms from existing ontologies when plausible. DISCUSSION: We analyze and compare PAV with related approaches, namely Provenance Vocabulary (PRV), DC Terms and BIBFRAME. We identify similarities and analyze differences between those vocabularies and PAV, outlining strengths and weaknesses of our proposed model. We specify SKOS mappings that align PAV with DC Terms. We conclude the paper with general remarks on the applicability of PAV. BioMed Central 2013-11-22 /pmc/articles/PMC4177195/ /pubmed/24267948 http://dx.doi.org/10.1186/2041-1480-4-37 Text en Copyright © 2013 Ciccarese et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Ciccarese, Paolo
Soiland-Reyes, Stian
Belhajjame, Khalid
Gray, Alasdair JG
Goble, Carole
Clark, Tim
PAV ontology: provenance, authoring and versioning
title PAV ontology: provenance, authoring and versioning
title_full PAV ontology: provenance, authoring and versioning
title_fullStr PAV ontology: provenance, authoring and versioning
title_full_unstemmed PAV ontology: provenance, authoring and versioning
title_short PAV ontology: provenance, authoring and versioning
title_sort pav ontology: provenance, authoring and versioning
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4177195/
https://www.ncbi.nlm.nih.gov/pubmed/24267948
http://dx.doi.org/10.1186/2041-1480-4-37
work_keys_str_mv AT ciccaresepaolo pavontologyprovenanceauthoringandversioning
AT soilandreyesstian pavontologyprovenanceauthoringandversioning
AT belhajjamekhalid pavontologyprovenanceauthoringandversioning
AT grayalasdairjg pavontologyprovenanceauthoringandversioning
AT goblecarole pavontologyprovenanceauthoringandversioning
AT clarktim pavontologyprovenanceauthoringandversioning