Cargando…

Publishing FAIR Data: An Exemplar Methodology Utilizing PHI-Base

Pathogen-Host interaction data is core to our understanding of disease processes and their molecular/genetic bases. Facile access to such core data is particularly important for the plant sciences, where individual genetic and phenotypic observations have the added complexity of being dispersed over...

Descripción completa

Detalles Bibliográficos
Autores principales: Rodríguez-Iglesias, Alejandro, Rodríguez-González, Alejandro, Irvine, Alistair G., Sesma, Ane, Urban, Martin, Hammond-Kosack, Kim E., Wilkinson, Mark D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4922217/
https://www.ncbi.nlm.nih.gov/pubmed/27433158
http://dx.doi.org/10.3389/fpls.2016.00641
_version_ 1782439583426805760
author Rodríguez-Iglesias, Alejandro
Rodríguez-González, Alejandro
Irvine, Alistair G.
Sesma, Ane
Urban, Martin
Hammond-Kosack, Kim E.
Wilkinson, Mark D.
author_facet Rodríguez-Iglesias, Alejandro
Rodríguez-González, Alejandro
Irvine, Alistair G.
Sesma, Ane
Urban, Martin
Hammond-Kosack, Kim E.
Wilkinson, Mark D.
author_sort Rodríguez-Iglesias, Alejandro
collection PubMed
description Pathogen-Host interaction data is core to our understanding of disease processes and their molecular/genetic bases. Facile access to such core data is particularly important for the plant sciences, where individual genetic and phenotypic observations have the added complexity of being dispersed over a wide diversity of plant species vs. the relatively fewer host species of interest to biomedical researchers. Recently, an international initiative interested in scholarly data publishing proposed that all scientific data should be “FAIR”—Findable, Accessible, Interoperable, and Reusable. In this work, we describe the process of migrating a database of notable relevance to the plant sciences—the Pathogen-Host Interaction Database (PHI-base)—to a form that conforms to each of the FAIR Principles. We discuss the technical and architectural decisions, and the migration pathway, including observations of the difficulty and/or fidelity of each step. We examine how multiple FAIR principles can be addressed simultaneously through careful design decisions, including making data FAIR for both humans and machines with minimal duplication of effort. We note how FAIR data publishing involves more than data reformatting, requiring features beyond those exhibited by most life science Semantic Web or Linked Data resources. We explore the value-added by completing this FAIR data transformation, and then test the result through integrative questions that could not easily be asked over traditional Web-based data resources. Finally, we demonstrate the utility of providing explicit and reliable access to provenance information, which we argue enhances citation rates by encouraging and facilitating transparent scholarly reuse of these valuable data holdings.
format Online
Article
Text
id pubmed-4922217
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-49222172016-07-18 Publishing FAIR Data: An Exemplar Methodology Utilizing PHI-Base Rodríguez-Iglesias, Alejandro Rodríguez-González, Alejandro Irvine, Alistair G. Sesma, Ane Urban, Martin Hammond-Kosack, Kim E. Wilkinson, Mark D. Front Plant Sci Plant Science Pathogen-Host interaction data is core to our understanding of disease processes and their molecular/genetic bases. Facile access to such core data is particularly important for the plant sciences, where individual genetic and phenotypic observations have the added complexity of being dispersed over a wide diversity of plant species vs. the relatively fewer host species of interest to biomedical researchers. Recently, an international initiative interested in scholarly data publishing proposed that all scientific data should be “FAIR”—Findable, Accessible, Interoperable, and Reusable. In this work, we describe the process of migrating a database of notable relevance to the plant sciences—the Pathogen-Host Interaction Database (PHI-base)—to a form that conforms to each of the FAIR Principles. We discuss the technical and architectural decisions, and the migration pathway, including observations of the difficulty and/or fidelity of each step. We examine how multiple FAIR principles can be addressed simultaneously through careful design decisions, including making data FAIR for both humans and machines with minimal duplication of effort. We note how FAIR data publishing involves more than data reformatting, requiring features beyond those exhibited by most life science Semantic Web or Linked Data resources. We explore the value-added by completing this FAIR data transformation, and then test the result through integrative questions that could not easily be asked over traditional Web-based data resources. Finally, we demonstrate the utility of providing explicit and reliable access to provenance information, which we argue enhances citation rates by encouraging and facilitating transparent scholarly reuse of these valuable data holdings. Frontiers Media S.A. 2016-05-12 /pmc/articles/PMC4922217/ /pubmed/27433158 http://dx.doi.org/10.3389/fpls.2016.00641 Text en Copyright © 2016 Rodríguez-Iglesias, Rodríguez-González, Irvine, Sesma, Urban, Hammond-Kosack and Wilkinson. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Rodríguez-Iglesias, Alejandro
Rodríguez-González, Alejandro
Irvine, Alistair G.
Sesma, Ane
Urban, Martin
Hammond-Kosack, Kim E.
Wilkinson, Mark D.
Publishing FAIR Data: An Exemplar Methodology Utilizing PHI-Base
title Publishing FAIR Data: An Exemplar Methodology Utilizing PHI-Base
title_full Publishing FAIR Data: An Exemplar Methodology Utilizing PHI-Base
title_fullStr Publishing FAIR Data: An Exemplar Methodology Utilizing PHI-Base
title_full_unstemmed Publishing FAIR Data: An Exemplar Methodology Utilizing PHI-Base
title_short Publishing FAIR Data: An Exemplar Methodology Utilizing PHI-Base
title_sort publishing fair data: an exemplar methodology utilizing phi-base
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4922217/
https://www.ncbi.nlm.nih.gov/pubmed/27433158
http://dx.doi.org/10.3389/fpls.2016.00641
work_keys_str_mv AT rodrigueziglesiasalejandro publishingfairdataanexemplarmethodologyutilizingphibase
AT rodriguezgonzalezalejandro publishingfairdataanexemplarmethodologyutilizingphibase
AT irvinealistairg publishingfairdataanexemplarmethodologyutilizingphibase
AT sesmaane publishingfairdataanexemplarmethodologyutilizingphibase
AT urbanmartin publishingfairdataanexemplarmethodologyutilizingphibase
AT hammondkosackkime publishingfairdataanexemplarmethodologyutilizingphibase
AT wilkinsonmarkd publishingfairdataanexemplarmethodologyutilizingphibase