Cargando…

Challenges for FAIR-compliant description and comparison of crop phenotype data with standardized controlled vocabularies

Crop phenotypic data underpin many pre-breeding efforts to characterize variation within germplasm collections. Although there has been an increase in the global capacity for accumulating and comparing such data, a lack of consistency in the systematic description of metadata often limits integratio...

Descripción completa

Detalles Bibliográficos
Autores principales: Andrés-Hernández, Liliana, Halimi, Razlin Azman, Mauleon, Ramil, Mayes, Sean, Baten, Abdul, King, Graham J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8122365/
https://www.ncbi.nlm.nih.gov/pubmed/33991093
http://dx.doi.org/10.1093/database/baab028
_version_ 1783692597091368960
author Andrés-Hernández, Liliana
Halimi, Razlin Azman
Mauleon, Ramil
Mayes, Sean
Baten, Abdul
King, Graham J
author_facet Andrés-Hernández, Liliana
Halimi, Razlin Azman
Mauleon, Ramil
Mayes, Sean
Baten, Abdul
King, Graham J
author_sort Andrés-Hernández, Liliana
collection PubMed
description Crop phenotypic data underpin many pre-breeding efforts to characterize variation within germplasm collections. Although there has been an increase in the global capacity for accumulating and comparing such data, a lack of consistency in the systematic description of metadata often limits integration and sharing. We therefore aimed to understand some of the challenges facing findable, accesible, interoperable and reusable (FAIR) curation and annotation of phenotypic data from minor and underutilized crops. We used bambara groundnut (Vigna subterranea) as an exemplar underutilized crop to assess the ability of the Crop Ontology system to facilitate curation of trait datasets, so that they are accessible for comparative analysis. This involved generating a controlled vocabulary Trait Dictionary of 134 terms. Systematic quantification of syntactic and semantic cohesiveness of the full set of 28 crop-specific COs identified inconsistencies between trait descriptor names, a relative lack of cross-referencing to other ontologies and a flat ontological structure for classifying traits. We also evaluated the Minimal Information About a Phenotyping Experiment and FAIR compliance of bambara trait datasets curated within the CropStoreDB schema. We discuss specifications for a more systematic and generic approach to trait controlled vocabularies, which would benefit from representation of terms that adhere to Open Biological and Biomedical Ontologies principles. In particular, we focus on the benefits of reuse of existing definitions within pre- and post-composed axioms from other domains in order to facilitate the curation and comparison of datasets from a wider range of crops. Database URL: https://www.cropstoredb.org/cs_bambara.html
format Online
Article
Text
id pubmed-8122365
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-81223652021-05-21 Challenges for FAIR-compliant description and comparison of crop phenotype data with standardized controlled vocabularies Andrés-Hernández, Liliana Halimi, Razlin Azman Mauleon, Ramil Mayes, Sean Baten, Abdul King, Graham J Database (Oxford) Original Article Crop phenotypic data underpin many pre-breeding efforts to characterize variation within germplasm collections. Although there has been an increase in the global capacity for accumulating and comparing such data, a lack of consistency in the systematic description of metadata often limits integration and sharing. We therefore aimed to understand some of the challenges facing findable, accesible, interoperable and reusable (FAIR) curation and annotation of phenotypic data from minor and underutilized crops. We used bambara groundnut (Vigna subterranea) as an exemplar underutilized crop to assess the ability of the Crop Ontology system to facilitate curation of trait datasets, so that they are accessible for comparative analysis. This involved generating a controlled vocabulary Trait Dictionary of 134 terms. Systematic quantification of syntactic and semantic cohesiveness of the full set of 28 crop-specific COs identified inconsistencies between trait descriptor names, a relative lack of cross-referencing to other ontologies and a flat ontological structure for classifying traits. We also evaluated the Minimal Information About a Phenotyping Experiment and FAIR compliance of bambara trait datasets curated within the CropStoreDB schema. We discuss specifications for a more systematic and generic approach to trait controlled vocabularies, which would benefit from representation of terms that adhere to Open Biological and Biomedical Ontologies principles. In particular, we focus on the benefits of reuse of existing definitions within pre- and post-composed axioms from other domains in order to facilitate the curation and comparison of datasets from a wider range of crops. Database URL: https://www.cropstoredb.org/cs_bambara.html Oxford University Press 2021-05-15 /pmc/articles/PMC8122365/ /pubmed/33991093 http://dx.doi.org/10.1093/database/baab028 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Andrés-Hernández, Liliana
Halimi, Razlin Azman
Mauleon, Ramil
Mayes, Sean
Baten, Abdul
King, Graham J
Challenges for FAIR-compliant description and comparison of crop phenotype data with standardized controlled vocabularies
title Challenges for FAIR-compliant description and comparison of crop phenotype data with standardized controlled vocabularies
title_full Challenges for FAIR-compliant description and comparison of crop phenotype data with standardized controlled vocabularies
title_fullStr Challenges for FAIR-compliant description and comparison of crop phenotype data with standardized controlled vocabularies
title_full_unstemmed Challenges for FAIR-compliant description and comparison of crop phenotype data with standardized controlled vocabularies
title_short Challenges for FAIR-compliant description and comparison of crop phenotype data with standardized controlled vocabularies
title_sort challenges for fair-compliant description and comparison of crop phenotype data with standardized controlled vocabularies
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8122365/
https://www.ncbi.nlm.nih.gov/pubmed/33991093
http://dx.doi.org/10.1093/database/baab028
work_keys_str_mv AT andreshernandezliliana challengesforfaircompliantdescriptionandcomparisonofcropphenotypedatawithstandardizedcontrolledvocabularies
AT halimirazlinazman challengesforfaircompliantdescriptionandcomparisonofcropphenotypedatawithstandardizedcontrolledvocabularies
AT mauleonramil challengesforfaircompliantdescriptionandcomparisonofcropphenotypedatawithstandardizedcontrolledvocabularies
AT mayessean challengesforfaircompliantdescriptionandcomparisonofcropphenotypedatawithstandardizedcontrolledvocabularies
AT batenabdul challengesforfaircompliantdescriptionandcomparisonofcropphenotypedatawithstandardizedcontrolledvocabularies
AT kinggrahamj challengesforfaircompliantdescriptionandcomparisonofcropphenotypedatawithstandardizedcontrolledvocabularies