Cargando…

BioSamples database: FAIRer samples metadata to accelerate research data management

The BioSamples database at EMBL-EBI is the central institutional repository for sample metadata storage and connection to EMBL-EBI archives and other resources. The technical improvements to our infrastructure described in our last update have enabled us to scale and accommodate an increasing number...

Descripción completa

Detalles Bibliográficos
Autores principales: Courtot, Mélanie, Gupta, Dipayan, Liyanage, Isuru, Xu, Fuqi, Burdett, Tony
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8728232/
https://www.ncbi.nlm.nih.gov/pubmed/34747489
http://dx.doi.org/10.1093/nar/gkab1046
_version_ 1784626692088659968
author Courtot, Mélanie
Gupta, Dipayan
Liyanage, Isuru
Xu, Fuqi
Burdett, Tony
author_facet Courtot, Mélanie
Gupta, Dipayan
Liyanage, Isuru
Xu, Fuqi
Burdett, Tony
author_sort Courtot, Mélanie
collection PubMed
description The BioSamples database at EMBL-EBI is the central institutional repository for sample metadata storage and connection to EMBL-EBI archives and other resources. The technical improvements to our infrastructure described in our last update have enabled us to scale and accommodate an increasing number of communities, resulting in a higher number of submissions and more heterogeneous data. The BioSamples database now has a valuable set of features and processes to improve data quality in BioSamples, and in particular enriching metadata content and following FAIR principles. In this manuscript, we describe how BioSamples in 2021 handles requirements from our community of users through exemplar use cases: increased findability of samples and improved data management practices support the goals of the ReSOLUTE project, how the plant community benefits from being able to link genotypic to phenotypic information, and we highlight how cumulatively those improvements contribute to more complex multi-omics data integration supporting COVID-19 research. Finally, we present underlying technical features used as pillars throughout those use cases and how they are reused for expanded engagement with communities such as FAIRplus and the Global Alliance for Genomics and Health. Availability: The BioSamples database is freely available at http://www.ebi.ac.uk/biosamples. Content is distributed under the EMBL-EBI Terms of Use available at https://www.ebi.ac.uk/about/terms-of-use. The BioSamples code is available at https://github.com/EBIBioSamples/biosamples-v4 and distributed under the Apache 2.0 license.
format Online
Article
Text
id pubmed-8728232
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-87282322022-01-05 BioSamples database: FAIRer samples metadata to accelerate research data management Courtot, Mélanie Gupta, Dipayan Liyanage, Isuru Xu, Fuqi Burdett, Tony Nucleic Acids Res Database Issue The BioSamples database at EMBL-EBI is the central institutional repository for sample metadata storage and connection to EMBL-EBI archives and other resources. The technical improvements to our infrastructure described in our last update have enabled us to scale and accommodate an increasing number of communities, resulting in a higher number of submissions and more heterogeneous data. The BioSamples database now has a valuable set of features and processes to improve data quality in BioSamples, and in particular enriching metadata content and following FAIR principles. In this manuscript, we describe how BioSamples in 2021 handles requirements from our community of users through exemplar use cases: increased findability of samples and improved data management practices support the goals of the ReSOLUTE project, how the plant community benefits from being able to link genotypic to phenotypic information, and we highlight how cumulatively those improvements contribute to more complex multi-omics data integration supporting COVID-19 research. Finally, we present underlying technical features used as pillars throughout those use cases and how they are reused for expanded engagement with communities such as FAIRplus and the Global Alliance for Genomics and Health. Availability: The BioSamples database is freely available at http://www.ebi.ac.uk/biosamples. Content is distributed under the EMBL-EBI Terms of Use available at https://www.ebi.ac.uk/about/terms-of-use. The BioSamples code is available at https://github.com/EBIBioSamples/biosamples-v4 and distributed under the Apache 2.0 license. Oxford University Press 2021-11-08 /pmc/articles/PMC8728232/ /pubmed/34747489 http://dx.doi.org/10.1093/nar/gkab1046 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Issue
Courtot, Mélanie
Gupta, Dipayan
Liyanage, Isuru
Xu, Fuqi
Burdett, Tony
BioSamples database: FAIRer samples metadata to accelerate research data management
title BioSamples database: FAIRer samples metadata to accelerate research data management
title_full BioSamples database: FAIRer samples metadata to accelerate research data management
title_fullStr BioSamples database: FAIRer samples metadata to accelerate research data management
title_full_unstemmed BioSamples database: FAIRer samples metadata to accelerate research data management
title_short BioSamples database: FAIRer samples metadata to accelerate research data management
title_sort biosamples database: fairer samples metadata to accelerate research data management
topic Database Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8728232/
https://www.ncbi.nlm.nih.gov/pubmed/34747489
http://dx.doi.org/10.1093/nar/gkab1046
work_keys_str_mv AT courtotmelanie biosamplesdatabasefairersamplesmetadatatoaccelerateresearchdatamanagement
AT guptadipayan biosamplesdatabasefairersamplesmetadatatoaccelerateresearchdatamanagement
AT liyanageisuru biosamplesdatabasefairersamplesmetadatatoaccelerateresearchdatamanagement
AT xufuqi biosamplesdatabasefairersamplesmetadatatoaccelerateresearchdatamanagement
AT burdetttony biosamplesdatabasefairersamplesmetadatatoaccelerateresearchdatamanagement