
Metadata integrity in bioinformatics: Bridging the gap between data and knowledge

In the fast-evolving landscape of biomedical research, the emergence of big data has presented researchers with extraordinary opportunities to explore biological complexities. In biomedical research, big data also imply a big responsibility. This is not only due to genomics data being sensitive info...


Bibliographic Details
Main authors: Caliskan, Aylin; Dangwal, Seema; Dandekar, Thomas
Format: Online Article Text
Language: English
Published: Research Network of Computational and Structural Biotechnology 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10582761/
https://www.ncbi.nlm.nih.gov/pubmed/37860229
http://dx.doi.org/10.1016/j.csbj.2023.10.006
_version_ 1785122404582817792
author Caliskan, Aylin
Dangwal, Seema
Dandekar, Thomas
collection PubMed
description In the fast-evolving landscape of biomedical research, the emergence of big data has presented researchers with extraordinary opportunities to explore biological complexities. In biomedical research, big data also imply a big responsibility. This is not only due to genomics data being sensitive information but also due to genomics data being shared and re-analysed among the scientific community. This saves valuable resources and can even help to find new insights in silico. To fully use these opportunities, detailed and correct metadata are imperative. This includes not only the availability of metadata but also their correctness. Metadata integrity serves as a fundamental determinant of research credibility, supporting the reliability and reproducibility of data-driven findings. Ensuring metadata availability, curation, and accuracy is therefore essential for bioinformatic research. Not only must metadata be readily available, but they must also be meticulously curated and ideally error-free. Motivated by an accidental discovery of a critical metadata error in patient data published in two high-impact journals, we aim to raise awareness of the need for correct, complete, and curated metadata. We describe how the metadata error was found and addressed, and present examples of metadata-related challenges in omics research, along with supporting measures, including tools for checking metadata and software to facilitate various steps from data analysis to published research.
format Online
Article
Text
id pubmed-10582761
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-10582761 2023-10-19 Metadata integrity in bioinformatics: Bridging the gap between data and knowledge Caliskan, Aylin Dangwal, Seema Dandekar, Thomas Comput Struct Biotechnol J Review Article In the fast-evolving landscape of biomedical research, the emergence of big data has presented researchers with extraordinary opportunities to explore biological complexities. In biomedical research, big data also imply a big responsibility. This is not only due to genomics data being sensitive information but also due to genomics data being shared and re-analysed among the scientific community. This saves valuable resources and can even help to find new insights in silico. To fully use these opportunities, detailed and correct metadata are imperative. This includes not only the availability of metadata but also their correctness. Metadata integrity serves as a fundamental determinant of research credibility, supporting the reliability and reproducibility of data-driven findings. Ensuring metadata availability, curation, and accuracy is therefore essential for bioinformatic research. Not only must metadata be readily available, but they must also be meticulously curated and ideally error-free. Motivated by an accidental discovery of a critical metadata error in patient data published in two high-impact journals, we aim to raise awareness of the need for correct, complete, and curated metadata. We describe how the metadata error was found and addressed, and present examples of metadata-related challenges in omics research, along with supporting measures, including tools for checking metadata and software to facilitate various steps from data analysis to published research.
Research Network of Computational and Structural Biotechnology 2023-10-05 /pmc/articles/PMC10582761/ /pubmed/37860229 http://dx.doi.org/10.1016/j.csbj.2023.10.006 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
title Metadata integrity in bioinformatics: Bridging the gap between data and knowledge
topic Review Article