Cargando…

We need to keep a reproducible trace of facts, predictions, and hypotheses from gene to function in the era of big data

How do we scale biological science to the demand of next generation biology and medicine to keep track of the facts, predictions, and hypotheses? These days, enormous amounts of DNA sequence and other omics data are generated. Since these data contain the blueprint for life, it is imperative that we...

Descripción completa

Detalles Bibliográficos
Autores principales: Kasif, Simon, Roberts, Richard J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728211/
https://www.ncbi.nlm.nih.gov/pubmed/33253151
http://dx.doi.org/10.1371/journal.pbio.3000999
_version_ 1783621225402073088
author Kasif, Simon
Roberts, Richard J.
author_facet Kasif, Simon
Roberts, Richard J.
author_sort Kasif, Simon
collection PubMed
description How do we scale biological science to the demand of next generation biology and medicine to keep track of the facts, predictions, and hypotheses? These days, enormous amounts of DNA sequence and other omics data are generated. Since these data contain the blueprint for life, it is imperative that we interpret it accurately. The abundance of DNA is only one part of the challenge. Artificial Intelligence (AI) and network methods routinely build on large screens, single cell technologies, proteomics, and other modalities to infer or predict biological functions and phenotypes associated with proteins, pathways, and organisms. As a first step, how do we systematically trace the provenance of knowledge from experimental ground truth to gene function predictions and annotations? Here, we review the main challenges in tracking the evolution of biological knowledge and propose several specific solutions to provenance and computational tracing of evidence in functional linkage networks.
format Online
Article
Text
id pubmed-7728211
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-77282112020-12-16 We need to keep a reproducible trace of facts, predictions, and hypotheses from gene to function in the era of big data Kasif, Simon Roberts, Richard J. PLoS Biol Essay How do we scale biological science to the demand of next generation biology and medicine to keep track of the facts, predictions, and hypotheses? These days, enormous amounts of DNA sequence and other omics data are generated. Since these data contain the blueprint for life, it is imperative that we interpret it accurately. The abundance of DNA is only one part of the challenge. Artificial Intelligence (AI) and network methods routinely build on large screens, single cell technologies, proteomics, and other modalities to infer or predict biological functions and phenotypes associated with proteins, pathways, and organisms. As a first step, how do we systematically trace the provenance of knowledge from experimental ground truth to gene function predictions and annotations? Here, we review the main challenges in tracking the evolution of biological knowledge and propose several specific solutions to provenance and computational tracing of evidence in functional linkage networks. Public Library of Science 2020-11-30 /pmc/articles/PMC7728211/ /pubmed/33253151 http://dx.doi.org/10.1371/journal.pbio.3000999 Text en © 2020 Kasif, Roberts http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Essay
Kasif, Simon
Roberts, Richard J.
We need to keep a reproducible trace of facts, predictions, and hypotheses from gene to function in the era of big data
title We need to keep a reproducible trace of facts, predictions, and hypotheses from gene to function in the era of big data
title_full We need to keep a reproducible trace of facts, predictions, and hypotheses from gene to function in the era of big data
title_fullStr We need to keep a reproducible trace of facts, predictions, and hypotheses from gene to function in the era of big data
title_full_unstemmed We need to keep a reproducible trace of facts, predictions, and hypotheses from gene to function in the era of big data
title_short We need to keep a reproducible trace of facts, predictions, and hypotheses from gene to function in the era of big data
title_sort we need to keep a reproducible trace of facts, predictions, and hypotheses from gene to function in the era of big data
topic Essay
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728211/
https://www.ncbi.nlm.nih.gov/pubmed/33253151
http://dx.doi.org/10.1371/journal.pbio.3000999
work_keys_str_mv AT kasifsimon weneedtokeepareproducibletraceoffactspredictionsandhypothesesfromgenetofunctionintheeraofbigdata
AT robertsrichardj weneedtokeepareproducibletraceoffactspredictionsandhypothesesfromgenetofunctionintheeraofbigdata