Cargando…

No wisdom in the crowd: genome annotation in the era of big data – current status and future prospects

Science and engineering rely on the accumulation and dissemination of knowledge to make discoveries and create new designs. Discovery‐driven genome research rests on knowledge passed on via gene annotations. In response to the deluge of sequencing big data, standard annotation practice employs autom...

Descripción completa

Detalles Bibliográficos
Autores principales: Danchin, Antoine, Ouzounis, Christos, Tokuyasu, Taku, Zucker, Jean‐Daniel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6011933/
https://www.ncbi.nlm.nih.gov/pubmed/29806194
http://dx.doi.org/10.1111/1751-7915.13284
_version_ 1783333880263081984
author Danchin, Antoine
Ouzounis, Christos
Tokuyasu, Taku
Zucker, Jean‐Daniel
author_facet Danchin, Antoine
Ouzounis, Christos
Tokuyasu, Taku
Zucker, Jean‐Daniel
author_sort Danchin, Antoine
collection PubMed
description Science and engineering rely on the accumulation and dissemination of knowledge to make discoveries and create new designs. Discovery‐driven genome research rests on knowledge passed on via gene annotations. In response to the deluge of sequencing big data, standard annotation practice employs automated procedures that rely on majority rules. We argue this hinders progress through the generation and propagation of errors, leading investigators into blind alleys. More subtly, this inductive process discourages the discovery of novelty, which remains essential in biological research and reflects the nature of biology itself. Annotation systems, rather than being repositories of facts, should be tools that support multiple modes of inference. By combining deduction, induction and abduction, investigators can generate hypotheses when accurate knowledge is extracted from model databases. A key stance is to depart from ‘the sequence tells the structure tells the function’ fallacy, placing function first. We illustrate our approach with examples of critical or unexpected pathways, using MicroScope to demonstrate how tools can be implemented following the principles we advocate. We end with a challenge to the reader.
format Online
Article
Text
id pubmed-6011933
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-60119332018-07-05 No wisdom in the crowd: genome annotation in the era of big data – current status and future prospects Danchin, Antoine Ouzounis, Christos Tokuyasu, Taku Zucker, Jean‐Daniel Microb Biotechnol Genomics Update Science and engineering rely on the accumulation and dissemination of knowledge to make discoveries and create new designs. Discovery‐driven genome research rests on knowledge passed on via gene annotations. In response to the deluge of sequencing big data, standard annotation practice employs automated procedures that rely on majority rules. We argue this hinders progress through the generation and propagation of errors, leading investigators into blind alleys. More subtly, this inductive process discourages the discovery of novelty, which remains essential in biological research and reflects the nature of biology itself. Annotation systems, rather than being repositories of facts, should be tools that support multiple modes of inference. By combining deduction, induction and abduction, investigators can generate hypotheses when accurate knowledge is extracted from model databases. A key stance is to depart from ‘the sequence tells the structure tells the function’ fallacy, placing function first. We illustrate our approach with examples of critical or unexpected pathways, using MicroScope to demonstrate how tools can be implemented following the principles we advocate. We end with a challenge to the reader. John Wiley and Sons Inc. 2018-05-28 /pmc/articles/PMC6011933/ /pubmed/29806194 http://dx.doi.org/10.1111/1751-7915.13284 Text en © 2018 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genomics Update
Danchin, Antoine
Ouzounis, Christos
Tokuyasu, Taku
Zucker, Jean‐Daniel
No wisdom in the crowd: genome annotation in the era of big data – current status and future prospects
title No wisdom in the crowd: genome annotation in the era of big data – current status and future prospects
title_full No wisdom in the crowd: genome annotation in the era of big data – current status and future prospects
title_fullStr No wisdom in the crowd: genome annotation in the era of big data – current status and future prospects
title_full_unstemmed No wisdom in the crowd: genome annotation in the era of big data – current status and future prospects
title_short No wisdom in the crowd: genome annotation in the era of big data – current status and future prospects
title_sort no wisdom in the crowd: genome annotation in the era of big data – current status and future prospects
topic Genomics Update
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6011933/
https://www.ncbi.nlm.nih.gov/pubmed/29806194
http://dx.doi.org/10.1111/1751-7915.13284
work_keys_str_mv AT danchinantoine nowisdominthecrowdgenomeannotationintheeraofbigdatacurrentstatusandfutureprospects
AT ouzounischristos nowisdominthecrowdgenomeannotationintheeraofbigdatacurrentstatusandfutureprospects
AT tokuyasutaku nowisdominthecrowdgenomeannotationintheeraofbigdatacurrentstatusandfutureprospects
AT zuckerjeandaniel nowisdominthecrowdgenomeannotationintheeraofbigdatacurrentstatusandfutureprospects