Cargando…
No wisdom in the crowd: genome annotation in the era of big data – current status and future prospects
Science and engineering rely on the accumulation and dissemination of knowledge to make discoveries and create new designs. Discovery‐driven genome research rests on knowledge passed on via gene annotations. In response to the deluge of sequencing big data, standard annotation practice employs autom...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6011933/ https://www.ncbi.nlm.nih.gov/pubmed/29806194 http://dx.doi.org/10.1111/1751-7915.13284 |
_version_ | 1783333880263081984 |
---|---|
author | Danchin, Antoine Ouzounis, Christos Tokuyasu, Taku Zucker, Jean‐Daniel |
author_facet | Danchin, Antoine Ouzounis, Christos Tokuyasu, Taku Zucker, Jean‐Daniel |
author_sort | Danchin, Antoine |
collection | PubMed |
description | Science and engineering rely on the accumulation and dissemination of knowledge to make discoveries and create new designs. Discovery‐driven genome research rests on knowledge passed on via gene annotations. In response to the deluge of sequencing big data, standard annotation practice employs automated procedures that rely on majority rules. We argue this hinders progress through the generation and propagation of errors, leading investigators into blind alleys. More subtly, this inductive process discourages the discovery of novelty, which remains essential in biological research and reflects the nature of biology itself. Annotation systems, rather than being repositories of facts, should be tools that support multiple modes of inference. By combining deduction, induction and abduction, investigators can generate hypotheses when accurate knowledge is extracted from model databases. A key stance is to depart from ‘the sequence tells the structure tells the function’ fallacy, placing function first. We illustrate our approach with examples of critical or unexpected pathways, using MicroScope to demonstrate how tools can be implemented following the principles we advocate. We end with a challenge to the reader. |
format | Online Article Text |
id | pubmed-6011933 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-60119332018-07-05 No wisdom in the crowd: genome annotation in the era of big data – current status and future prospects Danchin, Antoine Ouzounis, Christos Tokuyasu, Taku Zucker, Jean‐Daniel Microb Biotechnol Genomics Update Science and engineering rely on the accumulation and dissemination of knowledge to make discoveries and create new designs. Discovery‐driven genome research rests on knowledge passed on via gene annotations. In response to the deluge of sequencing big data, standard annotation practice employs automated procedures that rely on majority rules. We argue this hinders progress through the generation and propagation of errors, leading investigators into blind alleys. More subtly, this inductive process discourages the discovery of novelty, which remains essential in biological research and reflects the nature of biology itself. Annotation systems, rather than being repositories of facts, should be tools that support multiple modes of inference. By combining deduction, induction and abduction, investigators can generate hypotheses when accurate knowledge is extracted from model databases. A key stance is to depart from ‘the sequence tells the structure tells the function’ fallacy, placing function first. We illustrate our approach with examples of critical or unexpected pathways, using MicroScope to demonstrate how tools can be implemented following the principles we advocate. We end with a challenge to the reader. John Wiley and Sons Inc. 2018-05-28 /pmc/articles/PMC6011933/ /pubmed/29806194 http://dx.doi.org/10.1111/1751-7915.13284 Text en © 2018 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Genomics Update Danchin, Antoine Ouzounis, Christos Tokuyasu, Taku Zucker, Jean‐Daniel No wisdom in the crowd: genome annotation in the era of big data – current status and future prospects |
title | No wisdom in the crowd: genome annotation in the era of big data – current status and future prospects |
title_full | No wisdom in the crowd: genome annotation in the era of big data – current status and future prospects |
title_fullStr | No wisdom in the crowd: genome annotation in the era of big data – current status and future prospects |
title_full_unstemmed | No wisdom in the crowd: genome annotation in the era of big data – current status and future prospects |
title_short | No wisdom in the crowd: genome annotation in the era of big data – current status and future prospects |
title_sort | no wisdom in the crowd: genome annotation in the era of big data – current status and future prospects |
topic | Genomics Update |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6011933/ https://www.ncbi.nlm.nih.gov/pubmed/29806194 http://dx.doi.org/10.1111/1751-7915.13284 |
work_keys_str_mv | AT danchinantoine nowisdominthecrowdgenomeannotationintheeraofbigdatacurrentstatusandfutureprospects AT ouzounischristos nowisdominthecrowdgenomeannotationintheeraofbigdatacurrentstatusandfutureprospects AT tokuyasutaku nowisdominthecrowdgenomeannotationintheeraofbigdatacurrentstatusandfutureprospects AT zuckerjeandaniel nowisdominthecrowdgenomeannotationintheeraofbigdatacurrentstatusandfutureprospects |