Cargando…

Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper

Orthology assignment is ideally suited for functional inference. However, because predicting orthology is computationally intensive at large scale, and most pipelines are relatively inaccessible (e.g., new assignments only available through database updates), less precise homology-based functional t...

Descripción completa

Detalles Bibliográficos
Autores principales: Huerta-Cepas, Jaime, Forslund, Kristoffer, Coelho, Luis Pedro, Szklarczyk, Damian, Jensen, Lars Juhl, von Mering, Christian, Bork, Peer
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850834/
https://www.ncbi.nlm.nih.gov/pubmed/28460117
http://dx.doi.org/10.1093/molbev/msx148
_version_ 1783306292687798272
author Huerta-Cepas, Jaime
Forslund, Kristoffer
Coelho, Luis Pedro
Szklarczyk, Damian
Jensen, Lars Juhl
von Mering, Christian
Bork, Peer
author_facet Huerta-Cepas, Jaime
Forslund, Kristoffer
Coelho, Luis Pedro
Szklarczyk, Damian
Jensen, Lars Juhl
von Mering, Christian
Bork, Peer
author_sort Huerta-Cepas, Jaime
collection PubMed
description Orthology assignment is ideally suited for functional inference. However, because predicting orthology is computationally intensive at large scale, and most pipelines are relatively inaccessible (e.g., new assignments only available through database updates), less precise homology-based functional transfer is still the default for (meta-)genome annotation. We, therefore, developed eggNOG-mapper, a tool for functional annotation of large sets of sequences based on fast orthology assignments using precomputed clusters and phylogenies from the eggNOG database. To validate our method, we benchmarked Gene Ontology (GO) predictions against two widely used homology-based approaches: BLAST and InterProScan. Orthology filters applied to BLAST results reduced the rate of false positive assignments by 11%, and increased the ratio of experimentally validated terms recovered over all terms assigned per protein by 15%. Compared with InterProScan, eggNOG-mapper achieved similar proteome coverage and precision while predicting, on average, 41 more terms per protein and increasing the rate of experimentally validated terms recovered over total term assignments per protein by 35%. EggNOG-mapper predictions scored within the top-5 methods in the three GO categories using the CAFA2 NK-partial benchmark. Finally, we evaluated eggNOG-mapper for functional annotation of metagenomics data, yielding better performance than interProScan. eggNOG-mapper runs ∼15× faster than BLAST and at least 2.5× faster than InterProScan. The tool is available standalone and as an online service at http://eggnog-mapper.embl.de.
format Online
Article
Text
id pubmed-5850834
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58508342018-03-23 Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper Huerta-Cepas, Jaime Forslund, Kristoffer Coelho, Luis Pedro Szklarczyk, Damian Jensen, Lars Juhl von Mering, Christian Bork, Peer Mol Biol Evol Resources Orthology assignment is ideally suited for functional inference. However, because predicting orthology is computationally intensive at large scale, and most pipelines are relatively inaccessible (e.g., new assignments only available through database updates), less precise homology-based functional transfer is still the default for (meta-)genome annotation. We, therefore, developed eggNOG-mapper, a tool for functional annotation of large sets of sequences based on fast orthology assignments using precomputed clusters and phylogenies from the eggNOG database. To validate our method, we benchmarked Gene Ontology (GO) predictions against two widely used homology-based approaches: BLAST and InterProScan. Orthology filters applied to BLAST results reduced the rate of false positive assignments by 11%, and increased the ratio of experimentally validated terms recovered over all terms assigned per protein by 15%. Compared with InterProScan, eggNOG-mapper achieved similar proteome coverage and precision while predicting, on average, 41 more terms per protein and increasing the rate of experimentally validated terms recovered over total term assignments per protein by 35%. EggNOG-mapper predictions scored within the top-5 methods in the three GO categories using the CAFA2 NK-partial benchmark. Finally, we evaluated eggNOG-mapper for functional annotation of metagenomics data, yielding better performance than interProScan. eggNOG-mapper runs ∼15× faster than BLAST and at least 2.5× faster than InterProScan. The tool is available standalone and as an online service at http://eggnog-mapper.embl.de. Oxford University Press 2017-08 2017-04-29 /pmc/articles/PMC5850834/ /pubmed/28460117 http://dx.doi.org/10.1093/molbev/msx148 Text en © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Resources
Huerta-Cepas, Jaime
Forslund, Kristoffer
Coelho, Luis Pedro
Szklarczyk, Damian
Jensen, Lars Juhl
von Mering, Christian
Bork, Peer
Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper
title Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper
title_full Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper
title_fullStr Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper
title_full_unstemmed Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper
title_short Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper
title_sort fast genome-wide functional annotation through orthology assignment by eggnog-mapper
topic Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850834/
https://www.ncbi.nlm.nih.gov/pubmed/28460117
http://dx.doi.org/10.1093/molbev/msx148
work_keys_str_mv AT huertacepasjaime fastgenomewidefunctionalannotationthroughorthologyassignmentbyeggnogmapper
AT forslundkristoffer fastgenomewidefunctionalannotationthroughorthologyassignmentbyeggnogmapper
AT coelholuispedro fastgenomewidefunctionalannotationthroughorthologyassignmentbyeggnogmapper
AT szklarczykdamian fastgenomewidefunctionalannotationthroughorthologyassignmentbyeggnogmapper
AT jensenlarsjuhl fastgenomewidefunctionalannotationthroughorthologyassignmentbyeggnogmapper
AT vonmeringchristian fastgenomewidefunctionalannotationthroughorthologyassignmentbyeggnogmapper
AT borkpeer fastgenomewidefunctionalannotationthroughorthologyassignmentbyeggnogmapper