Cargando…

Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes

BACKGROUND: Metagenomes can be analysed using different approaches and tools. One of the most important distinctions is the way to perform taxonomic and functional assignment, choosing between the use of assembly algorithms or the direct analysis of raw sequence reads instead by homology searching,...

Descripción completa

Detalles Bibliográficos
Autores principales: Tamames, Javier, Cobo-Simón, Marta, Puente-Sánchez, Fernando
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6902526/
https://www.ncbi.nlm.nih.gov/pubmed/31823721
http://dx.doi.org/10.1186/s12864-019-6289-6
_version_ 1783477687522689024
author Tamames, Javier
Cobo-Simón, Marta
Puente-Sánchez, Fernando
author_facet Tamames, Javier
Cobo-Simón, Marta
Puente-Sánchez, Fernando
author_sort Tamames, Javier
collection PubMed
description BACKGROUND: Metagenomes can be analysed using different approaches and tools. One of the most important distinctions is the way to perform taxonomic and functional assignment, choosing between the use of assembly algorithms or the direct analysis of raw sequence reads instead by homology searching, k-mer analysys, or detection of marker genes. Many instances of each approach can be found in the literature, but to the best of our knowledge no evaluation of their different performances has been carried on, and we question if their results are comparable. RESULTS: We have analysed several real and mock metagenomes using different methodologies and tools, and compared the resulting taxonomic and functional profiles. Our results show that database completeness (the representation of diverse organisms and taxa in it) is the main factor determining the performance of the methods relying on direct read assignment either by homology, k-mer composition or similarity to marker genes, while methods relying on assembly and assignment of predicted genes are most influenced by metagenomic size, that in turn determines the completeness of the assembly (the percentage of read that were assembled). CONCLUSIONS: Although differences exist, taxonomic profiles are rather similar between raw read assignment and assembly assignment methods, while they are more divergent for methods based on k-mers and marker genes. Regarding functional annotation, analysis of raw reads retrieves more functions, but it also makes a substantial number of over-predictions. Assembly methods are more advantageous as the size of the metagenome grows bigger.
format Online
Article
Text
id pubmed-6902526
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69025262019-12-11 Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes Tamames, Javier Cobo-Simón, Marta Puente-Sánchez, Fernando BMC Genomics Research Article BACKGROUND: Metagenomes can be analysed using different approaches and tools. One of the most important distinctions is the way to perform taxonomic and functional assignment, choosing between the use of assembly algorithms or the direct analysis of raw sequence reads instead by homology searching, k-mer analysys, or detection of marker genes. Many instances of each approach can be found in the literature, but to the best of our knowledge no evaluation of their different performances has been carried on, and we question if their results are comparable. RESULTS: We have analysed several real and mock metagenomes using different methodologies and tools, and compared the resulting taxonomic and functional profiles. Our results show that database completeness (the representation of diverse organisms and taxa in it) is the main factor determining the performance of the methods relying on direct read assignment either by homology, k-mer composition or similarity to marker genes, while methods relying on assembly and assignment of predicted genes are most influenced by metagenomic size, that in turn determines the completeness of the assembly (the percentage of read that were assembled). CONCLUSIONS: Although differences exist, taxonomic profiles are rather similar between raw read assignment and assembly assignment methods, while they are more divergent for methods based on k-mers and marker genes. Regarding functional annotation, analysis of raw reads retrieves more functions, but it also makes a substantial number of over-predictions. Assembly methods are more advantageous as the size of the metagenome grows bigger. BioMed Central 2019-12-10 /pmc/articles/PMC6902526/ /pubmed/31823721 http://dx.doi.org/10.1186/s12864-019-6289-6 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Tamames, Javier
Cobo-Simón, Marta
Puente-Sánchez, Fernando
Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes
title Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes
title_full Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes
title_fullStr Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes
title_full_unstemmed Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes
title_short Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes
title_sort assessing the performance of different approaches for functional and taxonomic annotation of metagenomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6902526/
https://www.ncbi.nlm.nih.gov/pubmed/31823721
http://dx.doi.org/10.1186/s12864-019-6289-6
work_keys_str_mv AT tamamesjavier assessingtheperformanceofdifferentapproachesforfunctionalandtaxonomicannotationofmetagenomes
AT cobosimonmarta assessingtheperformanceofdifferentapproachesforfunctionalandtaxonomicannotationofmetagenomes
AT puentesanchezfernando assessingtheperformanceofdifferentapproachesforfunctionalandtaxonomicannotationofmetagenomes