Cargando…
Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes
BACKGROUND: Metagenomes can be analysed using different approaches and tools. One of the most important distinctions is the way to perform taxonomic and functional assignment, choosing between the use of assembly algorithms or the direct analysis of raw sequence reads instead by homology searching,...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6902526/ https://www.ncbi.nlm.nih.gov/pubmed/31823721 http://dx.doi.org/10.1186/s12864-019-6289-6 |
_version_ | 1783477687522689024 |
---|---|
author | Tamames, Javier Cobo-Simón, Marta Puente-Sánchez, Fernando |
author_facet | Tamames, Javier Cobo-Simón, Marta Puente-Sánchez, Fernando |
author_sort | Tamames, Javier |
collection | PubMed |
description | BACKGROUND: Metagenomes can be analysed using different approaches and tools. One of the most important distinctions is the way to perform taxonomic and functional assignment, choosing between the use of assembly algorithms or the direct analysis of raw sequence reads instead by homology searching, k-mer analysys, or detection of marker genes. Many instances of each approach can be found in the literature, but to the best of our knowledge no evaluation of their different performances has been carried on, and we question if their results are comparable. RESULTS: We have analysed several real and mock metagenomes using different methodologies and tools, and compared the resulting taxonomic and functional profiles. Our results show that database completeness (the representation of diverse organisms and taxa in it) is the main factor determining the performance of the methods relying on direct read assignment either by homology, k-mer composition or similarity to marker genes, while methods relying on assembly and assignment of predicted genes are most influenced by metagenomic size, that in turn determines the completeness of the assembly (the percentage of read that were assembled). CONCLUSIONS: Although differences exist, taxonomic profiles are rather similar between raw read assignment and assembly assignment methods, while they are more divergent for methods based on k-mers and marker genes. Regarding functional annotation, analysis of raw reads retrieves more functions, but it also makes a substantial number of over-predictions. Assembly methods are more advantageous as the size of the metagenome grows bigger. |
format | Online Article Text |
id | pubmed-6902526 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-69025262019-12-11 Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes Tamames, Javier Cobo-Simón, Marta Puente-Sánchez, Fernando BMC Genomics Research Article BACKGROUND: Metagenomes can be analysed using different approaches and tools. One of the most important distinctions is the way to perform taxonomic and functional assignment, choosing between the use of assembly algorithms or the direct analysis of raw sequence reads instead by homology searching, k-mer analysys, or detection of marker genes. Many instances of each approach can be found in the literature, but to the best of our knowledge no evaluation of their different performances has been carried on, and we question if their results are comparable. RESULTS: We have analysed several real and mock metagenomes using different methodologies and tools, and compared the resulting taxonomic and functional profiles. Our results show that database completeness (the representation of diverse organisms and taxa in it) is the main factor determining the performance of the methods relying on direct read assignment either by homology, k-mer composition or similarity to marker genes, while methods relying on assembly and assignment of predicted genes are most influenced by metagenomic size, that in turn determines the completeness of the assembly (the percentage of read that were assembled). CONCLUSIONS: Although differences exist, taxonomic profiles are rather similar between raw read assignment and assembly assignment methods, while they are more divergent for methods based on k-mers and marker genes. Regarding functional annotation, analysis of raw reads retrieves more functions, but it also makes a substantial number of over-predictions. Assembly methods are more advantageous as the size of the metagenome grows bigger. BioMed Central 2019-12-10 /pmc/articles/PMC6902526/ /pubmed/31823721 http://dx.doi.org/10.1186/s12864-019-6289-6 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Tamames, Javier Cobo-Simón, Marta Puente-Sánchez, Fernando Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes |
title | Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes |
title_full | Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes |
title_fullStr | Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes |
title_full_unstemmed | Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes |
title_short | Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes |
title_sort | assessing the performance of different approaches for functional and taxonomic annotation of metagenomes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6902526/ https://www.ncbi.nlm.nih.gov/pubmed/31823721 http://dx.doi.org/10.1186/s12864-019-6289-6 |
work_keys_str_mv | AT tamamesjavier assessingtheperformanceofdifferentapproachesforfunctionalandtaxonomicannotationofmetagenomes AT cobosimonmarta assessingtheperformanceofdifferentapproachesforfunctionalandtaxonomicannotationofmetagenomes AT puentesanchezfernando assessingtheperformanceofdifferentapproachesforfunctionalandtaxonomicannotationofmetagenomes |