Cargando…

HirBin: high-resolution identification of differentially abundant functions in metagenomes

BACKGROUND: Gene-centric analysis of metagenomics data provides information about the biochemical functions present in a microbiome under a certain condition. The ability to identify significant differences in functions between metagenomes is dependent on accurate classification and quantification o...

Descripción completa

Detalles Bibliográficos
Autores principales: Österlund, Tobias, Jonsson, Viktor, Kristiansson, Erik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5399828/
https://www.ncbi.nlm.nih.gov/pubmed/28431529
http://dx.doi.org/10.1186/s12864-017-3686-6
_version_ 1783230714515292160
author Österlund, Tobias
Jonsson, Viktor
Kristiansson, Erik
author_facet Österlund, Tobias
Jonsson, Viktor
Kristiansson, Erik
author_sort Österlund, Tobias
collection PubMed
description BACKGROUND: Gene-centric analysis of metagenomics data provides information about the biochemical functions present in a microbiome under a certain condition. The ability to identify significant differences in functions between metagenomes is dependent on accurate classification and quantification of the sequence reads (binning). However, biological effects acting on specific functions may be overlooked if the classes are too general. METHODS: Here we introduce High-Resolution Binning (HirBin), a new method for gene-centric analysis of metagenomes. HirBin combines supervised annotation with unsupervised clustering to bin sequence reads at a higher resolution. The supervised annotation is performed by matching sequence fragments to genes using well-established protein domains, such as TIGRFAM, PFAM or COGs, followed by unsupervised clustering where each functional domain is further divided into sub-bins based on sequence similarity. Finally, differential abundance of the sub-bins is statistically assessed. RESULTS: We show that HirBin is able to identify biological effects that are only present at more specific functional levels. Furthermore we show that changes affecting more specific functional levels are often diluted at the more general level and therefore overlooked when analyzed using standard binning approaches. CONCLUSIONS: HirBin improves the resolution of the gene-centric analysis of metagenomes and facilitates the biological interpretation of the results. HirBin is implemented as a Python package and is freely available for download at http://bioinformatics.math.chalmers.se/hirbin. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3686-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5399828
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-53998282017-04-24 HirBin: high-resolution identification of differentially abundant functions in metagenomes Österlund, Tobias Jonsson, Viktor Kristiansson, Erik BMC Genomics Methodology Article BACKGROUND: Gene-centric analysis of metagenomics data provides information about the biochemical functions present in a microbiome under a certain condition. The ability to identify significant differences in functions between metagenomes is dependent on accurate classification and quantification of the sequence reads (binning). However, biological effects acting on specific functions may be overlooked if the classes are too general. METHODS: Here we introduce High-Resolution Binning (HirBin), a new method for gene-centric analysis of metagenomes. HirBin combines supervised annotation with unsupervised clustering to bin sequence reads at a higher resolution. The supervised annotation is performed by matching sequence fragments to genes using well-established protein domains, such as TIGRFAM, PFAM or COGs, followed by unsupervised clustering where each functional domain is further divided into sub-bins based on sequence similarity. Finally, differential abundance of the sub-bins is statistically assessed. RESULTS: We show that HirBin is able to identify biological effects that are only present at more specific functional levels. Furthermore we show that changes affecting more specific functional levels are often diluted at the more general level and therefore overlooked when analyzed using standard binning approaches. CONCLUSIONS: HirBin improves the resolution of the gene-centric analysis of metagenomes and facilitates the biological interpretation of the results. HirBin is implemented as a Python package and is freely available for download at http://bioinformatics.math.chalmers.se/hirbin. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3686-6) contains supplementary material, which is available to authorized users. BioMed Central 2017-04-21 /pmc/articles/PMC5399828/ /pubmed/28431529 http://dx.doi.org/10.1186/s12864-017-3686-6 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Österlund, Tobias
Jonsson, Viktor
Kristiansson, Erik
HirBin: high-resolution identification of differentially abundant functions in metagenomes
title HirBin: high-resolution identification of differentially abundant functions in metagenomes
title_full HirBin: high-resolution identification of differentially abundant functions in metagenomes
title_fullStr HirBin: high-resolution identification of differentially abundant functions in metagenomes
title_full_unstemmed HirBin: high-resolution identification of differentially abundant functions in metagenomes
title_short HirBin: high-resolution identification of differentially abundant functions in metagenomes
title_sort hirbin: high-resolution identification of differentially abundant functions in metagenomes
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5399828/
https://www.ncbi.nlm.nih.gov/pubmed/28431529
http://dx.doi.org/10.1186/s12864-017-3686-6
work_keys_str_mv AT osterlundtobias hirbinhighresolutionidentificationofdifferentiallyabundantfunctionsinmetagenomes
AT jonssonviktor hirbinhighresolutionidentificationofdifferentiallyabundantfunctionsinmetagenomes
AT kristianssonerik hirbinhighresolutionidentificationofdifferentiallyabundantfunctionsinmetagenomes