Cargando…
HirBin: high-resolution identification of differentially abundant functions in metagenomes
BACKGROUND: Gene-centric analysis of metagenomics data provides information about the biochemical functions present in a microbiome under a certain condition. The ability to identify significant differences in functions between metagenomes is dependent on accurate classification and quantification o...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5399828/ https://www.ncbi.nlm.nih.gov/pubmed/28431529 http://dx.doi.org/10.1186/s12864-017-3686-6 |
_version_ | 1783230714515292160 |
---|---|
author | Österlund, Tobias Jonsson, Viktor Kristiansson, Erik |
author_facet | Österlund, Tobias Jonsson, Viktor Kristiansson, Erik |
author_sort | Österlund, Tobias |
collection | PubMed |
description | BACKGROUND: Gene-centric analysis of metagenomics data provides information about the biochemical functions present in a microbiome under a certain condition. The ability to identify significant differences in functions between metagenomes is dependent on accurate classification and quantification of the sequence reads (binning). However, biological effects acting on specific functions may be overlooked if the classes are too general. METHODS: Here we introduce High-Resolution Binning (HirBin), a new method for gene-centric analysis of metagenomes. HirBin combines supervised annotation with unsupervised clustering to bin sequence reads at a higher resolution. The supervised annotation is performed by matching sequence fragments to genes using well-established protein domains, such as TIGRFAM, PFAM or COGs, followed by unsupervised clustering where each functional domain is further divided into sub-bins based on sequence similarity. Finally, differential abundance of the sub-bins is statistically assessed. RESULTS: We show that HirBin is able to identify biological effects that are only present at more specific functional levels. Furthermore we show that changes affecting more specific functional levels are often diluted at the more general level and therefore overlooked when analyzed using standard binning approaches. CONCLUSIONS: HirBin improves the resolution of the gene-centric analysis of metagenomes and facilitates the biological interpretation of the results. HirBin is implemented as a Python package and is freely available for download at http://bioinformatics.math.chalmers.se/hirbin. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3686-6) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5399828 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-53998282017-04-24 HirBin: high-resolution identification of differentially abundant functions in metagenomes Österlund, Tobias Jonsson, Viktor Kristiansson, Erik BMC Genomics Methodology Article BACKGROUND: Gene-centric analysis of metagenomics data provides information about the biochemical functions present in a microbiome under a certain condition. The ability to identify significant differences in functions between metagenomes is dependent on accurate classification and quantification of the sequence reads (binning). However, biological effects acting on specific functions may be overlooked if the classes are too general. METHODS: Here we introduce High-Resolution Binning (HirBin), a new method for gene-centric analysis of metagenomes. HirBin combines supervised annotation with unsupervised clustering to bin sequence reads at a higher resolution. The supervised annotation is performed by matching sequence fragments to genes using well-established protein domains, such as TIGRFAM, PFAM or COGs, followed by unsupervised clustering where each functional domain is further divided into sub-bins based on sequence similarity. Finally, differential abundance of the sub-bins is statistically assessed. RESULTS: We show that HirBin is able to identify biological effects that are only present at more specific functional levels. Furthermore we show that changes affecting more specific functional levels are often diluted at the more general level and therefore overlooked when analyzed using standard binning approaches. CONCLUSIONS: HirBin improves the resolution of the gene-centric analysis of metagenomes and facilitates the biological interpretation of the results. HirBin is implemented as a Python package and is freely available for download at http://bioinformatics.math.chalmers.se/hirbin. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3686-6) contains supplementary material, which is available to authorized users. BioMed Central 2017-04-21 /pmc/articles/PMC5399828/ /pubmed/28431529 http://dx.doi.org/10.1186/s12864-017-3686-6 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Österlund, Tobias Jonsson, Viktor Kristiansson, Erik HirBin: high-resolution identification of differentially abundant functions in metagenomes |
title | HirBin: high-resolution identification of differentially abundant functions in metagenomes |
title_full | HirBin: high-resolution identification of differentially abundant functions in metagenomes |
title_fullStr | HirBin: high-resolution identification of differentially abundant functions in metagenomes |
title_full_unstemmed | HirBin: high-resolution identification of differentially abundant functions in metagenomes |
title_short | HirBin: high-resolution identification of differentially abundant functions in metagenomes |
title_sort | hirbin: high-resolution identification of differentially abundant functions in metagenomes |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5399828/ https://www.ncbi.nlm.nih.gov/pubmed/28431529 http://dx.doi.org/10.1186/s12864-017-3686-6 |
work_keys_str_mv | AT osterlundtobias hirbinhighresolutionidentificationofdifferentiallyabundantfunctionsinmetagenomes AT jonssonviktor hirbinhighresolutionidentificationofdifferentiallyabundantfunctionsinmetagenomes AT kristianssonerik hirbinhighresolutionidentificationofdifferentiallyabundantfunctionsinmetagenomes |