Cargando…

Re-purposing software for functional characterization of the microbiome

BACKGROUND: Widespread bioinformatic resource development generates a constantly evolving and abundant landscape of workflows and software. For analysis of the microbiome, workflows typically begin with taxonomic classification of the microorganisms that are present in a given environment. Additiona...

Descripción completa

Detalles Bibliográficos
Autores principales: Gardiner, Laura-Jayne, Haiminen, Niina, Utro, Filippo, Parida, Laxmi, Seabolt, Ed, Krishna, Ritesh, Kaufman, James H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7797099/
https://www.ncbi.nlm.nih.gov/pubmed/33422152
http://dx.doi.org/10.1186/s40168-020-00971-1
_version_ 1783634801885970432
author Gardiner, Laura-Jayne
Haiminen, Niina
Utro, Filippo
Parida, Laxmi
Seabolt, Ed
Krishna, Ritesh
Kaufman, James H.
author_facet Gardiner, Laura-Jayne
Haiminen, Niina
Utro, Filippo
Parida, Laxmi
Seabolt, Ed
Krishna, Ritesh
Kaufman, James H.
author_sort Gardiner, Laura-Jayne
collection PubMed
description BACKGROUND: Widespread bioinformatic resource development generates a constantly evolving and abundant landscape of workflows and software. For analysis of the microbiome, workflows typically begin with taxonomic classification of the microorganisms that are present in a given environment. Additional investigation is then required to uncover the functionality of the microbial community, in order to characterize its currently or potentially active biological processes. Such functional analysis of metagenomic data can be computationally demanding for high-throughput sequencing experiments. Instead, we can directly compare sequencing reads to a functionally annotated database. However, since reads frequently match multiple sequences equally well, analyses benefit from a hierarchical annotation tree, e.g. for taxonomic classification where reads are assigned to the lowest taxonomic unit. RESULTS: To facilitate functional microbiome analysis, we re-purpose well-known taxonomic classification tools to allow us to perform direct functional sequencing read classification with the added benefit of a functional hierarchy. To enable this, we develop and present a tree-shaped functional hierarchy representing the molecular function subset of the Gene Ontology annotation structure. We use this functional hierarchy to replace the standard phylogenetic taxonomy used by the classification tools and assign query sequences accurately to the lowest possible molecular function in the tree. We demonstrate this with simulated and experimental datasets, where we reveal new biological insights. CONCLUSIONS: We demonstrate that improved functional classification of metagenomic sequencing reads is possible by re-purposing a range of taxonomic classification tools that are already well-established, in conjunction with either protein or nucleotide reference databases. We leverage the advances in speed, accuracy and efficiency that have been made for taxonomic classification and translate these benefits for the rapid functional classification of microbiomes. While we focus on a specific set of commonly used methods, the functional annotation approach has broad applicability across other sequence classification tools. We hope that re-purposing becomes a routine consideration during bioinformatic resource development. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-020-00971-1.
format Online
Article
Text
id pubmed-7797099
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-77970992021-01-11 Re-purposing software for functional characterization of the microbiome Gardiner, Laura-Jayne Haiminen, Niina Utro, Filippo Parida, Laxmi Seabolt, Ed Krishna, Ritesh Kaufman, James H. Microbiome Short Report BACKGROUND: Widespread bioinformatic resource development generates a constantly evolving and abundant landscape of workflows and software. For analysis of the microbiome, workflows typically begin with taxonomic classification of the microorganisms that are present in a given environment. Additional investigation is then required to uncover the functionality of the microbial community, in order to characterize its currently or potentially active biological processes. Such functional analysis of metagenomic data can be computationally demanding for high-throughput sequencing experiments. Instead, we can directly compare sequencing reads to a functionally annotated database. However, since reads frequently match multiple sequences equally well, analyses benefit from a hierarchical annotation tree, e.g. for taxonomic classification where reads are assigned to the lowest taxonomic unit. RESULTS: To facilitate functional microbiome analysis, we re-purpose well-known taxonomic classification tools to allow us to perform direct functional sequencing read classification with the added benefit of a functional hierarchy. To enable this, we develop and present a tree-shaped functional hierarchy representing the molecular function subset of the Gene Ontology annotation structure. We use this functional hierarchy to replace the standard phylogenetic taxonomy used by the classification tools and assign query sequences accurately to the lowest possible molecular function in the tree. We demonstrate this with simulated and experimental datasets, where we reveal new biological insights. CONCLUSIONS: We demonstrate that improved functional classification of metagenomic sequencing reads is possible by re-purposing a range of taxonomic classification tools that are already well-established, in conjunction with either protein or nucleotide reference databases. We leverage the advances in speed, accuracy and efficiency that have been made for taxonomic classification and translate these benefits for the rapid functional classification of microbiomes. While we focus on a specific set of commonly used methods, the functional annotation approach has broad applicability across other sequence classification tools. We hope that re-purposing becomes a routine consideration during bioinformatic resource development. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-020-00971-1. BioMed Central 2021-01-09 /pmc/articles/PMC7797099/ /pubmed/33422152 http://dx.doi.org/10.1186/s40168-020-00971-1 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Short Report
Gardiner, Laura-Jayne
Haiminen, Niina
Utro, Filippo
Parida, Laxmi
Seabolt, Ed
Krishna, Ritesh
Kaufman, James H.
Re-purposing software for functional characterization of the microbiome
title Re-purposing software for functional characterization of the microbiome
title_full Re-purposing software for functional characterization of the microbiome
title_fullStr Re-purposing software for functional characterization of the microbiome
title_full_unstemmed Re-purposing software for functional characterization of the microbiome
title_short Re-purposing software for functional characterization of the microbiome
title_sort re-purposing software for functional characterization of the microbiome
topic Short Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7797099/
https://www.ncbi.nlm.nih.gov/pubmed/33422152
http://dx.doi.org/10.1186/s40168-020-00971-1
work_keys_str_mv AT gardinerlaurajayne repurposingsoftwareforfunctionalcharacterizationofthemicrobiome
AT haiminenniina repurposingsoftwareforfunctionalcharacterizationofthemicrobiome
AT utrofilippo repurposingsoftwareforfunctionalcharacterizationofthemicrobiome
AT paridalaxmi repurposingsoftwareforfunctionalcharacterizationofthemicrobiome
AT seabolted repurposingsoftwareforfunctionalcharacterizationofthemicrobiome
AT krishnaritesh repurposingsoftwareforfunctionalcharacterizationofthemicrobiome
AT kaufmanjamesh repurposingsoftwareforfunctionalcharacterizationofthemicrobiome