Cargando…
Statistical Approach of Functional Profiling for a Microbial Community
BACKGROUND: Metagenomics is a relatively new but fast growing field within environmental biology and medical sciences. It enables researchers to understand the diversity of microbes, their functions, cooperation, and evolution in a particular ecosystem. Traditional methods in genomics and microbiolo...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4157783/ https://www.ncbi.nlm.nih.gov/pubmed/25198674 http://dx.doi.org/10.1371/journal.pone.0106588 |
_version_ | 1782333930210328576 |
---|---|
author | An, Lingling Pookhao, Nauromal Jiang, Hongmei Xu, Jiannong |
author_facet | An, Lingling Pookhao, Nauromal Jiang, Hongmei Xu, Jiannong |
author_sort | An, Lingling |
collection | PubMed |
description | BACKGROUND: Metagenomics is a relatively new but fast growing field within environmental biology and medical sciences. It enables researchers to understand the diversity of microbes, their functions, cooperation, and evolution in a particular ecosystem. Traditional methods in genomics and microbiology are not efficient in capturing the structure of the microbial community in an environment. Nowadays, high-throughput next-generation sequencing technologies are powerfully driving the metagenomic studies. However, there is an urgent need to develop efficient statistical methods and computational algorithms to rapidly analyze the massive metagenomic short sequencing data and to accurately detect the features/functions present in the microbial community. Although several issues about functions of metagenomes at pathways or subsystems level have been investigated, there is a lack of studies focusing on functional analysis at a low level of a hierarchical functional tree, such as SEED subsystem tree. RESULTS: A two-step statistical procedure (metaFunction) is proposed to detect all possible functional roles at the low level from a metagenomic sample/community. In the first step a statistical mixture model is proposed at the base of gene codons to estimate the abundances for the candidate functional roles, with sequencing error being considered. As a gene could be involved in multiple biological processes the functional assignment is therefore adjusted by utilizing an error distribution in the second step. The performance of the proposed procedure is evaluated through comprehensive simulation studies. Compared with other existing methods in metagenomic functional analysis the new approach is more accurate in assigning reads to functional roles, and therefore at more general levels. The method is also employed to analyze two real data sets. CONCLUSIONS: metaFunction is a powerful tool in accurate profiling functions in a metagenomic sample. |
format | Online Article Text |
id | pubmed-4157783 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-41577832014-09-09 Statistical Approach of Functional Profiling for a Microbial Community An, Lingling Pookhao, Nauromal Jiang, Hongmei Xu, Jiannong PLoS One Research Article BACKGROUND: Metagenomics is a relatively new but fast growing field within environmental biology and medical sciences. It enables researchers to understand the diversity of microbes, their functions, cooperation, and evolution in a particular ecosystem. Traditional methods in genomics and microbiology are not efficient in capturing the structure of the microbial community in an environment. Nowadays, high-throughput next-generation sequencing technologies are powerfully driving the metagenomic studies. However, there is an urgent need to develop efficient statistical methods and computational algorithms to rapidly analyze the massive metagenomic short sequencing data and to accurately detect the features/functions present in the microbial community. Although several issues about functions of metagenomes at pathways or subsystems level have been investigated, there is a lack of studies focusing on functional analysis at a low level of a hierarchical functional tree, such as SEED subsystem tree. RESULTS: A two-step statistical procedure (metaFunction) is proposed to detect all possible functional roles at the low level from a metagenomic sample/community. In the first step a statistical mixture model is proposed at the base of gene codons to estimate the abundances for the candidate functional roles, with sequencing error being considered. As a gene could be involved in multiple biological processes the functional assignment is therefore adjusted by utilizing an error distribution in the second step. The performance of the proposed procedure is evaluated through comprehensive simulation studies. Compared with other existing methods in metagenomic functional analysis the new approach is more accurate in assigning reads to functional roles, and therefore at more general levels. The method is also employed to analyze two real data sets. CONCLUSIONS: metaFunction is a powerful tool in accurate profiling functions in a metagenomic sample. Public Library of Science 2014-09-08 /pmc/articles/PMC4157783/ /pubmed/25198674 http://dx.doi.org/10.1371/journal.pone.0106588 Text en © 2014 An et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article An, Lingling Pookhao, Nauromal Jiang, Hongmei Xu, Jiannong Statistical Approach of Functional Profiling for a Microbial Community |
title | Statistical Approach of Functional Profiling for a Microbial Community |
title_full | Statistical Approach of Functional Profiling for a Microbial Community |
title_fullStr | Statistical Approach of Functional Profiling for a Microbial Community |
title_full_unstemmed | Statistical Approach of Functional Profiling for a Microbial Community |
title_short | Statistical Approach of Functional Profiling for a Microbial Community |
title_sort | statistical approach of functional profiling for a microbial community |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4157783/ https://www.ncbi.nlm.nih.gov/pubmed/25198674 http://dx.doi.org/10.1371/journal.pone.0106588 |
work_keys_str_mv | AT anlingling statisticalapproachoffunctionalprofilingforamicrobialcommunity AT pookhaonauromal statisticalapproachoffunctionalprofilingforamicrobialcommunity AT jianghongmei statisticalapproachoffunctionalprofilingforamicrobialcommunity AT xujiannong statisticalapproachoffunctionalprofilingforamicrobialcommunity |