Cargando…

MetaBoot: a machine learning framework of taxonomical biomarker discovery for different microbial communities based on metagenomic data

As more than 90% of species in a microbial community could not be isolated and cultivated, the metagenomic methods have become one of the most important methods to analyze microbial community as a whole. With the fast accumulation of metagenomic samples and the advance of next-generation sequencing...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Xiaojun, Su, Xiaoquan, Cui, Xinping, Ning, Kang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4512773/
https://www.ncbi.nlm.nih.gov/pubmed/26213658
http://dx.doi.org/10.7717/peerj.993
_version_ 1782382558169792512
author Wang, Xiaojun
Su, Xiaoquan
Cui, Xinping
Ning, Kang
author_facet Wang, Xiaojun
Su, Xiaoquan
Cui, Xinping
Ning, Kang
author_sort Wang, Xiaojun
collection PubMed
description As more than 90% of species in a microbial community could not be isolated and cultivated, the metagenomic methods have become one of the most important methods to analyze microbial community as a whole. With the fast accumulation of metagenomic samples and the advance of next-generation sequencing techniques, it is now possible to qualitatively and quantitatively assess all taxa (features) in a microbial community. A set of taxa with presence/absence or their different abundances could potentially be used as taxonomical biomarkers for identification of the corresponding microbial community’s phenotype. Though there exist some bioinformatics methods for metagenomic biomarker discovery, current methods are not robust, accurate and fast enough at selection of non-redundant biomarkers for prediction of microbial community’s phenotype. In this study, we have proposed a novel method, MetaBoot, that combines the techniques of mRMR (minimal redundancy maximal relevance) and bootstrapping, for discover of non-redundant biomarkers for microbial communities through mining of metagenomic data. MetaBoot has been tested and compared with other methods on well-designed simulated datasets considering normal and gamma distribution as well as publicly available metagenomic datasets. Results have shown that MetaBoot was robust across datasets of varied complexity and taxonomical distribution patterns and could also select discriminative biomarkers with quite high accuracy and biological consistency. Thus, MetaBoot is suitable for robustly and accurately discover taxonomical biomarkers for different microbial communities.
format Online
Article
Text
id pubmed-4512773
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-45127732015-07-24 MetaBoot: a machine learning framework of taxonomical biomarker discovery for different microbial communities based on metagenomic data Wang, Xiaojun Su, Xiaoquan Cui, Xinping Ning, Kang PeerJ Bioinformatics As more than 90% of species in a microbial community could not be isolated and cultivated, the metagenomic methods have become one of the most important methods to analyze microbial community as a whole. With the fast accumulation of metagenomic samples and the advance of next-generation sequencing techniques, it is now possible to qualitatively and quantitatively assess all taxa (features) in a microbial community. A set of taxa with presence/absence or their different abundances could potentially be used as taxonomical biomarkers for identification of the corresponding microbial community’s phenotype. Though there exist some bioinformatics methods for metagenomic biomarker discovery, current methods are not robust, accurate and fast enough at selection of non-redundant biomarkers for prediction of microbial community’s phenotype. In this study, we have proposed a novel method, MetaBoot, that combines the techniques of mRMR (minimal redundancy maximal relevance) and bootstrapping, for discover of non-redundant biomarkers for microbial communities through mining of metagenomic data. MetaBoot has been tested and compared with other methods on well-designed simulated datasets considering normal and gamma distribution as well as publicly available metagenomic datasets. Results have shown that MetaBoot was robust across datasets of varied complexity and taxonomical distribution patterns and could also select discriminative biomarkers with quite high accuracy and biological consistency. Thus, MetaBoot is suitable for robustly and accurately discover taxonomical biomarkers for different microbial communities. PeerJ Inc. 2015-07-07 /pmc/articles/PMC4512773/ /pubmed/26213658 http://dx.doi.org/10.7717/peerj.993 Text en © 2015 Wang et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Wang, Xiaojun
Su, Xiaoquan
Cui, Xinping
Ning, Kang
MetaBoot: a machine learning framework of taxonomical biomarker discovery for different microbial communities based on metagenomic data
title MetaBoot: a machine learning framework of taxonomical biomarker discovery for different microbial communities based on metagenomic data
title_full MetaBoot: a machine learning framework of taxonomical biomarker discovery for different microbial communities based on metagenomic data
title_fullStr MetaBoot: a machine learning framework of taxonomical biomarker discovery for different microbial communities based on metagenomic data
title_full_unstemmed MetaBoot: a machine learning framework of taxonomical biomarker discovery for different microbial communities based on metagenomic data
title_short MetaBoot: a machine learning framework of taxonomical biomarker discovery for different microbial communities based on metagenomic data
title_sort metaboot: a machine learning framework of taxonomical biomarker discovery for different microbial communities based on metagenomic data
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4512773/
https://www.ncbi.nlm.nih.gov/pubmed/26213658
http://dx.doi.org/10.7717/peerj.993
work_keys_str_mv AT wangxiaojun metabootamachinelearningframeworkoftaxonomicalbiomarkerdiscoveryfordifferentmicrobialcommunitiesbasedonmetagenomicdata
AT suxiaoquan metabootamachinelearningframeworkoftaxonomicalbiomarkerdiscoveryfordifferentmicrobialcommunitiesbasedonmetagenomicdata
AT cuixinping metabootamachinelearningframeworkoftaxonomicalbiomarkerdiscoveryfordifferentmicrobialcommunitiesbasedonmetagenomicdata
AT ningkang metabootamachinelearningframeworkoftaxonomicalbiomarkerdiscoveryfordifferentmicrobialcommunitiesbasedonmetagenomicdata