Cargando…

Bayesian mixture analysis for metagenomic community profiling

Motivation: Deep sequencing of clinical samples is now an established tool for the detection of infectious pathogens, with direct medical applications. The large amount of data generated produces an opportunity to detect species even at very low levels, provided that computational tools can effectiv...

Descripción completa

Detalles Bibliográficos
Autores principales: Morfopoulou, Sofia, Plagnol, Vincent
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4565032/
https://www.ncbi.nlm.nih.gov/pubmed/26002885
http://dx.doi.org/10.1093/bioinformatics/btv317
_version_ 1782389541767741440
author Morfopoulou, Sofia
Plagnol, Vincent
author_facet Morfopoulou, Sofia
Plagnol, Vincent
author_sort Morfopoulou, Sofia
collection PubMed
description Motivation: Deep sequencing of clinical samples is now an established tool for the detection of infectious pathogens, with direct medical applications. The large amount of data generated produces an opportunity to detect species even at very low levels, provided that computational tools can effectively profile the relevant metagenomic communities. Data interpretation is complicated by the fact that short sequencing reads can match multiple organisms and by the lack of completeness of existing databases, in particular for viral pathogens. Here we present metaMix, a Bayesian mixture model framework for resolving complex metagenomic mixtures. We show that the use of parallel Monte Carlo Markov chains for the exploration of the species space enables the identification of the set of species most likely to contribute to the mixture. Results: We demonstrate the greater accuracy of metaMix compared with relevant methods, particularly for profiling complex communities consisting of several related species. We designed metaMix specifically for the analysis of deep transcriptome sequencing datasets, with a focus on viral pathogen detection; however, the principles are generally applicable to all types of metagenomic mixtures. Availability and implementation: metaMix is implemented as a user friendly R package, freely available on CRAN: http://cran.r-project.org/web/packages/metaMix Contact: sofia.morfopoulou.10@ucl.ac.uk Supplementary information: Supplementary data are available at Bionformatics online.
format Online
Article
Text
id pubmed-4565032
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-45650322015-09-18 Bayesian mixture analysis for metagenomic community profiling Morfopoulou, Sofia Plagnol, Vincent Bioinformatics Original Papers Motivation: Deep sequencing of clinical samples is now an established tool for the detection of infectious pathogens, with direct medical applications. The large amount of data generated produces an opportunity to detect species even at very low levels, provided that computational tools can effectively profile the relevant metagenomic communities. Data interpretation is complicated by the fact that short sequencing reads can match multiple organisms and by the lack of completeness of existing databases, in particular for viral pathogens. Here we present metaMix, a Bayesian mixture model framework for resolving complex metagenomic mixtures. We show that the use of parallel Monte Carlo Markov chains for the exploration of the species space enables the identification of the set of species most likely to contribute to the mixture. Results: We demonstrate the greater accuracy of metaMix compared with relevant methods, particularly for profiling complex communities consisting of several related species. We designed metaMix specifically for the analysis of deep transcriptome sequencing datasets, with a focus on viral pathogen detection; however, the principles are generally applicable to all types of metagenomic mixtures. Availability and implementation: metaMix is implemented as a user friendly R package, freely available on CRAN: http://cran.r-project.org/web/packages/metaMix Contact: sofia.morfopoulou.10@ucl.ac.uk Supplementary information: Supplementary data are available at Bionformatics online. Oxford University Press 2015-09-15 2015-05-21 /pmc/articles/PMC4565032/ /pubmed/26002885 http://dx.doi.org/10.1093/bioinformatics/btv317 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Morfopoulou, Sofia
Plagnol, Vincent
Bayesian mixture analysis for metagenomic community profiling
title Bayesian mixture analysis for metagenomic community profiling
title_full Bayesian mixture analysis for metagenomic community profiling
title_fullStr Bayesian mixture analysis for metagenomic community profiling
title_full_unstemmed Bayesian mixture analysis for metagenomic community profiling
title_short Bayesian mixture analysis for metagenomic community profiling
title_sort bayesian mixture analysis for metagenomic community profiling
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4565032/
https://www.ncbi.nlm.nih.gov/pubmed/26002885
http://dx.doi.org/10.1093/bioinformatics/btv317
work_keys_str_mv AT morfopoulousofia bayesianmixtureanalysisformetagenomiccommunityprofiling
AT plagnolvincent bayesianmixtureanalysisformetagenomiccommunityprofiling