Cargando…

mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences

BACKGROUND: Formation of operational taxonomic units (OTU) is a common approach to data aggregation in microbial ecology studies based on amplification and sequencing of individual gene targets. The de novo assembly of OTU sequences has been recently demonstrated as an alternative to widely used clu...

Descripción completa

Detalles Bibliográficos
Autores principales: Links, Matthew G, Chaban, Bonnie, Hemmingsen, Sean M, Muirhead, Kevin, Hill, Janet E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3971603/
https://www.ncbi.nlm.nih.gov/pubmed/24451012
http://dx.doi.org/10.1186/2049-2618-1-23
_version_ 1782309500830613504
author Links, Matthew G
Chaban, Bonnie
Hemmingsen, Sean M
Muirhead, Kevin
Hill, Janet E
author_facet Links, Matthew G
Chaban, Bonnie
Hemmingsen, Sean M
Muirhead, Kevin
Hill, Janet E
author_sort Links, Matthew G
collection PubMed
description BACKGROUND: Formation of operational taxonomic units (OTU) is a common approach to data aggregation in microbial ecology studies based on amplification and sequencing of individual gene targets. The de novo assembly of OTU sequences has been recently demonstrated as an alternative to widely used clustering methods, providing robust information from experimental data alone, without any reliance on an external reference database. RESULTS: Here we introduce mPUMA (microbial Profiling Using Metagenomic Assembly, http://mpuma.sourceforge.net), a software package for identification and analysis of protein-coding barcode sequence data. It was developed originally for Cpn60 universal target sequences (also known as GroEL or Hsp60). Using an unattended process that is independent of external reference sequences, mPUMA forms OTUs by DNA sequence assembly and is capable of tracking OTU abundance. mPUMA processes microbial profiles both in terms of the direct DNA sequence as well as in the translated amino acid sequence for protein coding barcodes. By forming OTUs and calculating abundance through an assembly approach, mPUMA is capable of generating inputs for several popular microbiota analysis tools. Using SFF data from sequencing of a synthetic community of Cpn60 sequences derived from the human vaginal microbiome, we demonstrate that mPUMA can faithfully reconstruct all expected OTU sequences and produce compositional profiles consistent with actual community structure. CONCLUSIONS: mPUMA enables analysis of microbial communities while empowering the discovery of novel organisms through OTU assembly.
format Online
Article
Text
id pubmed-3971603
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39716032014-04-02 mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences Links, Matthew G Chaban, Bonnie Hemmingsen, Sean M Muirhead, Kevin Hill, Janet E Microbiome Methodology BACKGROUND: Formation of operational taxonomic units (OTU) is a common approach to data aggregation in microbial ecology studies based on amplification and sequencing of individual gene targets. The de novo assembly of OTU sequences has been recently demonstrated as an alternative to widely used clustering methods, providing robust information from experimental data alone, without any reliance on an external reference database. RESULTS: Here we introduce mPUMA (microbial Profiling Using Metagenomic Assembly, http://mpuma.sourceforge.net), a software package for identification and analysis of protein-coding barcode sequence data. It was developed originally for Cpn60 universal target sequences (also known as GroEL or Hsp60). Using an unattended process that is independent of external reference sequences, mPUMA forms OTUs by DNA sequence assembly and is capable of tracking OTU abundance. mPUMA processes microbial profiles both in terms of the direct DNA sequence as well as in the translated amino acid sequence for protein coding barcodes. By forming OTUs and calculating abundance through an assembly approach, mPUMA is capable of generating inputs for several popular microbiota analysis tools. Using SFF data from sequencing of a synthetic community of Cpn60 sequences derived from the human vaginal microbiome, we demonstrate that mPUMA can faithfully reconstruct all expected OTU sequences and produce compositional profiles consistent with actual community structure. CONCLUSIONS: mPUMA enables analysis of microbial communities while empowering the discovery of novel organisms through OTU assembly. BioMed Central 2013-08-15 /pmc/articles/PMC3971603/ /pubmed/24451012 http://dx.doi.org/10.1186/2049-2618-1-23 Text en Copyright © 2013 Links et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology
Links, Matthew G
Chaban, Bonnie
Hemmingsen, Sean M
Muirhead, Kevin
Hill, Janet E
mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences
title mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences
title_full mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences
title_fullStr mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences
title_full_unstemmed mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences
title_short mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences
title_sort mpuma: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3971603/
https://www.ncbi.nlm.nih.gov/pubmed/24451012
http://dx.doi.org/10.1186/2049-2618-1-23
work_keys_str_mv AT linksmatthewg mpumaacomputationalapproachtomicrobiotaanalysisbydenovoassemblyofoperationaltaxonomicunitsbasedonproteincodingbarcodesequences
AT chabanbonnie mpumaacomputationalapproachtomicrobiotaanalysisbydenovoassemblyofoperationaltaxonomicunitsbasedonproteincodingbarcodesequences
AT hemmingsenseanm mpumaacomputationalapproachtomicrobiotaanalysisbydenovoassemblyofoperationaltaxonomicunitsbasedonproteincodingbarcodesequences
AT muirheadkevin mpumaacomputationalapproachtomicrobiotaanalysisbydenovoassemblyofoperationaltaxonomicunitsbasedonproteincodingbarcodesequences
AT hilljanete mpumaacomputationalapproachtomicrobiotaanalysisbydenovoassemblyofoperationaltaxonomicunitsbasedonproteincodingbarcodesequences