Cargando…
mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences
BACKGROUND: Formation of operational taxonomic units (OTU) is a common approach to data aggregation in microbial ecology studies based on amplification and sequencing of individual gene targets. The de novo assembly of OTU sequences has been recently demonstrated as an alternative to widely used clu...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3971603/ https://www.ncbi.nlm.nih.gov/pubmed/24451012 http://dx.doi.org/10.1186/2049-2618-1-23 |
_version_ | 1782309500830613504 |
---|---|
author | Links, Matthew G Chaban, Bonnie Hemmingsen, Sean M Muirhead, Kevin Hill, Janet E |
author_facet | Links, Matthew G Chaban, Bonnie Hemmingsen, Sean M Muirhead, Kevin Hill, Janet E |
author_sort | Links, Matthew G |
collection | PubMed |
description | BACKGROUND: Formation of operational taxonomic units (OTU) is a common approach to data aggregation in microbial ecology studies based on amplification and sequencing of individual gene targets. The de novo assembly of OTU sequences has been recently demonstrated as an alternative to widely used clustering methods, providing robust information from experimental data alone, without any reliance on an external reference database. RESULTS: Here we introduce mPUMA (microbial Profiling Using Metagenomic Assembly, http://mpuma.sourceforge.net), a software package for identification and analysis of protein-coding barcode sequence data. It was developed originally for Cpn60 universal target sequences (also known as GroEL or Hsp60). Using an unattended process that is independent of external reference sequences, mPUMA forms OTUs by DNA sequence assembly and is capable of tracking OTU abundance. mPUMA processes microbial profiles both in terms of the direct DNA sequence as well as in the translated amino acid sequence for protein coding barcodes. By forming OTUs and calculating abundance through an assembly approach, mPUMA is capable of generating inputs for several popular microbiota analysis tools. Using SFF data from sequencing of a synthetic community of Cpn60 sequences derived from the human vaginal microbiome, we demonstrate that mPUMA can faithfully reconstruct all expected OTU sequences and produce compositional profiles consistent with actual community structure. CONCLUSIONS: mPUMA enables analysis of microbial communities while empowering the discovery of novel organisms through OTU assembly. |
format | Online Article Text |
id | pubmed-3971603 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-39716032014-04-02 mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences Links, Matthew G Chaban, Bonnie Hemmingsen, Sean M Muirhead, Kevin Hill, Janet E Microbiome Methodology BACKGROUND: Formation of operational taxonomic units (OTU) is a common approach to data aggregation in microbial ecology studies based on amplification and sequencing of individual gene targets. The de novo assembly of OTU sequences has been recently demonstrated as an alternative to widely used clustering methods, providing robust information from experimental data alone, without any reliance on an external reference database. RESULTS: Here we introduce mPUMA (microbial Profiling Using Metagenomic Assembly, http://mpuma.sourceforge.net), a software package for identification and analysis of protein-coding barcode sequence data. It was developed originally for Cpn60 universal target sequences (also known as GroEL or Hsp60). Using an unattended process that is independent of external reference sequences, mPUMA forms OTUs by DNA sequence assembly and is capable of tracking OTU abundance. mPUMA processes microbial profiles both in terms of the direct DNA sequence as well as in the translated amino acid sequence for protein coding barcodes. By forming OTUs and calculating abundance through an assembly approach, mPUMA is capable of generating inputs for several popular microbiota analysis tools. Using SFF data from sequencing of a synthetic community of Cpn60 sequences derived from the human vaginal microbiome, we demonstrate that mPUMA can faithfully reconstruct all expected OTU sequences and produce compositional profiles consistent with actual community structure. CONCLUSIONS: mPUMA enables analysis of microbial communities while empowering the discovery of novel organisms through OTU assembly. BioMed Central 2013-08-15 /pmc/articles/PMC3971603/ /pubmed/24451012 http://dx.doi.org/10.1186/2049-2618-1-23 Text en Copyright © 2013 Links et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Links, Matthew G Chaban, Bonnie Hemmingsen, Sean M Muirhead, Kevin Hill, Janet E mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences |
title | mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences |
title_full | mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences |
title_fullStr | mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences |
title_full_unstemmed | mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences |
title_short | mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences |
title_sort | mpuma: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3971603/ https://www.ncbi.nlm.nih.gov/pubmed/24451012 http://dx.doi.org/10.1186/2049-2618-1-23 |
work_keys_str_mv | AT linksmatthewg mpumaacomputationalapproachtomicrobiotaanalysisbydenovoassemblyofoperationaltaxonomicunitsbasedonproteincodingbarcodesequences AT chabanbonnie mpumaacomputationalapproachtomicrobiotaanalysisbydenovoassemblyofoperationaltaxonomicunitsbasedonproteincodingbarcodesequences AT hemmingsenseanm mpumaacomputationalapproachtomicrobiotaanalysisbydenovoassemblyofoperationaltaxonomicunitsbasedonproteincodingbarcodesequences AT muirheadkevin mpumaacomputationalapproachtomicrobiotaanalysisbydenovoassemblyofoperationaltaxonomicunitsbasedonproteincodingbarcodesequences AT hilljanete mpumaacomputationalapproachtomicrobiotaanalysisbydenovoassemblyofoperationaltaxonomicunitsbasedonproteincodingbarcodesequences |