Cargando…
Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns
High-throughput sequencing of 16S ribosomal RNA gene amplicons has facilitated understanding of complex microbial communities, but the inherent noise in PCR and DNA sequencing limits differentiation of closely related bacteria. Although many scientific questions can be addressed with broad taxonomic...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Society for Microbiology
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5340863/ https://www.ncbi.nlm.nih.gov/pubmed/28289731 http://dx.doi.org/10.1128/mSystems.00191-16 |
_version_ | 1782512885830778880 |
---|---|
author | Amir, Amnon McDonald, Daniel Navas-Molina, Jose A. Kopylova, Evguenia Morton, James T. Zech Xu, Zhenjiang Kightley, Eric P. Thompson, Luke R. Hyde, Embriette R. Gonzalez, Antonio Knight, Rob |
author_facet | Amir, Amnon McDonald, Daniel Navas-Molina, Jose A. Kopylova, Evguenia Morton, James T. Zech Xu, Zhenjiang Kightley, Eric P. Thompson, Luke R. Hyde, Embriette R. Gonzalez, Antonio Knight, Rob |
author_sort | Amir, Amnon |
collection | PubMed |
description | High-throughput sequencing of 16S ribosomal RNA gene amplicons has facilitated understanding of complex microbial communities, but the inherent noise in PCR and DNA sequencing limits differentiation of closely related bacteria. Although many scientific questions can be addressed with broad taxonomic profiles, clinical, food safety, and some ecological applications require higher specificity. Here we introduce a novel sub-operational-taxonomic-unit (sOTU) approach, Deblur, that uses error profiles to obtain putative error-free sequences from Illumina MiSeq and HiSeq sequencing platforms. Deblur substantially reduces computational demands relative to similar sOTU methods and does so with similar or better sensitivity and specificity. Using simulations, mock mixtures, and real data sets, we detected closely related bacterial sequences with single nucleotide differences while removing false positives and maintaining stability in detection, suggesting that Deblur is limited only by read length and diversity within the amplicon sequences. Because Deblur operates on a per-sample level, it scales to modern data sets and meta-analyses. To highlight Deblur’s ability to integrate data sets, we include an interactive exploration of its application to multiple distinct sequencing rounds of the American Gut Project. Deblur is open source under the Berkeley Software Distribution (BSD) license, easily installable, and downloadable from https://github.com/biocore/deblur. IMPORTANCE Deblur provides a rapid and sensitive means to assess ecological patterns driven by differentiation of closely related taxa. This algorithm provides a solution to the problem of identifying real ecological differences between taxa whose amplicons differ by a single base pair, is applicable in an automated fashion to large-scale sequencing data sets, and can integrate sequencing runs collected over time. |
format | Online Article Text |
id | pubmed-5340863 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | American Society for Microbiology |
record_format | MEDLINE/PubMed |
spelling | pubmed-53408632017-03-13 Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns Amir, Amnon McDonald, Daniel Navas-Molina, Jose A. Kopylova, Evguenia Morton, James T. Zech Xu, Zhenjiang Kightley, Eric P. Thompson, Luke R. Hyde, Embriette R. Gonzalez, Antonio Knight, Rob mSystems Observation High-throughput sequencing of 16S ribosomal RNA gene amplicons has facilitated understanding of complex microbial communities, but the inherent noise in PCR and DNA sequencing limits differentiation of closely related bacteria. Although many scientific questions can be addressed with broad taxonomic profiles, clinical, food safety, and some ecological applications require higher specificity. Here we introduce a novel sub-operational-taxonomic-unit (sOTU) approach, Deblur, that uses error profiles to obtain putative error-free sequences from Illumina MiSeq and HiSeq sequencing platforms. Deblur substantially reduces computational demands relative to similar sOTU methods and does so with similar or better sensitivity and specificity. Using simulations, mock mixtures, and real data sets, we detected closely related bacterial sequences with single nucleotide differences while removing false positives and maintaining stability in detection, suggesting that Deblur is limited only by read length and diversity within the amplicon sequences. Because Deblur operates on a per-sample level, it scales to modern data sets and meta-analyses. To highlight Deblur’s ability to integrate data sets, we include an interactive exploration of its application to multiple distinct sequencing rounds of the American Gut Project. Deblur is open source under the Berkeley Software Distribution (BSD) license, easily installable, and downloadable from https://github.com/biocore/deblur. IMPORTANCE Deblur provides a rapid and sensitive means to assess ecological patterns driven by differentiation of closely related taxa. This algorithm provides a solution to the problem of identifying real ecological differences between taxa whose amplicons differ by a single base pair, is applicable in an automated fashion to large-scale sequencing data sets, and can integrate sequencing runs collected over time. American Society for Microbiology 2017-03-07 /pmc/articles/PMC5340863/ /pubmed/28289731 http://dx.doi.org/10.1128/mSystems.00191-16 Text en Copyright © 2017 Amir et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (http://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Observation Amir, Amnon McDonald, Daniel Navas-Molina, Jose A. Kopylova, Evguenia Morton, James T. Zech Xu, Zhenjiang Kightley, Eric P. Thompson, Luke R. Hyde, Embriette R. Gonzalez, Antonio Knight, Rob Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns |
title | Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns |
title_full | Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns |
title_fullStr | Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns |
title_full_unstemmed | Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns |
title_short | Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns |
title_sort | deblur rapidly resolves single-nucleotide community sequence patterns |
topic | Observation |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5340863/ https://www.ncbi.nlm.nih.gov/pubmed/28289731 http://dx.doi.org/10.1128/mSystems.00191-16 |
work_keys_str_mv | AT amiramnon deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns AT mcdonalddaniel deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns AT navasmolinajosea deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns AT kopylovaevguenia deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns AT mortonjamest deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns AT zechxuzhenjiang deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns AT kightleyericp deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns AT thompsonluker deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns AT hydeembrietter deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns AT gonzalezantonio deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns AT knightrob deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns |