Cargando…

Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns

High-throughput sequencing of 16S ribosomal RNA gene amplicons has facilitated understanding of complex microbial communities, but the inherent noise in PCR and DNA sequencing limits differentiation of closely related bacteria. Although many scientific questions can be addressed with broad taxonomic...

Descripción completa

Detalles Bibliográficos
Autores principales: Amir, Amnon, McDonald, Daniel, Navas-Molina, Jose A., Kopylova, Evguenia, Morton, James T., Zech Xu, Zhenjiang, Kightley, Eric P., Thompson, Luke R., Hyde, Embriette R., Gonzalez, Antonio, Knight, Rob
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5340863/
https://www.ncbi.nlm.nih.gov/pubmed/28289731
http://dx.doi.org/10.1128/mSystems.00191-16
_version_ 1782512885830778880
author Amir, Amnon
McDonald, Daniel
Navas-Molina, Jose A.
Kopylova, Evguenia
Morton, James T.
Zech Xu, Zhenjiang
Kightley, Eric P.
Thompson, Luke R.
Hyde, Embriette R.
Gonzalez, Antonio
Knight, Rob
author_facet Amir, Amnon
McDonald, Daniel
Navas-Molina, Jose A.
Kopylova, Evguenia
Morton, James T.
Zech Xu, Zhenjiang
Kightley, Eric P.
Thompson, Luke R.
Hyde, Embriette R.
Gonzalez, Antonio
Knight, Rob
author_sort Amir, Amnon
collection PubMed
description High-throughput sequencing of 16S ribosomal RNA gene amplicons has facilitated understanding of complex microbial communities, but the inherent noise in PCR and DNA sequencing limits differentiation of closely related bacteria. Although many scientific questions can be addressed with broad taxonomic profiles, clinical, food safety, and some ecological applications require higher specificity. Here we introduce a novel sub-operational-taxonomic-unit (sOTU) approach, Deblur, that uses error profiles to obtain putative error-free sequences from Illumina MiSeq and HiSeq sequencing platforms. Deblur substantially reduces computational demands relative to similar sOTU methods and does so with similar or better sensitivity and specificity. Using simulations, mock mixtures, and real data sets, we detected closely related bacterial sequences with single nucleotide differences while removing false positives and maintaining stability in detection, suggesting that Deblur is limited only by read length and diversity within the amplicon sequences. Because Deblur operates on a per-sample level, it scales to modern data sets and meta-analyses. To highlight Deblur’s ability to integrate data sets, we include an interactive exploration of its application to multiple distinct sequencing rounds of the American Gut Project. Deblur is open source under the Berkeley Software Distribution (BSD) license, easily installable, and downloadable from https://github.com/biocore/deblur. IMPORTANCE Deblur provides a rapid and sensitive means to assess ecological patterns driven by differentiation of closely related taxa. This algorithm provides a solution to the problem of identifying real ecological differences between taxa whose amplicons differ by a single base pair, is applicable in an automated fashion to large-scale sequencing data sets, and can integrate sequencing runs collected over time.
format Online
Article
Text
id pubmed-5340863
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-53408632017-03-13 Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns Amir, Amnon McDonald, Daniel Navas-Molina, Jose A. Kopylova, Evguenia Morton, James T. Zech Xu, Zhenjiang Kightley, Eric P. Thompson, Luke R. Hyde, Embriette R. Gonzalez, Antonio Knight, Rob mSystems Observation High-throughput sequencing of 16S ribosomal RNA gene amplicons has facilitated understanding of complex microbial communities, but the inherent noise in PCR and DNA sequencing limits differentiation of closely related bacteria. Although many scientific questions can be addressed with broad taxonomic profiles, clinical, food safety, and some ecological applications require higher specificity. Here we introduce a novel sub-operational-taxonomic-unit (sOTU) approach, Deblur, that uses error profiles to obtain putative error-free sequences from Illumina MiSeq and HiSeq sequencing platforms. Deblur substantially reduces computational demands relative to similar sOTU methods and does so with similar or better sensitivity and specificity. Using simulations, mock mixtures, and real data sets, we detected closely related bacterial sequences with single nucleotide differences while removing false positives and maintaining stability in detection, suggesting that Deblur is limited only by read length and diversity within the amplicon sequences. Because Deblur operates on a per-sample level, it scales to modern data sets and meta-analyses. To highlight Deblur’s ability to integrate data sets, we include an interactive exploration of its application to multiple distinct sequencing rounds of the American Gut Project. Deblur is open source under the Berkeley Software Distribution (BSD) license, easily installable, and downloadable from https://github.com/biocore/deblur. IMPORTANCE Deblur provides a rapid and sensitive means to assess ecological patterns driven by differentiation of closely related taxa. This algorithm provides a solution to the problem of identifying real ecological differences between taxa whose amplicons differ by a single base pair, is applicable in an automated fashion to large-scale sequencing data sets, and can integrate sequencing runs collected over time. American Society for Microbiology 2017-03-07 /pmc/articles/PMC5340863/ /pubmed/28289731 http://dx.doi.org/10.1128/mSystems.00191-16 Text en Copyright © 2017 Amir et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (http://creativecommons.org/licenses/by/4.0/) .
spellingShingle Observation
Amir, Amnon
McDonald, Daniel
Navas-Molina, Jose A.
Kopylova, Evguenia
Morton, James T.
Zech Xu, Zhenjiang
Kightley, Eric P.
Thompson, Luke R.
Hyde, Embriette R.
Gonzalez, Antonio
Knight, Rob
Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns
title Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns
title_full Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns
title_fullStr Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns
title_full_unstemmed Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns
title_short Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns
title_sort deblur rapidly resolves single-nucleotide community sequence patterns
topic Observation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5340863/
https://www.ncbi.nlm.nih.gov/pubmed/28289731
http://dx.doi.org/10.1128/mSystems.00191-16
work_keys_str_mv AT amiramnon deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns
AT mcdonalddaniel deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns
AT navasmolinajosea deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns
AT kopylovaevguenia deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns
AT mortonjamest deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns
AT zechxuzhenjiang deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns
AT kightleyericp deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns
AT thompsonluker deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns
AT hydeembrietter deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns
AT gonzalezantonio deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns
AT knightrob deblurrapidlyresolvessinglenucleotidecommunitysequencepatterns