Cargando…

Accuracy and quality of massively parallel DNA pyrosequencing

BACKGROUND: Massively parallel pyrosequencing systems have increased the efficiency of DNA sequencing, although the published per-base accuracy of a Roche GS20 is only 96%. In genome projects, highly redundant consensus assemblies can compensate for sequencing errors. In contrast, studies of microbi...

Descripción completa

Detalles Bibliográficos
Autores principales: Huse, Susan M, Huber, Julie A, Morrison, Hilary G, Sogin, Mitchell L, Welch, David Mark
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2323236/
https://www.ncbi.nlm.nih.gov/pubmed/17659080
http://dx.doi.org/10.1186/gb-2007-8-7-r143
_version_ 1782152630225600512
author Huse, Susan M
Huber, Julie A
Morrison, Hilary G
Sogin, Mitchell L
Welch, David Mark
author_facet Huse, Susan M
Huber, Julie A
Morrison, Hilary G
Sogin, Mitchell L
Welch, David Mark
author_sort Huse, Susan M
collection PubMed
description BACKGROUND: Massively parallel pyrosequencing systems have increased the efficiency of DNA sequencing, although the published per-base accuracy of a Roche GS20 is only 96%. In genome projects, highly redundant consensus assemblies can compensate for sequencing errors. In contrast, studies of microbial diversity that catalogue differences between PCR amplicons of ribosomal RNA genes (rDNA) or other conserved gene families cannot take advantage of consensus assemblies to detect and minimize incorrect base calls. RESULTS: We performed an empirical study of the per-base error rate for the Roche GS20 system using sequences of the V6 hypervariable region from cloned microbial ribosomal DNA (tag sequencing). We calculated a 99.5% accuracy rate in unassembled sequences, and identified several factors that can be used to remove a small percentage of low-quality reads, improving the accuracy to 99.75% or better. CONCLUSION: By using objective criteria to eliminate low quality data, the quality of individual GS20 sequence reads in molecular ecological applications can surpass the accuracy of traditional capillary methods.
format Text
id pubmed-2323236
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23232362008-04-19 Accuracy and quality of massively parallel DNA pyrosequencing Huse, Susan M Huber, Julie A Morrison, Hilary G Sogin, Mitchell L Welch, David Mark Genome Biol Research BACKGROUND: Massively parallel pyrosequencing systems have increased the efficiency of DNA sequencing, although the published per-base accuracy of a Roche GS20 is only 96%. In genome projects, highly redundant consensus assemblies can compensate for sequencing errors. In contrast, studies of microbial diversity that catalogue differences between PCR amplicons of ribosomal RNA genes (rDNA) or other conserved gene families cannot take advantage of consensus assemblies to detect and minimize incorrect base calls. RESULTS: We performed an empirical study of the per-base error rate for the Roche GS20 system using sequences of the V6 hypervariable region from cloned microbial ribosomal DNA (tag sequencing). We calculated a 99.5% accuracy rate in unassembled sequences, and identified several factors that can be used to remove a small percentage of low-quality reads, improving the accuracy to 99.75% or better. CONCLUSION: By using objective criteria to eliminate low quality data, the quality of individual GS20 sequence reads in molecular ecological applications can surpass the accuracy of traditional capillary methods. BioMed Central 2007 2007-07-20 /pmc/articles/PMC2323236/ /pubmed/17659080 http://dx.doi.org/10.1186/gb-2007-8-7-r143 Text en Copyright © 2007 Huse et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Huse, Susan M
Huber, Julie A
Morrison, Hilary G
Sogin, Mitchell L
Welch, David Mark
Accuracy and quality of massively parallel DNA pyrosequencing
title Accuracy and quality of massively parallel DNA pyrosequencing
title_full Accuracy and quality of massively parallel DNA pyrosequencing
title_fullStr Accuracy and quality of massively parallel DNA pyrosequencing
title_full_unstemmed Accuracy and quality of massively parallel DNA pyrosequencing
title_short Accuracy and quality of massively parallel DNA pyrosequencing
title_sort accuracy and quality of massively parallel dna pyrosequencing
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2323236/
https://www.ncbi.nlm.nih.gov/pubmed/17659080
http://dx.doi.org/10.1186/gb-2007-8-7-r143
work_keys_str_mv AT husesusanm accuracyandqualityofmassivelyparalleldnapyrosequencing
AT huberjuliea accuracyandqualityofmassivelyparalleldnapyrosequencing
AT morrisonhilaryg accuracyandqualityofmassivelyparalleldnapyrosequencing
AT soginmitchelll accuracyandqualityofmassivelyparalleldnapyrosequencing
AT welchdavidmark accuracyandqualityofmassivelyparalleldnapyrosequencing