Cargando…

Ultra-deep mutant spectrum profiling: improving sequencing accuracy using overlapping read pairs

BACKGOUND: High throughput sequencing is beginning to make a transformative impact in the area of viral evolution. Deep sequencing has the potential to reveal the mutant spectrum within a viral sample at high resolution, thus enabling the close examination of viral mutational dynamics both within- a...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen-Harris, Haiyin, Borucki, Monica K, Torres, Clinton, Slezak, Tom R, Allen, Jonathan E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3599684/
https://www.ncbi.nlm.nih.gov/pubmed/23402258
http://dx.doi.org/10.1186/1471-2164-14-96
_version_ 1782263026075828224
author Chen-Harris, Haiyin
Borucki, Monica K
Torres, Clinton
Slezak, Tom R
Allen, Jonathan E
author_facet Chen-Harris, Haiyin
Borucki, Monica K
Torres, Clinton
Slezak, Tom R
Allen, Jonathan E
author_sort Chen-Harris, Haiyin
collection PubMed
description BACKGOUND: High throughput sequencing is beginning to make a transformative impact in the area of viral evolution. Deep sequencing has the potential to reveal the mutant spectrum within a viral sample at high resolution, thus enabling the close examination of viral mutational dynamics both within- and between-hosts. The challenge however, is to accurately model the errors in the sequencing data and differentiate real viral mutations, particularly those that exist at low frequencies, from sequencing errors. RESULTS: We demonstrate that overlapping read pairs (ORP) -- generated by combining short fragment sequencing libraries and longer sequencing reads -- significantly reduce sequencing error rates and improve rare variant detection accuracy. Using this sequencing protocol and an error model optimized for variant detection, we are able to capture a large number of genetic mutations present within a viral population at ultra-low frequency levels (<0.05%). CONCLUSIONS: Our rare variant detection strategies have important implications beyond viral evolution and can be applied to any basic and clinical research area that requires the identification of rare mutations.
format Online
Article
Text
id pubmed-3599684
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35996842013-03-23 Ultra-deep mutant spectrum profiling: improving sequencing accuracy using overlapping read pairs Chen-Harris, Haiyin Borucki, Monica K Torres, Clinton Slezak, Tom R Allen, Jonathan E BMC Genomics Methodology Article BACKGOUND: High throughput sequencing is beginning to make a transformative impact in the area of viral evolution. Deep sequencing has the potential to reveal the mutant spectrum within a viral sample at high resolution, thus enabling the close examination of viral mutational dynamics both within- and between-hosts. The challenge however, is to accurately model the errors in the sequencing data and differentiate real viral mutations, particularly those that exist at low frequencies, from sequencing errors. RESULTS: We demonstrate that overlapping read pairs (ORP) -- generated by combining short fragment sequencing libraries and longer sequencing reads -- significantly reduce sequencing error rates and improve rare variant detection accuracy. Using this sequencing protocol and an error model optimized for variant detection, we are able to capture a large number of genetic mutations present within a viral population at ultra-low frequency levels (<0.05%). CONCLUSIONS: Our rare variant detection strategies have important implications beyond viral evolution and can be applied to any basic and clinical research area that requires the identification of rare mutations. BioMed Central 2013-02-12 /pmc/articles/PMC3599684/ /pubmed/23402258 http://dx.doi.org/10.1186/1471-2164-14-96 Text en Copyright ©2013 Chen-Harris et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Chen-Harris, Haiyin
Borucki, Monica K
Torres, Clinton
Slezak, Tom R
Allen, Jonathan E
Ultra-deep mutant spectrum profiling: improving sequencing accuracy using overlapping read pairs
title Ultra-deep mutant spectrum profiling: improving sequencing accuracy using overlapping read pairs
title_full Ultra-deep mutant spectrum profiling: improving sequencing accuracy using overlapping read pairs
title_fullStr Ultra-deep mutant spectrum profiling: improving sequencing accuracy using overlapping read pairs
title_full_unstemmed Ultra-deep mutant spectrum profiling: improving sequencing accuracy using overlapping read pairs
title_short Ultra-deep mutant spectrum profiling: improving sequencing accuracy using overlapping read pairs
title_sort ultra-deep mutant spectrum profiling: improving sequencing accuracy using overlapping read pairs
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3599684/
https://www.ncbi.nlm.nih.gov/pubmed/23402258
http://dx.doi.org/10.1186/1471-2164-14-96
work_keys_str_mv AT chenharrishaiyin ultradeepmutantspectrumprofilingimprovingsequencingaccuracyusingoverlappingreadpairs
AT boruckimonicak ultradeepmutantspectrumprofilingimprovingsequencingaccuracyusingoverlappingreadpairs
AT torresclinton ultradeepmutantspectrumprofilingimprovingsequencingaccuracyusingoverlappingreadpairs
AT slezaktomr ultradeepmutantspectrumprofilingimprovingsequencingaccuracyusingoverlappingreadpairs
AT allenjonathane ultradeepmutantspectrumprofilingimprovingsequencingaccuracyusingoverlappingreadpairs