Cargando…

Challenges and opportunities in estimating viral genetic diversity from next-generation sequencing data

Many viruses, including the clinically relevant RNA viruses HIV (human immunodeficiency virus) and HCV (hepatitis C virus), exist in large populations and display high genetic heterogeneity within and between infected hosts. Assessing intra-patient viral genetic diversity is essential for understand...

Descripción completa

Detalles Bibliográficos
Autores principales: Beerenwinkel, Niko, Günthard, Huldrych F., Roth, Volker, Metzner, Karin J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3438994/
https://www.ncbi.nlm.nih.gov/pubmed/22973268
http://dx.doi.org/10.3389/fmicb.2012.00329
_version_ 1782242944234815488
author Beerenwinkel, Niko
Günthard, Huldrych F.
Roth, Volker
Metzner, Karin J.
author_facet Beerenwinkel, Niko
Günthard, Huldrych F.
Roth, Volker
Metzner, Karin J.
author_sort Beerenwinkel, Niko
collection PubMed
description Many viruses, including the clinically relevant RNA viruses HIV (human immunodeficiency virus) and HCV (hepatitis C virus), exist in large populations and display high genetic heterogeneity within and between infected hosts. Assessing intra-patient viral genetic diversity is essential for understanding the evolutionary dynamics of viruses, for designing effective vaccines, and for the success of antiviral therapy. Next-generation sequencing (NGS) technologies allow the rapid and cost-effective acquisition of thousands to millions of short DNA sequences from a single sample. However, this approach entails several challenges in experimental design and computational data analysis. Here, we review the entire process of inferring viral diversity from sample collection to computing measures of genetic diversity. We discuss sample preparation, including reverse transcription and amplification, and the effect of experimental conditions on diversity estimates due to in vitro base substitutions, insertions, deletions, and recombination. The use of different NGS platforms and their sequencing error profiles are compared in the context of various applications of diversity estimation, ranging from the detection of single nucleotide variants (SNVs) to the reconstruction of whole-genome haplotypes. We describe the statistical and computational challenges arising from these technical artifacts, and we review existing approaches, including available software, for their solution. Finally, we discuss open problems, and highlight successful biomedical applications and potential future clinical use of NGS to estimate viral diversity.
format Online
Article
Text
id pubmed-3438994
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-34389942012-09-12 Challenges and opportunities in estimating viral genetic diversity from next-generation sequencing data Beerenwinkel, Niko Günthard, Huldrych F. Roth, Volker Metzner, Karin J. Front Microbiol Microbiology Many viruses, including the clinically relevant RNA viruses HIV (human immunodeficiency virus) and HCV (hepatitis C virus), exist in large populations and display high genetic heterogeneity within and between infected hosts. Assessing intra-patient viral genetic diversity is essential for understanding the evolutionary dynamics of viruses, for designing effective vaccines, and for the success of antiviral therapy. Next-generation sequencing (NGS) technologies allow the rapid and cost-effective acquisition of thousands to millions of short DNA sequences from a single sample. However, this approach entails several challenges in experimental design and computational data analysis. Here, we review the entire process of inferring viral diversity from sample collection to computing measures of genetic diversity. We discuss sample preparation, including reverse transcription and amplification, and the effect of experimental conditions on diversity estimates due to in vitro base substitutions, insertions, deletions, and recombination. The use of different NGS platforms and their sequencing error profiles are compared in the context of various applications of diversity estimation, ranging from the detection of single nucleotide variants (SNVs) to the reconstruction of whole-genome haplotypes. We describe the statistical and computational challenges arising from these technical artifacts, and we review existing approaches, including available software, for their solution. Finally, we discuss open problems, and highlight successful biomedical applications and potential future clinical use of NGS to estimate viral diversity. Frontiers Media S.A. 2012-09-11 /pmc/articles/PMC3438994/ /pubmed/22973268 http://dx.doi.org/10.3389/fmicb.2012.00329 Text en Copyright © 2012 Beerenwinkel, Günthard, Roth and Metzner. http://www.frontiersin.org/licenseagreement This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.
spellingShingle Microbiology
Beerenwinkel, Niko
Günthard, Huldrych F.
Roth, Volker
Metzner, Karin J.
Challenges and opportunities in estimating viral genetic diversity from next-generation sequencing data
title Challenges and opportunities in estimating viral genetic diversity from next-generation sequencing data
title_full Challenges and opportunities in estimating viral genetic diversity from next-generation sequencing data
title_fullStr Challenges and opportunities in estimating viral genetic diversity from next-generation sequencing data
title_full_unstemmed Challenges and opportunities in estimating viral genetic diversity from next-generation sequencing data
title_short Challenges and opportunities in estimating viral genetic diversity from next-generation sequencing data
title_sort challenges and opportunities in estimating viral genetic diversity from next-generation sequencing data
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3438994/
https://www.ncbi.nlm.nih.gov/pubmed/22973268
http://dx.doi.org/10.3389/fmicb.2012.00329
work_keys_str_mv AT beerenwinkelniko challengesandopportunitiesinestimatingviralgeneticdiversityfromnextgenerationsequencingdata
AT gunthardhuldrychf challengesandopportunitiesinestimatingviralgeneticdiversityfromnextgenerationsequencingdata
AT rothvolker challengesandopportunitiesinestimatingviralgeneticdiversityfromnextgenerationsequencingdata
AT metznerkarinj challengesandopportunitiesinestimatingviralgeneticdiversityfromnextgenerationsequencingdata