Cargando…

Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing

High-throughput sequencing (HTS) has resulted in data for a number of herpes simplex virus (HSV) laboratory strains and clinical isolates. The knowledge of these sequences has been critical for investigating viral pathogenicity. However, the assembly of complete herpesviral genomes, including HSV, i...

Descripción completa

Detalles Bibliográficos
Autores principales: Morse, Alison M., Calabro, Kaitlyn R., Fear, Justin M., Bloom, David C., McIntyre, Lauren M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5580483/
https://www.ncbi.nlm.nih.gov/pubmed/28812996
http://dx.doi.org/10.3390/v9080226
_version_ 1783260909520551936
author Morse, Alison M.
Calabro, Kaitlyn R.
Fear, Justin M.
Bloom, David C.
McIntyre, Lauren M.
author_facet Morse, Alison M.
Calabro, Kaitlyn R.
Fear, Justin M.
Bloom, David C.
McIntyre, Lauren M.
author_sort Morse, Alison M.
collection PubMed
description High-throughput sequencing (HTS) has resulted in data for a number of herpes simplex virus (HSV) laboratory strains and clinical isolates. The knowledge of these sequences has been critical for investigating viral pathogenicity. However, the assembly of complete herpesviral genomes, including HSV, is complicated due to the existence of large repeat regions and arrays of smaller reiterated sequences that are commonly found in these genomes. In addition, the inherent genetic variation in populations of isolates for viruses and other microorganisms presents an additional challenge to many existing HTS sequence assembly pipelines. Here, we evaluate two approaches for the identification of genetic variants in HSV1 strains using Illumina short read sequencing data. The first, a reference-based approach, identifies variants from reads aligned to a reference sequence and the second, a de novo assembly approach, identifies variants from reads aligned to de novo assembled consensus sequences. Of critical importance for both approaches is the reduction in the number of low complexity regions through the construction of a non-redundant reference genome. We compared variants identified in the two methods. Our results indicate that approximately 85% of variants are identified regardless of the approach. The reference-based approach to variant discovery captures an additional 15% representing variants divergent from the HSV1 reference possibly due to viral passage. Reference-based approaches are significantly less labor-intensive and identify variants across the genome where de novo assembly-based approaches are limited to regions where contigs have been successfully assembled. In addition, regions of poor quality assembly can lead to false variant identification in de novo consensus sequences. For viruses with a well-assembled reference genome, a reference-based approach is recommended.
format Online
Article
Text
id pubmed-5580483
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-55804832017-09-06 Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing Morse, Alison M. Calabro, Kaitlyn R. Fear, Justin M. Bloom, David C. McIntyre, Lauren M. Viruses Article High-throughput sequencing (HTS) has resulted in data for a number of herpes simplex virus (HSV) laboratory strains and clinical isolates. The knowledge of these sequences has been critical for investigating viral pathogenicity. However, the assembly of complete herpesviral genomes, including HSV, is complicated due to the existence of large repeat regions and arrays of smaller reiterated sequences that are commonly found in these genomes. In addition, the inherent genetic variation in populations of isolates for viruses and other microorganisms presents an additional challenge to many existing HTS sequence assembly pipelines. Here, we evaluate two approaches for the identification of genetic variants in HSV1 strains using Illumina short read sequencing data. The first, a reference-based approach, identifies variants from reads aligned to a reference sequence and the second, a de novo assembly approach, identifies variants from reads aligned to de novo assembled consensus sequences. Of critical importance for both approaches is the reduction in the number of low complexity regions through the construction of a non-redundant reference genome. We compared variants identified in the two methods. Our results indicate that approximately 85% of variants are identified regardless of the approach. The reference-based approach to variant discovery captures an additional 15% representing variants divergent from the HSV1 reference possibly due to viral passage. Reference-based approaches are significantly less labor-intensive and identify variants across the genome where de novo assembly-based approaches are limited to regions where contigs have been successfully assembled. In addition, regions of poor quality assembly can lead to false variant identification in de novo consensus sequences. For viruses with a well-assembled reference genome, a reference-based approach is recommended. MDPI 2017-08-16 /pmc/articles/PMC5580483/ /pubmed/28812996 http://dx.doi.org/10.3390/v9080226 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Morse, Alison M.
Calabro, Kaitlyn R.
Fear, Justin M.
Bloom, David C.
McIntyre, Lauren M.
Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing
title Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing
title_full Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing
title_fullStr Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing
title_full_unstemmed Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing
title_short Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing
title_sort reliable detection of herpes simplex virus sequence variation by high-throughput resequencing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5580483/
https://www.ncbi.nlm.nih.gov/pubmed/28812996
http://dx.doi.org/10.3390/v9080226
work_keys_str_mv AT morsealisonm reliabledetectionofherpessimplexvirussequencevariationbyhighthroughputresequencing
AT calabrokaitlynr reliabledetectionofherpessimplexvirussequencevariationbyhighthroughputresequencing
AT fearjustinm reliabledetectionofherpessimplexvirussequencevariationbyhighthroughputresequencing
AT bloomdavidc reliabledetectionofherpessimplexvirussequencevariationbyhighthroughputresequencing
AT mcintyrelaurenm reliabledetectionofherpessimplexvirussequencevariationbyhighthroughputresequencing