Cargando…
Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing
High-throughput sequencing (HTS) has resulted in data for a number of herpes simplex virus (HSV) laboratory strains and clinical isolates. The knowledge of these sequences has been critical for investigating viral pathogenicity. However, the assembly of complete herpesviral genomes, including HSV, i...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5580483/ https://www.ncbi.nlm.nih.gov/pubmed/28812996 http://dx.doi.org/10.3390/v9080226 |
_version_ | 1783260909520551936 |
---|---|
author | Morse, Alison M. Calabro, Kaitlyn R. Fear, Justin M. Bloom, David C. McIntyre, Lauren M. |
author_facet | Morse, Alison M. Calabro, Kaitlyn R. Fear, Justin M. Bloom, David C. McIntyre, Lauren M. |
author_sort | Morse, Alison M. |
collection | PubMed |
description | High-throughput sequencing (HTS) has resulted in data for a number of herpes simplex virus (HSV) laboratory strains and clinical isolates. The knowledge of these sequences has been critical for investigating viral pathogenicity. However, the assembly of complete herpesviral genomes, including HSV, is complicated due to the existence of large repeat regions and arrays of smaller reiterated sequences that are commonly found in these genomes. In addition, the inherent genetic variation in populations of isolates for viruses and other microorganisms presents an additional challenge to many existing HTS sequence assembly pipelines. Here, we evaluate two approaches for the identification of genetic variants in HSV1 strains using Illumina short read sequencing data. The first, a reference-based approach, identifies variants from reads aligned to a reference sequence and the second, a de novo assembly approach, identifies variants from reads aligned to de novo assembled consensus sequences. Of critical importance for both approaches is the reduction in the number of low complexity regions through the construction of a non-redundant reference genome. We compared variants identified in the two methods. Our results indicate that approximately 85% of variants are identified regardless of the approach. The reference-based approach to variant discovery captures an additional 15% representing variants divergent from the HSV1 reference possibly due to viral passage. Reference-based approaches are significantly less labor-intensive and identify variants across the genome where de novo assembly-based approaches are limited to regions where contigs have been successfully assembled. In addition, regions of poor quality assembly can lead to false variant identification in de novo consensus sequences. For viruses with a well-assembled reference genome, a reference-based approach is recommended. |
format | Online Article Text |
id | pubmed-5580483 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-55804832017-09-06 Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing Morse, Alison M. Calabro, Kaitlyn R. Fear, Justin M. Bloom, David C. McIntyre, Lauren M. Viruses Article High-throughput sequencing (HTS) has resulted in data for a number of herpes simplex virus (HSV) laboratory strains and clinical isolates. The knowledge of these sequences has been critical for investigating viral pathogenicity. However, the assembly of complete herpesviral genomes, including HSV, is complicated due to the existence of large repeat regions and arrays of smaller reiterated sequences that are commonly found in these genomes. In addition, the inherent genetic variation in populations of isolates for viruses and other microorganisms presents an additional challenge to many existing HTS sequence assembly pipelines. Here, we evaluate two approaches for the identification of genetic variants in HSV1 strains using Illumina short read sequencing data. The first, a reference-based approach, identifies variants from reads aligned to a reference sequence and the second, a de novo assembly approach, identifies variants from reads aligned to de novo assembled consensus sequences. Of critical importance for both approaches is the reduction in the number of low complexity regions through the construction of a non-redundant reference genome. We compared variants identified in the two methods. Our results indicate that approximately 85% of variants are identified regardless of the approach. The reference-based approach to variant discovery captures an additional 15% representing variants divergent from the HSV1 reference possibly due to viral passage. Reference-based approaches are significantly less labor-intensive and identify variants across the genome where de novo assembly-based approaches are limited to regions where contigs have been successfully assembled. In addition, regions of poor quality assembly can lead to false variant identification in de novo consensus sequences. For viruses with a well-assembled reference genome, a reference-based approach is recommended. MDPI 2017-08-16 /pmc/articles/PMC5580483/ /pubmed/28812996 http://dx.doi.org/10.3390/v9080226 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Morse, Alison M. Calabro, Kaitlyn R. Fear, Justin M. Bloom, David C. McIntyre, Lauren M. Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing |
title | Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing |
title_full | Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing |
title_fullStr | Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing |
title_full_unstemmed | Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing |
title_short | Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing |
title_sort | reliable detection of herpes simplex virus sequence variation by high-throughput resequencing |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5580483/ https://www.ncbi.nlm.nih.gov/pubmed/28812996 http://dx.doi.org/10.3390/v9080226 |
work_keys_str_mv | AT morsealisonm reliabledetectionofherpessimplexvirussequencevariationbyhighthroughputresequencing AT calabrokaitlynr reliabledetectionofherpessimplexvirussequencevariationbyhighthroughputresequencing AT fearjustinm reliabledetectionofherpessimplexvirussequencevariationbyhighthroughputresequencing AT bloomdavidc reliabledetectionofherpessimplexvirussequencevariationbyhighthroughputresequencing AT mcintyrelaurenm reliabledetectionofherpessimplexvirussequencevariationbyhighthroughputresequencing |