Cargando…

Combination of long- and short-read sequencing fully resolves complex repeats of herpes simplex virus 2 strain MS complete genome

Herpes simplex virus serotype 2 (HSV-2) is a ubiquitous human pathogen that causes recurrent genital infections and ulcerations. Many HSV-2 strains with different biological properties have been identified, but only the genomes of HSV-2 strains HG52, SD90e and 333 have been reported as complete and...

Descripción completa

Detalles Bibliográficos
Autores principales: López-Muñoz, Alberto Domingo, Rastrojo, Alberto, Kropp, Kai A., Viejo-Borbolla, Abel, Alcamí, Antonio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Microbiology Society 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8461477/
https://www.ncbi.nlm.nih.gov/pubmed/34170814
http://dx.doi.org/10.1099/mgen.0.000586
_version_ 1784571993943703552
author López-Muñoz, Alberto Domingo
Rastrojo, Alberto
Kropp, Kai A.
Viejo-Borbolla, Abel
Alcamí, Antonio
author_facet López-Muñoz, Alberto Domingo
Rastrojo, Alberto
Kropp, Kai A.
Viejo-Borbolla, Abel
Alcamí, Antonio
author_sort López-Muñoz, Alberto Domingo
collection PubMed
description Herpes simplex virus serotype 2 (HSV-2) is a ubiquitous human pathogen that causes recurrent genital infections and ulcerations. Many HSV-2 strains with different biological properties have been identified, but only the genomes of HSV-2 strains HG52, SD90e and 333 have been reported as complete and fully characterized sequences. We de novo assembled, annotated and manually curated the complete genome sequence of HSV-2 strain MS, a highly neurovirulent strain, originally isolated from a multiple sclerosis patient. We resolved both DNA ends, as well as the complex inverted repeats regions present in HSV genomes, usually undisclosed in previous published partial herpesvirus genomes, using long reads from Pacific Biosciences (PacBio) technology. Additionally, we identified isomeric genomes by determining the alternative relative orientation of unique fragments in the genome of the sequenced viral population. Illumina short-read sequencing was crucial to examine genetic variability, such as nucleotide polymorphisms, insertion/deletions and sequence determinants of strain-specific virulence factors. We used Illumina data to fix two disrupted open reading frames found in coding homopolymers after PacBio assembly. These results support the combination of long- and short-read sequencing technologies as a precise and effective approach for the accurate de novo assembly and curation of complex microbial genomes.
format Online
Article
Text
id pubmed-8461477
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Microbiology Society
record_format MEDLINE/PubMed
spelling pubmed-84614772021-09-24 Combination of long- and short-read sequencing fully resolves complex repeats of herpes simplex virus 2 strain MS complete genome López-Muñoz, Alberto Domingo Rastrojo, Alberto Kropp, Kai A. Viejo-Borbolla, Abel Alcamí, Antonio Microb Genom Research Articles Herpes simplex virus serotype 2 (HSV-2) is a ubiquitous human pathogen that causes recurrent genital infections and ulcerations. Many HSV-2 strains with different biological properties have been identified, but only the genomes of HSV-2 strains HG52, SD90e and 333 have been reported as complete and fully characterized sequences. We de novo assembled, annotated and manually curated the complete genome sequence of HSV-2 strain MS, a highly neurovirulent strain, originally isolated from a multiple sclerosis patient. We resolved both DNA ends, as well as the complex inverted repeats regions present in HSV genomes, usually undisclosed in previous published partial herpesvirus genomes, using long reads from Pacific Biosciences (PacBio) technology. Additionally, we identified isomeric genomes by determining the alternative relative orientation of unique fragments in the genome of the sequenced viral population. Illumina short-read sequencing was crucial to examine genetic variability, such as nucleotide polymorphisms, insertion/deletions and sequence determinants of strain-specific virulence factors. We used Illumina data to fix two disrupted open reading frames found in coding homopolymers after PacBio assembly. These results support the combination of long- and short-read sequencing technologies as a precise and effective approach for the accurate de novo assembly and curation of complex microbial genomes. Microbiology Society 2021-06-25 /pmc/articles/PMC8461477/ /pubmed/34170814 http://dx.doi.org/10.1099/mgen.0.000586 Text en © 2021 The Authors https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License. This article was made open access via a Publish and Read agreement between the Microbiology Society and the corresponding author’s institution.
spellingShingle Research Articles
López-Muñoz, Alberto Domingo
Rastrojo, Alberto
Kropp, Kai A.
Viejo-Borbolla, Abel
Alcamí, Antonio
Combination of long- and short-read sequencing fully resolves complex repeats of herpes simplex virus 2 strain MS complete genome
title Combination of long- and short-read sequencing fully resolves complex repeats of herpes simplex virus 2 strain MS complete genome
title_full Combination of long- and short-read sequencing fully resolves complex repeats of herpes simplex virus 2 strain MS complete genome
title_fullStr Combination of long- and short-read sequencing fully resolves complex repeats of herpes simplex virus 2 strain MS complete genome
title_full_unstemmed Combination of long- and short-read sequencing fully resolves complex repeats of herpes simplex virus 2 strain MS complete genome
title_short Combination of long- and short-read sequencing fully resolves complex repeats of herpes simplex virus 2 strain MS complete genome
title_sort combination of long- and short-read sequencing fully resolves complex repeats of herpes simplex virus 2 strain ms complete genome
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8461477/
https://www.ncbi.nlm.nih.gov/pubmed/34170814
http://dx.doi.org/10.1099/mgen.0.000586
work_keys_str_mv AT lopezmunozalbertodomingo combinationoflongandshortreadsequencingfullyresolvescomplexrepeatsofherpessimplexvirus2strainmscompletegenome
AT rastrojoalberto combinationoflongandshortreadsequencingfullyresolvescomplexrepeatsofherpessimplexvirus2strainmscompletegenome
AT kroppkaia combinationoflongandshortreadsequencingfullyresolvescomplexrepeatsofherpessimplexvirus2strainmscompletegenome
AT viejoborbollaabel combinationoflongandshortreadsequencingfullyresolvescomplexrepeatsofherpessimplexvirus2strainmscompletegenome
AT alcamiantonio combinationoflongandshortreadsequencingfullyresolvescomplexrepeatsofherpessimplexvirus2strainmscompletegenome