Cargando…

ZWA: Viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach

Viral metagenomics, also known as virome studies, have yielded an unprecedented number of novel sequences, essential in recognizing and characterizing the etiological agent and the origin of emerging infectious diseases. Several tools and pipelines have been developed, to date, for the identificatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Dovrolis, Nikolas, Kassela, Katerina, Konstantinidis, Konstantinos, Kouvela, Adamantia, Veletza, Stavroula, Karakasiliotis, Ioannis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8376068/
https://www.ncbi.nlm.nih.gov/pubmed/34370725
http://dx.doi.org/10.1371/journal.pcbi.1009304
_version_ 1783740432353591296
author Dovrolis, Nikolas
Kassela, Katerina
Konstantinidis, Konstantinos
Kouvela, Adamantia
Veletza, Stavroula
Karakasiliotis, Ioannis
author_facet Dovrolis, Nikolas
Kassela, Katerina
Konstantinidis, Konstantinos
Kouvela, Adamantia
Veletza, Stavroula
Karakasiliotis, Ioannis
author_sort Dovrolis, Nikolas
collection PubMed
description Viral metagenomics, also known as virome studies, have yielded an unprecedented number of novel sequences, essential in recognizing and characterizing the etiological agent and the origin of emerging infectious diseases. Several tools and pipelines have been developed, to date, for the identification and assembly of viral genomes. Assembly pipelines often result in viral genomes contaminated with host genetic material, some of which are currently deposited into public databases. In the current report, we present a group of deposited sequences that encompass ribosomal RNA (rRNA) contamination. We highlight the detrimental role of chimeric next generation sequencing reads, between host rRNA sequences and viral sequences, in virus genome assembly and we present the hindrances these reads may pose to current methodologies. We have further developed a refining pipeline, the Zero Waste Algorithm (ZWA) that assists in the assembly of low abundance viral genomes. ZWA performs context-depended trimming of chimeric reads, precisely removing their rRNA moiety. These, otherwise discarded, reads were fed to the assembly pipeline and assisted in the construction of larger and cleaner contigs making a substantial impact on current assembly methodologies. ZWA pipeline may significantly enhance virus genome assembly from low abundance samples and virus metagenomics approaches in which a small number of reads determine genome quality and integrity.
format Online
Article
Text
id pubmed-8376068
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-83760682021-08-20 ZWA: Viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach Dovrolis, Nikolas Kassela, Katerina Konstantinidis, Konstantinos Kouvela, Adamantia Veletza, Stavroula Karakasiliotis, Ioannis PLoS Comput Biol Research Article Viral metagenomics, also known as virome studies, have yielded an unprecedented number of novel sequences, essential in recognizing and characterizing the etiological agent and the origin of emerging infectious diseases. Several tools and pipelines have been developed, to date, for the identification and assembly of viral genomes. Assembly pipelines often result in viral genomes contaminated with host genetic material, some of which are currently deposited into public databases. In the current report, we present a group of deposited sequences that encompass ribosomal RNA (rRNA) contamination. We highlight the detrimental role of chimeric next generation sequencing reads, between host rRNA sequences and viral sequences, in virus genome assembly and we present the hindrances these reads may pose to current methodologies. We have further developed a refining pipeline, the Zero Waste Algorithm (ZWA) that assists in the assembly of low abundance viral genomes. ZWA performs context-depended trimming of chimeric reads, precisely removing their rRNA moiety. These, otherwise discarded, reads were fed to the assembly pipeline and assisted in the construction of larger and cleaner contigs making a substantial impact on current assembly methodologies. ZWA pipeline may significantly enhance virus genome assembly from low abundance samples and virus metagenomics approaches in which a small number of reads determine genome quality and integrity. Public Library of Science 2021-08-09 /pmc/articles/PMC8376068/ /pubmed/34370725 http://dx.doi.org/10.1371/journal.pcbi.1009304 Text en © 2021 Dovrolis et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Dovrolis, Nikolas
Kassela, Katerina
Konstantinidis, Konstantinos
Kouvela, Adamantia
Veletza, Stavroula
Karakasiliotis, Ioannis
ZWA: Viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach
title ZWA: Viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach
title_full ZWA: Viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach
title_fullStr ZWA: Viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach
title_full_unstemmed ZWA: Viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach
title_short ZWA: Viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach
title_sort zwa: viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8376068/
https://www.ncbi.nlm.nih.gov/pubmed/34370725
http://dx.doi.org/10.1371/journal.pcbi.1009304
work_keys_str_mv AT dovrolisnikolas zwaviralgenomeassemblyandcharacterizationhindrancesfromvirushostchimericreadsarefiningapproach
AT kasselakaterina zwaviralgenomeassemblyandcharacterizationhindrancesfromvirushostchimericreadsarefiningapproach
AT konstantinidiskonstantinos zwaviralgenomeassemblyandcharacterizationhindrancesfromvirushostchimericreadsarefiningapproach
AT kouvelaadamantia zwaviralgenomeassemblyandcharacterizationhindrancesfromvirushostchimericreadsarefiningapproach
AT veletzastavroula zwaviralgenomeassemblyandcharacterizationhindrancesfromvirushostchimericreadsarefiningapproach
AT karakasiliotisioannis zwaviralgenomeassemblyandcharacterizationhindrancesfromvirushostchimericreadsarefiningapproach