Cargando…
ZWA: Viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach
Viral metagenomics, also known as virome studies, have yielded an unprecedented number of novel sequences, essential in recognizing and characterizing the etiological agent and the origin of emerging infectious diseases. Several tools and pipelines have been developed, to date, for the identificatio...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8376068/ https://www.ncbi.nlm.nih.gov/pubmed/34370725 http://dx.doi.org/10.1371/journal.pcbi.1009304 |
_version_ | 1783740432353591296 |
---|---|
author | Dovrolis, Nikolas Kassela, Katerina Konstantinidis, Konstantinos Kouvela, Adamantia Veletza, Stavroula Karakasiliotis, Ioannis |
author_facet | Dovrolis, Nikolas Kassela, Katerina Konstantinidis, Konstantinos Kouvela, Adamantia Veletza, Stavroula Karakasiliotis, Ioannis |
author_sort | Dovrolis, Nikolas |
collection | PubMed |
description | Viral metagenomics, also known as virome studies, have yielded an unprecedented number of novel sequences, essential in recognizing and characterizing the etiological agent and the origin of emerging infectious diseases. Several tools and pipelines have been developed, to date, for the identification and assembly of viral genomes. Assembly pipelines often result in viral genomes contaminated with host genetic material, some of which are currently deposited into public databases. In the current report, we present a group of deposited sequences that encompass ribosomal RNA (rRNA) contamination. We highlight the detrimental role of chimeric next generation sequencing reads, between host rRNA sequences and viral sequences, in virus genome assembly and we present the hindrances these reads may pose to current methodologies. We have further developed a refining pipeline, the Zero Waste Algorithm (ZWA) that assists in the assembly of low abundance viral genomes. ZWA performs context-depended trimming of chimeric reads, precisely removing their rRNA moiety. These, otherwise discarded, reads were fed to the assembly pipeline and assisted in the construction of larger and cleaner contigs making a substantial impact on current assembly methodologies. ZWA pipeline may significantly enhance virus genome assembly from low abundance samples and virus metagenomics approaches in which a small number of reads determine genome quality and integrity. |
format | Online Article Text |
id | pubmed-8376068 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-83760682021-08-20 ZWA: Viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach Dovrolis, Nikolas Kassela, Katerina Konstantinidis, Konstantinos Kouvela, Adamantia Veletza, Stavroula Karakasiliotis, Ioannis PLoS Comput Biol Research Article Viral metagenomics, also known as virome studies, have yielded an unprecedented number of novel sequences, essential in recognizing and characterizing the etiological agent and the origin of emerging infectious diseases. Several tools and pipelines have been developed, to date, for the identification and assembly of viral genomes. Assembly pipelines often result in viral genomes contaminated with host genetic material, some of which are currently deposited into public databases. In the current report, we present a group of deposited sequences that encompass ribosomal RNA (rRNA) contamination. We highlight the detrimental role of chimeric next generation sequencing reads, between host rRNA sequences and viral sequences, in virus genome assembly and we present the hindrances these reads may pose to current methodologies. We have further developed a refining pipeline, the Zero Waste Algorithm (ZWA) that assists in the assembly of low abundance viral genomes. ZWA performs context-depended trimming of chimeric reads, precisely removing their rRNA moiety. These, otherwise discarded, reads were fed to the assembly pipeline and assisted in the construction of larger and cleaner contigs making a substantial impact on current assembly methodologies. ZWA pipeline may significantly enhance virus genome assembly from low abundance samples and virus metagenomics approaches in which a small number of reads determine genome quality and integrity. Public Library of Science 2021-08-09 /pmc/articles/PMC8376068/ /pubmed/34370725 http://dx.doi.org/10.1371/journal.pcbi.1009304 Text en © 2021 Dovrolis et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Dovrolis, Nikolas Kassela, Katerina Konstantinidis, Konstantinos Kouvela, Adamantia Veletza, Stavroula Karakasiliotis, Ioannis ZWA: Viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach |
title | ZWA: Viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach |
title_full | ZWA: Viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach |
title_fullStr | ZWA: Viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach |
title_full_unstemmed | ZWA: Viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach |
title_short | ZWA: Viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach |
title_sort | zwa: viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8376068/ https://www.ncbi.nlm.nih.gov/pubmed/34370725 http://dx.doi.org/10.1371/journal.pcbi.1009304 |
work_keys_str_mv | AT dovrolisnikolas zwaviralgenomeassemblyandcharacterizationhindrancesfromvirushostchimericreadsarefiningapproach AT kasselakaterina zwaviralgenomeassemblyandcharacterizationhindrancesfromvirushostchimericreadsarefiningapproach AT konstantinidiskonstantinos zwaviralgenomeassemblyandcharacterizationhindrancesfromvirushostchimericreadsarefiningapproach AT kouvelaadamantia zwaviralgenomeassemblyandcharacterizationhindrancesfromvirushostchimericreadsarefiningapproach AT veletzastavroula zwaviralgenomeassemblyandcharacterizationhindrancesfromvirushostchimericreadsarefiningapproach AT karakasiliotisioannis zwaviralgenomeassemblyandcharacterizationhindrancesfromvirushostchimericreadsarefiningapproach |