Cargando…

Virome Assembly and Annotation: A Surprise in the Namib Desert

Sequencing, assembly, and annotation of environmental virome samples is challenging. Methodological biases and differences in species abundance result in fragmentary read coverage; sequence reconstruction is further complicated by the mosaic nature of viral genomes. In this paper, we focus on biocom...

Descripción completa

Detalles Bibliográficos
Autores principales: Hesse, Uljana, van Heusden, Peter, Kirby, Bronwyn M., Olonade, Israel, van Zyl, Leonardo J., Trindade, Marla
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5253355/
https://www.ncbi.nlm.nih.gov/pubmed/28167933
http://dx.doi.org/10.3389/fmicb.2017.00013
_version_ 1782498142230413312
author Hesse, Uljana
van Heusden, Peter
Kirby, Bronwyn M.
Olonade, Israel
van Zyl, Leonardo J.
Trindade, Marla
author_facet Hesse, Uljana
van Heusden, Peter
Kirby, Bronwyn M.
Olonade, Israel
van Zyl, Leonardo J.
Trindade, Marla
author_sort Hesse, Uljana
collection PubMed
description Sequencing, assembly, and annotation of environmental virome samples is challenging. Methodological biases and differences in species abundance result in fragmentary read coverage; sequence reconstruction is further complicated by the mosaic nature of viral genomes. In this paper, we focus on biocomputational aspects of virome analysis, emphasizing latent pitfalls in sequence annotation. Using simulated viromes that mimic environmental data challenges we assessed the performance of five assemblers (CLC-Workbench, IDBA-UD, SPAdes, RayMeta, ABySS). Individual analyses of relevant scaffold length fractions revealed shortcomings of some programs in reconstruction of viral genomes with excessive read coverage (IDBA-UD, RayMeta), and in accurate assembly of scaffolds ≥50 kb (SPAdes, RayMeta, ABySS). The CLC-Workbench assembler performed best in terms of genome recovery (including highly covered genomes) and correct reconstruction of large scaffolds; and was used to assemble a virome from a copper rich site in the Namib Desert. We found that scaffold network analysis and cluster-specific read reassembly improved reconstruction of sequences with excessive read coverage, and that strict data filtering for non-viral sequences prior to downstream analyses was essential. In this study we describe novel viral genomes identified in the Namib Desert copper site virome. Taxonomic affiliations of diverse proteins in the dataset and phylogenetic analyses of circovirus-like proteins indicated links to the marine habitat. Considering additional evidence from this dataset we hypothesize that viruses may have been carried from the Atlantic Ocean into the Namib Desert by fog and wind, highlighting the impact of the extended environment on an investigated niche in metagenome studies.
format Online
Article
Text
id pubmed-5253355
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-52533552017-02-06 Virome Assembly and Annotation: A Surprise in the Namib Desert Hesse, Uljana van Heusden, Peter Kirby, Bronwyn M. Olonade, Israel van Zyl, Leonardo J. Trindade, Marla Front Microbiol Microbiology Sequencing, assembly, and annotation of environmental virome samples is challenging. Methodological biases and differences in species abundance result in fragmentary read coverage; sequence reconstruction is further complicated by the mosaic nature of viral genomes. In this paper, we focus on biocomputational aspects of virome analysis, emphasizing latent pitfalls in sequence annotation. Using simulated viromes that mimic environmental data challenges we assessed the performance of five assemblers (CLC-Workbench, IDBA-UD, SPAdes, RayMeta, ABySS). Individual analyses of relevant scaffold length fractions revealed shortcomings of some programs in reconstruction of viral genomes with excessive read coverage (IDBA-UD, RayMeta), and in accurate assembly of scaffolds ≥50 kb (SPAdes, RayMeta, ABySS). The CLC-Workbench assembler performed best in terms of genome recovery (including highly covered genomes) and correct reconstruction of large scaffolds; and was used to assemble a virome from a copper rich site in the Namib Desert. We found that scaffold network analysis and cluster-specific read reassembly improved reconstruction of sequences with excessive read coverage, and that strict data filtering for non-viral sequences prior to downstream analyses was essential. In this study we describe novel viral genomes identified in the Namib Desert copper site virome. Taxonomic affiliations of diverse proteins in the dataset and phylogenetic analyses of circovirus-like proteins indicated links to the marine habitat. Considering additional evidence from this dataset we hypothesize that viruses may have been carried from the Atlantic Ocean into the Namib Desert by fog and wind, highlighting the impact of the extended environment on an investigated niche in metagenome studies. Frontiers Media S.A. 2017-01-23 /pmc/articles/PMC5253355/ /pubmed/28167933 http://dx.doi.org/10.3389/fmicb.2017.00013 Text en Copyright © 2017 Hesse, van Heusden, Kirby, Olonade, van Zyl and Trindade. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Hesse, Uljana
van Heusden, Peter
Kirby, Bronwyn M.
Olonade, Israel
van Zyl, Leonardo J.
Trindade, Marla
Virome Assembly and Annotation: A Surprise in the Namib Desert
title Virome Assembly and Annotation: A Surprise in the Namib Desert
title_full Virome Assembly and Annotation: A Surprise in the Namib Desert
title_fullStr Virome Assembly and Annotation: A Surprise in the Namib Desert
title_full_unstemmed Virome Assembly and Annotation: A Surprise in the Namib Desert
title_short Virome Assembly and Annotation: A Surprise in the Namib Desert
title_sort virome assembly and annotation: a surprise in the namib desert
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5253355/
https://www.ncbi.nlm.nih.gov/pubmed/28167933
http://dx.doi.org/10.3389/fmicb.2017.00013
work_keys_str_mv AT hesseuljana viromeassemblyandannotationasurpriseinthenamibdesert
AT vanheusdenpeter viromeassemblyandannotationasurpriseinthenamibdesert
AT kirbybronwynm viromeassemblyandannotationasurpriseinthenamibdesert
AT olonadeisrael viromeassemblyandannotationasurpriseinthenamibdesert
AT vanzylleonardoj viromeassemblyandannotationasurpriseinthenamibdesert
AT trindademarla viromeassemblyandannotationasurpriseinthenamibdesert