Cargando…

Vast diversity of prokaryotic virus genomes encoding double jelly-roll major capsid proteins uncovered by genomic and metagenomic sequence analysis

BACKGROUND: Analysis of metagenomic sequences has become the principal approach for the study of the diversity of viruses. Many recent, extensive metagenomic studies on several classes of viruses have dramatically expanded the visible part of the virosphere, showing that previously undetected viruse...

Descripción completa

Detalles Bibliográficos
Autores principales: Yutin, Natalya, Bäckström, Disa, Ettema, Thijs J. G., Krupovic, Mart, Koonin, Eugene V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5894146/
https://www.ncbi.nlm.nih.gov/pubmed/29636073
http://dx.doi.org/10.1186/s12985-018-0974-y
_version_ 1783313440112115712
author Yutin, Natalya
Bäckström, Disa
Ettema, Thijs J. G.
Krupovic, Mart
Koonin, Eugene V.
author_facet Yutin, Natalya
Bäckström, Disa
Ettema, Thijs J. G.
Krupovic, Mart
Koonin, Eugene V.
author_sort Yutin, Natalya
collection PubMed
description BACKGROUND: Analysis of metagenomic sequences has become the principal approach for the study of the diversity of viruses. Many recent, extensive metagenomic studies on several classes of viruses have dramatically expanded the visible part of the virosphere, showing that previously undetected viruses, or those that have been considered rare, actually are important components of the global virome. RESULTS: We investigated the provenance of viruses related to tail-less bacteriophages of the family Tectiviridae by searching genomic and metagenomics sequence databases for distant homologs of the tectivirus-like Double Jelly-Roll major capsid proteins (DJR MCP). These searches resulted in the identification of numerous genomes of virus-like elements that are similar in size to tectiviruses (10–15 kilobases) and have diverse gene compositions. By comparison of the gene repertoires, the DJR MCP-encoding genomes were classified into 6 distinct groups that can be predicted to differ in reproduction strategies and host ranges. Only the DJR MCP gene that is present by design is shared by all these genomes, and most also encode a predicted DNA-packaging ATPase; the rest of the genes are present only in subgroups of this unexpectedly diverse collection of DJR MCP-encoding genomes. Only a minority encode a DNA polymerase which is a hallmark of the family Tectiviridae and the putative family "Autolykiviridae". Notably, one of the identified putative DJR MCP viruses encodes a homolog of Cas1 endonuclease, the integrase involved in CRISPR-Cas adaptation and integration of transposon-like elements called casposons. This is the first detected occurrence of Cas1 in a virus. Many of the identified elements are individual contigs flanked by inverted or direct repeats and appear to represent complete, extrachromosomal viral genomes, whereas others are flanked by bacterial genes and thus can be considered as proviruses. These contigs come from metagenomes of widely different environments, some dominated by archaea and others by bacteria, suggesting that collectively, the DJR MCP-encoding elements have a broad host range among prokaryotes. CONCLUSIONS: The findings reported here greatly expand the known host range of (putative) viruses of bacteria and archaea that encode a DJR MCP. They also demonstrate the extreme diversity of genome architectures in these viruses that encode no universal proteins other than the capsid protein that was used as the marker for their identification. From a supposedly minor group of bacterial and archaeal viruses, these viruses are emerging as a substantial component of the prokaryotic virome. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12985-018-0974-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5894146
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-58941462018-04-12 Vast diversity of prokaryotic virus genomes encoding double jelly-roll major capsid proteins uncovered by genomic and metagenomic sequence analysis Yutin, Natalya Bäckström, Disa Ettema, Thijs J. G. Krupovic, Mart Koonin, Eugene V. Virol J Research BACKGROUND: Analysis of metagenomic sequences has become the principal approach for the study of the diversity of viruses. Many recent, extensive metagenomic studies on several classes of viruses have dramatically expanded the visible part of the virosphere, showing that previously undetected viruses, or those that have been considered rare, actually are important components of the global virome. RESULTS: We investigated the provenance of viruses related to tail-less bacteriophages of the family Tectiviridae by searching genomic and metagenomics sequence databases for distant homologs of the tectivirus-like Double Jelly-Roll major capsid proteins (DJR MCP). These searches resulted in the identification of numerous genomes of virus-like elements that are similar in size to tectiviruses (10–15 kilobases) and have diverse gene compositions. By comparison of the gene repertoires, the DJR MCP-encoding genomes were classified into 6 distinct groups that can be predicted to differ in reproduction strategies and host ranges. Only the DJR MCP gene that is present by design is shared by all these genomes, and most also encode a predicted DNA-packaging ATPase; the rest of the genes are present only in subgroups of this unexpectedly diverse collection of DJR MCP-encoding genomes. Only a minority encode a DNA polymerase which is a hallmark of the family Tectiviridae and the putative family "Autolykiviridae". Notably, one of the identified putative DJR MCP viruses encodes a homolog of Cas1 endonuclease, the integrase involved in CRISPR-Cas adaptation and integration of transposon-like elements called casposons. This is the first detected occurrence of Cas1 in a virus. Many of the identified elements are individual contigs flanked by inverted or direct repeats and appear to represent complete, extrachromosomal viral genomes, whereas others are flanked by bacterial genes and thus can be considered as proviruses. These contigs come from metagenomes of widely different environments, some dominated by archaea and others by bacteria, suggesting that collectively, the DJR MCP-encoding elements have a broad host range among prokaryotes. CONCLUSIONS: The findings reported here greatly expand the known host range of (putative) viruses of bacteria and archaea that encode a DJR MCP. They also demonstrate the extreme diversity of genome architectures in these viruses that encode no universal proteins other than the capsid protein that was used as the marker for their identification. From a supposedly minor group of bacterial and archaeal viruses, these viruses are emerging as a substantial component of the prokaryotic virome. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12985-018-0974-y) contains supplementary material, which is available to authorized users. BioMed Central 2018-04-10 /pmc/articles/PMC5894146/ /pubmed/29636073 http://dx.doi.org/10.1186/s12985-018-0974-y Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Yutin, Natalya
Bäckström, Disa
Ettema, Thijs J. G.
Krupovic, Mart
Koonin, Eugene V.
Vast diversity of prokaryotic virus genomes encoding double jelly-roll major capsid proteins uncovered by genomic and metagenomic sequence analysis
title Vast diversity of prokaryotic virus genomes encoding double jelly-roll major capsid proteins uncovered by genomic and metagenomic sequence analysis
title_full Vast diversity of prokaryotic virus genomes encoding double jelly-roll major capsid proteins uncovered by genomic and metagenomic sequence analysis
title_fullStr Vast diversity of prokaryotic virus genomes encoding double jelly-roll major capsid proteins uncovered by genomic and metagenomic sequence analysis
title_full_unstemmed Vast diversity of prokaryotic virus genomes encoding double jelly-roll major capsid proteins uncovered by genomic and metagenomic sequence analysis
title_short Vast diversity of prokaryotic virus genomes encoding double jelly-roll major capsid proteins uncovered by genomic and metagenomic sequence analysis
title_sort vast diversity of prokaryotic virus genomes encoding double jelly-roll major capsid proteins uncovered by genomic and metagenomic sequence analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5894146/
https://www.ncbi.nlm.nih.gov/pubmed/29636073
http://dx.doi.org/10.1186/s12985-018-0974-y
work_keys_str_mv AT yutinnatalya vastdiversityofprokaryoticvirusgenomesencodingdoublejellyrollmajorcapsidproteinsuncoveredbygenomicandmetagenomicsequenceanalysis
AT backstromdisa vastdiversityofprokaryoticvirusgenomesencodingdoublejellyrollmajorcapsidproteinsuncoveredbygenomicandmetagenomicsequenceanalysis
AT ettemathijsjg vastdiversityofprokaryoticvirusgenomesencodingdoublejellyrollmajorcapsidproteinsuncoveredbygenomicandmetagenomicsequenceanalysis
AT krupovicmart vastdiversityofprokaryoticvirusgenomesencodingdoublejellyrollmajorcapsidproteinsuncoveredbygenomicandmetagenomicsequenceanalysis
AT koonineugenev vastdiversityofprokaryoticvirusgenomesencodingdoublejellyrollmajorcapsidproteinsuncoveredbygenomicandmetagenomicsequenceanalysis