Cargando…

Unraveling gene content variation across eukaryotic giant viruses based on network analyses and host associations

The nucleocytoplasmic large DNA viruses (NCLDVs, phylum Nucleocytoviricota) infect vertebrates, invertebrates, algae, amoebae, and other unicellular organisms across supergroups of eukaryotes and in various ecosystems. The expanding collection of their genome sequences has revolutionized our view of...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Tsu-Wang, Ku, Chuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8570155/
https://www.ncbi.nlm.nih.gov/pubmed/34754514
http://dx.doi.org/10.1093/ve/veab081
_version_ 1784594784621428736
author Sun, Tsu-Wang
Ku, Chuan
author_facet Sun, Tsu-Wang
Ku, Chuan
author_sort Sun, Tsu-Wang
collection PubMed
description The nucleocytoplasmic large DNA viruses (NCLDVs, phylum Nucleocytoviricota) infect vertebrates, invertebrates, algae, amoebae, and other unicellular organisms across supergroups of eukaryotes and in various ecosystems. The expanding collection of their genome sequences has revolutionized our view of virus genome size and coding capacity. Phylogenetic trees based on a few core genes are commonly used as a model to understand their evolution. However, the tree topology can differ between analyses, and the vast majority of encoded genes might not share a common evolutionary history. To explore the whole-genome variation and evolution of NCLDVs, we dissected their gene contents using clustering, network, and comparative analyses. Our updated core-gene tree served as a framework to classify NCLDVs into families and intrafamilial lineages, but networks of individual genomes and family pangenomes showed patterns of gene sharing that contradict with the tree topology, in particular at higher taxonomic levels. Clustering of NCLDV genomes revealed variable granularity and degrees of gene sharing within each family, which cannot be inferred from the tree. At the level of NCLDV families, a correlation exists between gene content variation, but not core-gene sequence divergence, and host supergroup diversity. In addition, there is significantly higher gene sharing between divergent viruses that infect similar host types. The identified shared genes would be a useful resource for further functional analyses of NCLDV–host interactions. Overall this study provides a comprehensive view of gene repertoire variation in NCLDVs at different taxonomic levels, as well as a novel approach to studying the extremely diverse giant virus genomes.
format Online
Article
Text
id pubmed-8570155
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-85701552021-11-08 Unraveling gene content variation across eukaryotic giant viruses based on network analyses and host associations Sun, Tsu-Wang Ku, Chuan Virus Evol Research Article The nucleocytoplasmic large DNA viruses (NCLDVs, phylum Nucleocytoviricota) infect vertebrates, invertebrates, algae, amoebae, and other unicellular organisms across supergroups of eukaryotes and in various ecosystems. The expanding collection of their genome sequences has revolutionized our view of virus genome size and coding capacity. Phylogenetic trees based on a few core genes are commonly used as a model to understand their evolution. However, the tree topology can differ between analyses, and the vast majority of encoded genes might not share a common evolutionary history. To explore the whole-genome variation and evolution of NCLDVs, we dissected their gene contents using clustering, network, and comparative analyses. Our updated core-gene tree served as a framework to classify NCLDVs into families and intrafamilial lineages, but networks of individual genomes and family pangenomes showed patterns of gene sharing that contradict with the tree topology, in particular at higher taxonomic levels. Clustering of NCLDV genomes revealed variable granularity and degrees of gene sharing within each family, which cannot be inferred from the tree. At the level of NCLDV families, a correlation exists between gene content variation, but not core-gene sequence divergence, and host supergroup diversity. In addition, there is significantly higher gene sharing between divergent viruses that infect similar host types. The identified shared genes would be a useful resource for further functional analyses of NCLDV–host interactions. Overall this study provides a comprehensive view of gene repertoire variation in NCLDVs at different taxonomic levels, as well as a novel approach to studying the extremely diverse giant virus genomes. Oxford University Press 2021-09-16 /pmc/articles/PMC8570155/ /pubmed/34754514 http://dx.doi.org/10.1093/ve/veab081 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Sun, Tsu-Wang
Ku, Chuan
Unraveling gene content variation across eukaryotic giant viruses based on network analyses and host associations
title Unraveling gene content variation across eukaryotic giant viruses based on network analyses and host associations
title_full Unraveling gene content variation across eukaryotic giant viruses based on network analyses and host associations
title_fullStr Unraveling gene content variation across eukaryotic giant viruses based on network analyses and host associations
title_full_unstemmed Unraveling gene content variation across eukaryotic giant viruses based on network analyses and host associations
title_short Unraveling gene content variation across eukaryotic giant viruses based on network analyses and host associations
title_sort unraveling gene content variation across eukaryotic giant viruses based on network analyses and host associations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8570155/
https://www.ncbi.nlm.nih.gov/pubmed/34754514
http://dx.doi.org/10.1093/ve/veab081
work_keys_str_mv AT suntsuwang unravelinggenecontentvariationacrosseukaryoticgiantvirusesbasedonnetworkanalysesandhostassociations
AT kuchuan unravelinggenecontentvariationacrosseukaryoticgiantvirusesbasedonnetworkanalysesandhostassociations