Cargando…

Alignment-Free Analysis of Whole-Genome Sequences From Symbiodiniaceae Reveals Different Phylogenetic Signals in Distinct Regions

Dinoflagellates of the family Symbiodiniaceae are predominantly essential symbionts of corals and other marine organisms. Recent research reveals extensive genome sequence divergence among Symbiodiniaceae taxa and high phylogenetic diversity hidden behind subtly different cell morphologies. Using an...

Descripción completa

Detalles Bibliográficos
Autores principales: Lo, Rosalyn, Dougan, Katherine E., Chen, Yibi, Shah, Sarah, Bhattacharya, Debashish, Chan, Cheong Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9087856/
https://www.ncbi.nlm.nih.gov/pubmed/35557718
http://dx.doi.org/10.3389/fpls.2022.815714
_version_ 1784704244589264896
author Lo, Rosalyn
Dougan, Katherine E.
Chen, Yibi
Shah, Sarah
Bhattacharya, Debashish
Chan, Cheong Xin
author_facet Lo, Rosalyn
Dougan, Katherine E.
Chen, Yibi
Shah, Sarah
Bhattacharya, Debashish
Chan, Cheong Xin
author_sort Lo, Rosalyn
collection PubMed
description Dinoflagellates of the family Symbiodiniaceae are predominantly essential symbionts of corals and other marine organisms. Recent research reveals extensive genome sequence divergence among Symbiodiniaceae taxa and high phylogenetic diversity hidden behind subtly different cell morphologies. Using an alignment-free phylogenetic approach based on sub-sequences of fixed length k (i.e. k-mers), we assessed the phylogenetic signal among whole-genome sequences from 16 Symbiodiniaceae taxa (including the genera of Symbiodinium, Breviolum, Cladocopium, Durusdinium and Fugacium) and two strains of Polarella glacialis as outgroup. Based on phylogenetic trees inferred from k-mers in distinct genomic regions (i.e. repeat-masked genome sequences, protein-coding sequences, introns and repeats) and in protein sequences, the phylogenetic signal associated with protein-coding DNA and the encoded amino acids is largely consistent with the Symbiodiniaceae phylogeny based on established markers, such as large subunit rRNA. The other genome sequences (introns and repeats) exhibit distinct phylogenetic signals, supporting the expected differential evolutionary pressure acting on these regions. Our analysis of conserved core k-mers revealed the prevalence of conserved k-mers (>95% core 23-mers among all 18 genomes) in annotated repeats and non-genic regions of the genomes. We observed 180 distinct repeat types that are significantly enriched in genomes of the symbiotic versus free-living Symbiodinium taxa, suggesting an enhanced activity of transposable elements linked to the symbiotic lifestyle. We provide evidence that representation of alignment-free phylogenies as dynamic networks enhances the ability to generate new hypotheses about genome evolution in Symbiodiniaceae. These results demonstrate the potential of alignment-free phylogenetic methods as a scalable approach for inferring comprehensive, unbiased whole-genome phylogenies of dinoflagellates and more broadly of microbial eukaryotes.
format Online
Article
Text
id pubmed-9087856
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-90878562022-05-11 Alignment-Free Analysis of Whole-Genome Sequences From Symbiodiniaceae Reveals Different Phylogenetic Signals in Distinct Regions Lo, Rosalyn Dougan, Katherine E. Chen, Yibi Shah, Sarah Bhattacharya, Debashish Chan, Cheong Xin Front Plant Sci Plant Science Dinoflagellates of the family Symbiodiniaceae are predominantly essential symbionts of corals and other marine organisms. Recent research reveals extensive genome sequence divergence among Symbiodiniaceae taxa and high phylogenetic diversity hidden behind subtly different cell morphologies. Using an alignment-free phylogenetic approach based on sub-sequences of fixed length k (i.e. k-mers), we assessed the phylogenetic signal among whole-genome sequences from 16 Symbiodiniaceae taxa (including the genera of Symbiodinium, Breviolum, Cladocopium, Durusdinium and Fugacium) and two strains of Polarella glacialis as outgroup. Based on phylogenetic trees inferred from k-mers in distinct genomic regions (i.e. repeat-masked genome sequences, protein-coding sequences, introns and repeats) and in protein sequences, the phylogenetic signal associated with protein-coding DNA and the encoded amino acids is largely consistent with the Symbiodiniaceae phylogeny based on established markers, such as large subunit rRNA. The other genome sequences (introns and repeats) exhibit distinct phylogenetic signals, supporting the expected differential evolutionary pressure acting on these regions. Our analysis of conserved core k-mers revealed the prevalence of conserved k-mers (>95% core 23-mers among all 18 genomes) in annotated repeats and non-genic regions of the genomes. We observed 180 distinct repeat types that are significantly enriched in genomes of the symbiotic versus free-living Symbiodinium taxa, suggesting an enhanced activity of transposable elements linked to the symbiotic lifestyle. We provide evidence that representation of alignment-free phylogenies as dynamic networks enhances the ability to generate new hypotheses about genome evolution in Symbiodiniaceae. These results demonstrate the potential of alignment-free phylogenetic methods as a scalable approach for inferring comprehensive, unbiased whole-genome phylogenies of dinoflagellates and more broadly of microbial eukaryotes. Frontiers Media S.A. 2022-04-26 /pmc/articles/PMC9087856/ /pubmed/35557718 http://dx.doi.org/10.3389/fpls.2022.815714 Text en Copyright © 2022 Lo, Dougan, Chen, Shah, Bhattacharya and Chan. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Lo, Rosalyn
Dougan, Katherine E.
Chen, Yibi
Shah, Sarah
Bhattacharya, Debashish
Chan, Cheong Xin
Alignment-Free Analysis of Whole-Genome Sequences From Symbiodiniaceae Reveals Different Phylogenetic Signals in Distinct Regions
title Alignment-Free Analysis of Whole-Genome Sequences From Symbiodiniaceae Reveals Different Phylogenetic Signals in Distinct Regions
title_full Alignment-Free Analysis of Whole-Genome Sequences From Symbiodiniaceae Reveals Different Phylogenetic Signals in Distinct Regions
title_fullStr Alignment-Free Analysis of Whole-Genome Sequences From Symbiodiniaceae Reveals Different Phylogenetic Signals in Distinct Regions
title_full_unstemmed Alignment-Free Analysis of Whole-Genome Sequences From Symbiodiniaceae Reveals Different Phylogenetic Signals in Distinct Regions
title_short Alignment-Free Analysis of Whole-Genome Sequences From Symbiodiniaceae Reveals Different Phylogenetic Signals in Distinct Regions
title_sort alignment-free analysis of whole-genome sequences from symbiodiniaceae reveals different phylogenetic signals in distinct regions
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9087856/
https://www.ncbi.nlm.nih.gov/pubmed/35557718
http://dx.doi.org/10.3389/fpls.2022.815714
work_keys_str_mv AT lorosalyn alignmentfreeanalysisofwholegenomesequencesfromsymbiodiniaceaerevealsdifferentphylogeneticsignalsindistinctregions
AT dougankatherinee alignmentfreeanalysisofwholegenomesequencesfromsymbiodiniaceaerevealsdifferentphylogeneticsignalsindistinctregions
AT chenyibi alignmentfreeanalysisofwholegenomesequencesfromsymbiodiniaceaerevealsdifferentphylogeneticsignalsindistinctregions
AT shahsarah alignmentfreeanalysisofwholegenomesequencesfromsymbiodiniaceaerevealsdifferentphylogeneticsignalsindistinctregions
AT bhattacharyadebashish alignmentfreeanalysisofwholegenomesequencesfromsymbiodiniaceaerevealsdifferentphylogeneticsignalsindistinctregions
AT chancheongxin alignmentfreeanalysisofwholegenomesequencesfromsymbiodiniaceaerevealsdifferentphylogeneticsignalsindistinctregions