Cargando…

Synteny Identifies Reliable Orthologs for Phylogenomics and Comparative Genomics of the Brassicaceae

Large genomic data sets are becoming the new normal in phylogenetic research, but the identification of true orthologous genes and the exclusion of problematic paralogs is still challenging when applying commonly used sequencing methods such as target enrichment. Here, we compared conventional ortho...

Descripción completa

Detalles Bibliográficos
Autores principales: Walden, Nora, Schranz, Michael Eric
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10016055/
https://www.ncbi.nlm.nih.gov/pubmed/36848527
http://dx.doi.org/10.1093/gbe/evad034
_version_ 1784907327441207296
author Walden, Nora
Schranz, Michael Eric
author_facet Walden, Nora
Schranz, Michael Eric
author_sort Walden, Nora
collection PubMed
description Large genomic data sets are becoming the new normal in phylogenetic research, but the identification of true orthologous genes and the exclusion of problematic paralogs is still challenging when applying commonly used sequencing methods such as target enrichment. Here, we compared conventional ortholog detection using OrthoFinder with ortholog detection through genomic synteny in a data set of 11 representative diploid Brassicaceae whole-genome sequences spanning the entire phylogenetic space. Then, we evaluated the resulting gene sets regarding gene number, functional annotation, and gene and species tree resolution. Finally, we used the syntenic gene sets for comparative genomics and ancestral genome analysis. The use of synteny resulted in considerably more orthologs and also allowed us to reliably identify paralogs. Surprisingly, we did not detect notable differences between species trees reconstructed from syntenic orthologs when compared with other gene sets, including the Angiosperms353 set and a Brassicaceae-specific target enrichment gene set. However, the synteny data set comprised a multitude of gene functions, strongly suggesting that this method of marker selection for phylogenomics is suitable for studies that value downstream gene function analysis, gene interaction, and network studies. Finally, we present the first ancestral genome reconstruction for the Core Brassicaceae which predating the Brassicaceae lineage diversification ∼25 million years ago.
format Online
Article
Text
id pubmed-10016055
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-100160552023-03-16 Synteny Identifies Reliable Orthologs for Phylogenomics and Comparative Genomics of the Brassicaceae Walden, Nora Schranz, Michael Eric Genome Biol Evol Article Large genomic data sets are becoming the new normal in phylogenetic research, but the identification of true orthologous genes and the exclusion of problematic paralogs is still challenging when applying commonly used sequencing methods such as target enrichment. Here, we compared conventional ortholog detection using OrthoFinder with ortholog detection through genomic synteny in a data set of 11 representative diploid Brassicaceae whole-genome sequences spanning the entire phylogenetic space. Then, we evaluated the resulting gene sets regarding gene number, functional annotation, and gene and species tree resolution. Finally, we used the syntenic gene sets for comparative genomics and ancestral genome analysis. The use of synteny resulted in considerably more orthologs and also allowed us to reliably identify paralogs. Surprisingly, we did not detect notable differences between species trees reconstructed from syntenic orthologs when compared with other gene sets, including the Angiosperms353 set and a Brassicaceae-specific target enrichment gene set. However, the synteny data set comprised a multitude of gene functions, strongly suggesting that this method of marker selection for phylogenomics is suitable for studies that value downstream gene function analysis, gene interaction, and network studies. Finally, we present the first ancestral genome reconstruction for the Core Brassicaceae which predating the Brassicaceae lineage diversification ∼25 million years ago. Oxford University Press 2023-02-28 /pmc/articles/PMC10016055/ /pubmed/36848527 http://dx.doi.org/10.1093/gbe/evad034 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Article
Walden, Nora
Schranz, Michael Eric
Synteny Identifies Reliable Orthologs for Phylogenomics and Comparative Genomics of the Brassicaceae
title Synteny Identifies Reliable Orthologs for Phylogenomics and Comparative Genomics of the Brassicaceae
title_full Synteny Identifies Reliable Orthologs for Phylogenomics and Comparative Genomics of the Brassicaceae
title_fullStr Synteny Identifies Reliable Orthologs for Phylogenomics and Comparative Genomics of the Brassicaceae
title_full_unstemmed Synteny Identifies Reliable Orthologs for Phylogenomics and Comparative Genomics of the Brassicaceae
title_short Synteny Identifies Reliable Orthologs for Phylogenomics and Comparative Genomics of the Brassicaceae
title_sort synteny identifies reliable orthologs for phylogenomics and comparative genomics of the brassicaceae
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10016055/
https://www.ncbi.nlm.nih.gov/pubmed/36848527
http://dx.doi.org/10.1093/gbe/evad034
work_keys_str_mv AT waldennora syntenyidentifiesreliableorthologsforphylogenomicsandcomparativegenomicsofthebrassicaceae
AT schranzmichaeleric syntenyidentifiesreliableorthologsforphylogenomicsandcomparativegenomicsofthebrassicaceae