Cargando…

Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae)

BACKGROUND: With an ever-growing number of published genomes, many low levels of the Tree of Life now contain several species with enough molecular data to perform shallow-scale phylogenomic studies. Moving away from using just a few universal phylogenetic markers, we can now target thousands of oth...

Descripción completa

Detalles Bibliográficos
Autores principales: Debray, Kevin, Marie-Magdelaine, Jordan, Ruttink, Tom, Clotault, Jérémy, Foucher, Fabrice, Malécot, Valéry
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6657147/
https://www.ncbi.nlm.nih.gov/pubmed/31340752
http://dx.doi.org/10.1186/s12862-019-1479-z
_version_ 1783438755218063360
author Debray, Kevin
Marie-Magdelaine, Jordan
Ruttink, Tom
Clotault, Jérémy
Foucher, Fabrice
Malécot, Valéry
author_facet Debray, Kevin
Marie-Magdelaine, Jordan
Ruttink, Tom
Clotault, Jérémy
Foucher, Fabrice
Malécot, Valéry
author_sort Debray, Kevin
collection PubMed
description BACKGROUND: With an ever-growing number of published genomes, many low levels of the Tree of Life now contain several species with enough molecular data to perform shallow-scale phylogenomic studies. Moving away from using just a few universal phylogenetic markers, we can now target thousands of other loci to decipher taxa relationships. Making the best possible selection of informative sequences regarding the taxa studied has emerged as a new issue. Here, we developed a general procedure to mine genomic data, looking for orthologous single-copy loci capable of deciphering phylogenetic relationships below the generic rank. To develop our strategy, we chose the genus Rosa, a rapid-evolving lineage of the Rosaceae family in which several species genomes have recently been sequenced. We also compared our loci to conventional plastid markers, commonly used for phylogenetic inference in this genus. RESULTS: We generated 1856 sequence tags in putative single-copy orthologous nuclear loci. Associated in silico primer pairs can potentially amplify fragments able to resolve a wide range of speciation events within the genus Rosa. Analysis of parsimony-informative site content showed the value of non-coding genomic regions to obtain variable sequences despite the fact that they may be more difficult to target in less related species. Dozens of nuclear loci outperform the conventional plastid phylogenetic markers in terms of phylogenetic informativeness, for both recent and ancient evolutionary divergences. However, conflicting phylogenetic signals were found between nuclear gene tree topologies and the species-tree topology, shedding light on the many patterns of hybridization and/or incomplete lineage sorting that occur in the genus Rosa. CONCLUSIONS: With recently published genome sequence data, we developed a set of single-copy orthologous nuclear loci to resolve species-level phylogenomics in the genus Rosa. This genome-wide scale dataset contains hundreds of highly variable loci which phylogenetic interest was assessed in terms of phylogenetic informativeness and topological conflict. Our target identification procedure can easily be reproduced to identify new highly informative loci for other taxonomic groups and ranks. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12862-019-1479-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6657147
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-66571472019-07-31 Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae) Debray, Kevin Marie-Magdelaine, Jordan Ruttink, Tom Clotault, Jérémy Foucher, Fabrice Malécot, Valéry BMC Evol Biol Methodology Article BACKGROUND: With an ever-growing number of published genomes, many low levels of the Tree of Life now contain several species with enough molecular data to perform shallow-scale phylogenomic studies. Moving away from using just a few universal phylogenetic markers, we can now target thousands of other loci to decipher taxa relationships. Making the best possible selection of informative sequences regarding the taxa studied has emerged as a new issue. Here, we developed a general procedure to mine genomic data, looking for orthologous single-copy loci capable of deciphering phylogenetic relationships below the generic rank. To develop our strategy, we chose the genus Rosa, a rapid-evolving lineage of the Rosaceae family in which several species genomes have recently been sequenced. We also compared our loci to conventional plastid markers, commonly used for phylogenetic inference in this genus. RESULTS: We generated 1856 sequence tags in putative single-copy orthologous nuclear loci. Associated in silico primer pairs can potentially amplify fragments able to resolve a wide range of speciation events within the genus Rosa. Analysis of parsimony-informative site content showed the value of non-coding genomic regions to obtain variable sequences despite the fact that they may be more difficult to target in less related species. Dozens of nuclear loci outperform the conventional plastid phylogenetic markers in terms of phylogenetic informativeness, for both recent and ancient evolutionary divergences. However, conflicting phylogenetic signals were found between nuclear gene tree topologies and the species-tree topology, shedding light on the many patterns of hybridization and/or incomplete lineage sorting that occur in the genus Rosa. CONCLUSIONS: With recently published genome sequence data, we developed a set of single-copy orthologous nuclear loci to resolve species-level phylogenomics in the genus Rosa. This genome-wide scale dataset contains hundreds of highly variable loci which phylogenetic interest was assessed in terms of phylogenetic informativeness and topological conflict. Our target identification procedure can easily be reproduced to identify new highly informative loci for other taxonomic groups and ranks. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12862-019-1479-z) contains supplementary material, which is available to authorized users. BioMed Central 2019-07-24 /pmc/articles/PMC6657147/ /pubmed/31340752 http://dx.doi.org/10.1186/s12862-019-1479-z Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Debray, Kevin
Marie-Magdelaine, Jordan
Ruttink, Tom
Clotault, Jérémy
Foucher, Fabrice
Malécot, Valéry
Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae)
title Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae)
title_full Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae)
title_fullStr Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae)
title_full_unstemmed Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae)
title_short Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae)
title_sort identification and assessment of variable single-copy orthologous (sco) nuclear loci for low-level phylogenomics: a case study in the genus rosa (rosaceae)
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6657147/
https://www.ncbi.nlm.nih.gov/pubmed/31340752
http://dx.doi.org/10.1186/s12862-019-1479-z
work_keys_str_mv AT debraykevin identificationandassessmentofvariablesinglecopyorthologoussconuclearlociforlowlevelphylogenomicsacasestudyinthegenusrosarosaceae
AT mariemagdelainejordan identificationandassessmentofvariablesinglecopyorthologoussconuclearlociforlowlevelphylogenomicsacasestudyinthegenusrosarosaceae
AT ruttinktom identificationandassessmentofvariablesinglecopyorthologoussconuclearlociforlowlevelphylogenomicsacasestudyinthegenusrosarosaceae
AT clotaultjeremy identificationandassessmentofvariablesinglecopyorthologoussconuclearlociforlowlevelphylogenomicsacasestudyinthegenusrosarosaceae
AT foucherfabrice identificationandassessmentofvariablesinglecopyorthologoussconuclearlociforlowlevelphylogenomicsacasestudyinthegenusrosarosaceae
AT malecotvalery identificationandassessmentofvariablesinglecopyorthologoussconuclearlociforlowlevelphylogenomicsacasestudyinthegenusrosarosaceae