Cargando…
Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae)
BACKGROUND: With an ever-growing number of published genomes, many low levels of the Tree of Life now contain several species with enough molecular data to perform shallow-scale phylogenomic studies. Moving away from using just a few universal phylogenetic markers, we can now target thousands of oth...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6657147/ https://www.ncbi.nlm.nih.gov/pubmed/31340752 http://dx.doi.org/10.1186/s12862-019-1479-z |
_version_ | 1783438755218063360 |
---|---|
author | Debray, Kevin Marie-Magdelaine, Jordan Ruttink, Tom Clotault, Jérémy Foucher, Fabrice Malécot, Valéry |
author_facet | Debray, Kevin Marie-Magdelaine, Jordan Ruttink, Tom Clotault, Jérémy Foucher, Fabrice Malécot, Valéry |
author_sort | Debray, Kevin |
collection | PubMed |
description | BACKGROUND: With an ever-growing number of published genomes, many low levels of the Tree of Life now contain several species with enough molecular data to perform shallow-scale phylogenomic studies. Moving away from using just a few universal phylogenetic markers, we can now target thousands of other loci to decipher taxa relationships. Making the best possible selection of informative sequences regarding the taxa studied has emerged as a new issue. Here, we developed a general procedure to mine genomic data, looking for orthologous single-copy loci capable of deciphering phylogenetic relationships below the generic rank. To develop our strategy, we chose the genus Rosa, a rapid-evolving lineage of the Rosaceae family in which several species genomes have recently been sequenced. We also compared our loci to conventional plastid markers, commonly used for phylogenetic inference in this genus. RESULTS: We generated 1856 sequence tags in putative single-copy orthologous nuclear loci. Associated in silico primer pairs can potentially amplify fragments able to resolve a wide range of speciation events within the genus Rosa. Analysis of parsimony-informative site content showed the value of non-coding genomic regions to obtain variable sequences despite the fact that they may be more difficult to target in less related species. Dozens of nuclear loci outperform the conventional plastid phylogenetic markers in terms of phylogenetic informativeness, for both recent and ancient evolutionary divergences. However, conflicting phylogenetic signals were found between nuclear gene tree topologies and the species-tree topology, shedding light on the many patterns of hybridization and/or incomplete lineage sorting that occur in the genus Rosa. CONCLUSIONS: With recently published genome sequence data, we developed a set of single-copy orthologous nuclear loci to resolve species-level phylogenomics in the genus Rosa. This genome-wide scale dataset contains hundreds of highly variable loci which phylogenetic interest was assessed in terms of phylogenetic informativeness and topological conflict. Our target identification procedure can easily be reproduced to identify new highly informative loci for other taxonomic groups and ranks. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12862-019-1479-z) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6657147 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-66571472019-07-31 Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae) Debray, Kevin Marie-Magdelaine, Jordan Ruttink, Tom Clotault, Jérémy Foucher, Fabrice Malécot, Valéry BMC Evol Biol Methodology Article BACKGROUND: With an ever-growing number of published genomes, many low levels of the Tree of Life now contain several species with enough molecular data to perform shallow-scale phylogenomic studies. Moving away from using just a few universal phylogenetic markers, we can now target thousands of other loci to decipher taxa relationships. Making the best possible selection of informative sequences regarding the taxa studied has emerged as a new issue. Here, we developed a general procedure to mine genomic data, looking for orthologous single-copy loci capable of deciphering phylogenetic relationships below the generic rank. To develop our strategy, we chose the genus Rosa, a rapid-evolving lineage of the Rosaceae family in which several species genomes have recently been sequenced. We also compared our loci to conventional plastid markers, commonly used for phylogenetic inference in this genus. RESULTS: We generated 1856 sequence tags in putative single-copy orthologous nuclear loci. Associated in silico primer pairs can potentially amplify fragments able to resolve a wide range of speciation events within the genus Rosa. Analysis of parsimony-informative site content showed the value of non-coding genomic regions to obtain variable sequences despite the fact that they may be more difficult to target in less related species. Dozens of nuclear loci outperform the conventional plastid phylogenetic markers in terms of phylogenetic informativeness, for both recent and ancient evolutionary divergences. However, conflicting phylogenetic signals were found between nuclear gene tree topologies and the species-tree topology, shedding light on the many patterns of hybridization and/or incomplete lineage sorting that occur in the genus Rosa. CONCLUSIONS: With recently published genome sequence data, we developed a set of single-copy orthologous nuclear loci to resolve species-level phylogenomics in the genus Rosa. This genome-wide scale dataset contains hundreds of highly variable loci which phylogenetic interest was assessed in terms of phylogenetic informativeness and topological conflict. Our target identification procedure can easily be reproduced to identify new highly informative loci for other taxonomic groups and ranks. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12862-019-1479-z) contains supplementary material, which is available to authorized users. BioMed Central 2019-07-24 /pmc/articles/PMC6657147/ /pubmed/31340752 http://dx.doi.org/10.1186/s12862-019-1479-z Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Debray, Kevin Marie-Magdelaine, Jordan Ruttink, Tom Clotault, Jérémy Foucher, Fabrice Malécot, Valéry Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae) |
title | Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae) |
title_full | Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae) |
title_fullStr | Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae) |
title_full_unstemmed | Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae) |
title_short | Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae) |
title_sort | identification and assessment of variable single-copy orthologous (sco) nuclear loci for low-level phylogenomics: a case study in the genus rosa (rosaceae) |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6657147/ https://www.ncbi.nlm.nih.gov/pubmed/31340752 http://dx.doi.org/10.1186/s12862-019-1479-z |
work_keys_str_mv | AT debraykevin identificationandassessmentofvariablesinglecopyorthologoussconuclearlociforlowlevelphylogenomicsacasestudyinthegenusrosarosaceae AT mariemagdelainejordan identificationandassessmentofvariablesinglecopyorthologoussconuclearlociforlowlevelphylogenomicsacasestudyinthegenusrosarosaceae AT ruttinktom identificationandassessmentofvariablesinglecopyorthologoussconuclearlociforlowlevelphylogenomicsacasestudyinthegenusrosarosaceae AT clotaultjeremy identificationandassessmentofvariablesinglecopyorthologoussconuclearlociforlowlevelphylogenomicsacasestudyinthegenusrosarosaceae AT foucherfabrice identificationandassessmentofvariablesinglecopyorthologoussconuclearlociforlowlevelphylogenomicsacasestudyinthegenusrosarosaceae AT malecotvalery identificationandassessmentofvariablesinglecopyorthologoussconuclearlociforlowlevelphylogenomicsacasestudyinthegenusrosarosaceae |