Cargando…

Exploring the utility of cross-laboratory RAD-sequencing datasets for phylogenetic analysis

BACKGROUND: Restriction site-Associated DNA sequencing (RAD-Seq) is widely applied to generate genome-wide sequence and genetic marker datasets. RAD-Seq has been extensively utilised, both at the population level and across species, for example in the construction of phylogenetic trees. However, the...

Descripción completa

Detalles Bibliográficos
Autores principales: Gonen, Serap, Bishop, Stephen C, Houston, Ross D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4495686/
https://www.ncbi.nlm.nih.gov/pubmed/26152111
http://dx.doi.org/10.1186/s13104-015-1261-2
_version_ 1782380290567569408
author Gonen, Serap
Bishop, Stephen C
Houston, Ross D
author_facet Gonen, Serap
Bishop, Stephen C
Houston, Ross D
author_sort Gonen, Serap
collection PubMed
description BACKGROUND: Restriction site-Associated DNA sequencing (RAD-Seq) is widely applied to generate genome-wide sequence and genetic marker datasets. RAD-Seq has been extensively utilised, both at the population level and across species, for example in the construction of phylogenetic trees. However, the consistency of RAD-Seq data generated in different laboratories, and the potential use of cross-species orthologous RAD loci in the estimation of genetic relationships, have not been widely investigated. This study describes the use of SbfI RAD-Seq data for the estimation of evolutionary relationships amongst ten teleost fish species, using previously established phylogeny as a benchmark. RESULTS: The number of orthologous SbfI RAD loci identified decreased with increasing evolutionary distance between the species, with several thousand loci conserved across five salmonid species (divergence ~50 MY), and several hundred conserved across the more distantly related teleost species (divergence ~100–360 MY). The majority (>70%) of loci identified between the more distantly related species were genic in origin, suggesting that the bias of SbfI towards genic regions is useful for identifying distant orthologs. Interspecific single nucleotide variants at each orthologous RAD locus were identified. Evolutionary relationships estimated using concatenated sequences of interspecific variants were congruent with previously published phylogenies, even for distantly (divergence up to ~360 MY) related species. CONCLUSION: Overall, this study has demonstrated that orthologous SbfI RAD loci can be identified across closely and distantly related species. This has positive implications for the repeatability of SbfI RAD-Seq and its potential to address research questions beyond the scope of the original studies. Furthermore, the concordance in tree topologies and relationships estimated in this study with published teleost phylogenies suggests that similar meta-datasets could be utilised in the prediction of evolutionary relationships across populations and species with readily available RAD-Seq datasets, but for which relationships remain uncharacterised. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13104-015-1261-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4495686
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44956862015-07-09 Exploring the utility of cross-laboratory RAD-sequencing datasets for phylogenetic analysis Gonen, Serap Bishop, Stephen C Houston, Ross D BMC Res Notes Research Article BACKGROUND: Restriction site-Associated DNA sequencing (RAD-Seq) is widely applied to generate genome-wide sequence and genetic marker datasets. RAD-Seq has been extensively utilised, both at the population level and across species, for example in the construction of phylogenetic trees. However, the consistency of RAD-Seq data generated in different laboratories, and the potential use of cross-species orthologous RAD loci in the estimation of genetic relationships, have not been widely investigated. This study describes the use of SbfI RAD-Seq data for the estimation of evolutionary relationships amongst ten teleost fish species, using previously established phylogeny as a benchmark. RESULTS: The number of orthologous SbfI RAD loci identified decreased with increasing evolutionary distance between the species, with several thousand loci conserved across five salmonid species (divergence ~50 MY), and several hundred conserved across the more distantly related teleost species (divergence ~100–360 MY). The majority (>70%) of loci identified between the more distantly related species were genic in origin, suggesting that the bias of SbfI towards genic regions is useful for identifying distant orthologs. Interspecific single nucleotide variants at each orthologous RAD locus were identified. Evolutionary relationships estimated using concatenated sequences of interspecific variants were congruent with previously published phylogenies, even for distantly (divergence up to ~360 MY) related species. CONCLUSION: Overall, this study has demonstrated that orthologous SbfI RAD loci can be identified across closely and distantly related species. This has positive implications for the repeatability of SbfI RAD-Seq and its potential to address research questions beyond the scope of the original studies. Furthermore, the concordance in tree topologies and relationships estimated in this study with published teleost phylogenies suggests that similar meta-datasets could be utilised in the prediction of evolutionary relationships across populations and species with readily available RAD-Seq datasets, but for which relationships remain uncharacterised. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13104-015-1261-2) contains supplementary material, which is available to authorized users. BioMed Central 2015-07-08 /pmc/articles/PMC4495686/ /pubmed/26152111 http://dx.doi.org/10.1186/s13104-015-1261-2 Text en © Gonen et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Gonen, Serap
Bishop, Stephen C
Houston, Ross D
Exploring the utility of cross-laboratory RAD-sequencing datasets for phylogenetic analysis
title Exploring the utility of cross-laboratory RAD-sequencing datasets for phylogenetic analysis
title_full Exploring the utility of cross-laboratory RAD-sequencing datasets for phylogenetic analysis
title_fullStr Exploring the utility of cross-laboratory RAD-sequencing datasets for phylogenetic analysis
title_full_unstemmed Exploring the utility of cross-laboratory RAD-sequencing datasets for phylogenetic analysis
title_short Exploring the utility of cross-laboratory RAD-sequencing datasets for phylogenetic analysis
title_sort exploring the utility of cross-laboratory rad-sequencing datasets for phylogenetic analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4495686/
https://www.ncbi.nlm.nih.gov/pubmed/26152111
http://dx.doi.org/10.1186/s13104-015-1261-2
work_keys_str_mv AT gonenserap exploringtheutilityofcrosslaboratoryradsequencingdatasetsforphylogeneticanalysis
AT bishopstephenc exploringtheutilityofcrosslaboratoryradsequencingdatasetsforphylogeneticanalysis
AT houstonrossd exploringtheutilityofcrosslaboratoryradsequencingdatasetsforphylogeneticanalysis