Cargando…

hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test

1. The homoplasy excess test (HET) is a tree‐based screen for hybrid taxa in multilocus nuclear phylogenies. Homoplasy between a hybrid taxon and the clades containing the parental taxa reduces bootstrap support in the tree. The HET is based on the expectation that excluding the hybrid taxon from th...

Descripción completa

Detalles Bibliográficos
Autores principales: Schneider, Kevin, Koblmüller, Stephan, Sefc, Kristina M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4824276/
https://www.ncbi.nlm.nih.gov/pubmed/27066216
http://dx.doi.org/10.1111/2041-210X.12490
_version_ 1782426062441938944
author Schneider, Kevin
Koblmüller, Stephan
Sefc, Kristina M.
author_facet Schneider, Kevin
Koblmüller, Stephan
Sefc, Kristina M.
author_sort Schneider, Kevin
collection PubMed
description 1. The homoplasy excess test (HET) is a tree‐based screen for hybrid taxa in multilocus nuclear phylogenies. Homoplasy between a hybrid taxon and the clades containing the parental taxa reduces bootstrap support in the tree. The HET is based on the expectation that excluding the hybrid taxon from the data set increases the bootstrap support for the parental clades, whereas excluding non‐hybrid taxa has little effect on statistical node support. To carry out a HET, bootstrap trees are calculated with taxon‐jackknife data sets, that is excluding one taxon (species, population) at a time. Excess increase in bootstrap support for certain nodes upon exclusion of a particular taxon indicates the hybrid (the excluded taxon) and its parents (the clades with increased support). 2. We introduce a new software program, hext, which generates the taxon‐jackknife data sets, runs the bootstrap tree calculations, and identifies excess bootstrap increases as outlier values in boxplot graphs. hext is written in r language and accepts binary data (0/1; e.g. AFLP) as well as co‐dominant SNP and genotype data. 3. We demonstrate the usefulness of hext in large SNP data sets containing putative hybrids and their parents. For instance, using published data of the genus Vitis (˜6,000 SNP loci), hext output supports V. × champinii as a hybrid between V. rupestris and V. mustangensis. 4. With simulated SNP and AFLP data sets, excess increases in bootstrap support were not always connected with the hybrid taxon (false positives), whereas the expected bootstrap signal failed to appear on several occasions (false negatives). Potential causes for both types of spurious results are discussed. 5. With both empirical and simulated data sets, the taxon‐jackknife output generated by hext provided additional signatures of hybrid taxa, including changes in tree topology across trees, consistent effects of exclusions of the hybrid and the parent taxa, and moderate (rather than excessive) increases in bootstrap support. hext significantly facilitates the taxon‐jackknife approach to hybrid taxon detection, even though the simple test for excess bootstrap increase may not reliably identify hybrid taxa in all applications.
format Online
Article
Text
id pubmed-4824276
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-48242762016-04-07 hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test Schneider, Kevin Koblmüller, Stephan Sefc, Kristina M. Methods Ecol Evol Evolution 1. The homoplasy excess test (HET) is a tree‐based screen for hybrid taxa in multilocus nuclear phylogenies. Homoplasy between a hybrid taxon and the clades containing the parental taxa reduces bootstrap support in the tree. The HET is based on the expectation that excluding the hybrid taxon from the data set increases the bootstrap support for the parental clades, whereas excluding non‐hybrid taxa has little effect on statistical node support. To carry out a HET, bootstrap trees are calculated with taxon‐jackknife data sets, that is excluding one taxon (species, population) at a time. Excess increase in bootstrap support for certain nodes upon exclusion of a particular taxon indicates the hybrid (the excluded taxon) and its parents (the clades with increased support). 2. We introduce a new software program, hext, which generates the taxon‐jackknife data sets, runs the bootstrap tree calculations, and identifies excess bootstrap increases as outlier values in boxplot graphs. hext is written in r language and accepts binary data (0/1; e.g. AFLP) as well as co‐dominant SNP and genotype data. 3. We demonstrate the usefulness of hext in large SNP data sets containing putative hybrids and their parents. For instance, using published data of the genus Vitis (˜6,000 SNP loci), hext output supports V. × champinii as a hybrid between V. rupestris and V. mustangensis. 4. With simulated SNP and AFLP data sets, excess increases in bootstrap support were not always connected with the hybrid taxon (false positives), whereas the expected bootstrap signal failed to appear on several occasions (false negatives). Potential causes for both types of spurious results are discussed. 5. With both empirical and simulated data sets, the taxon‐jackknife output generated by hext provided additional signatures of hybrid taxa, including changes in tree topology across trees, consistent effects of exclusions of the hybrid and the parent taxa, and moderate (rather than excessive) increases in bootstrap support. hext significantly facilitates the taxon‐jackknife approach to hybrid taxon detection, even though the simple test for excess bootstrap increase may not reliably identify hybrid taxa in all applications. John Wiley and Sons Inc. 2015-11-11 2016-03 /pmc/articles/PMC4824276/ /pubmed/27066216 http://dx.doi.org/10.1111/2041-210X.12490 Text en © 2015 The Authors. Methods in Ecology and Evolution published by John Wiley & Sons Ltd on behalf of British Ecological Society This is an open access article under the terms of the Creative Commons Attribution (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Evolution
Schneider, Kevin
Koblmüller, Stephan
Sefc, Kristina M.
hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test
title hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test
title_full hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test
title_fullStr hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test
title_full_unstemmed hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test
title_short hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test
title_sort hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test
topic Evolution
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4824276/
https://www.ncbi.nlm.nih.gov/pubmed/27066216
http://dx.doi.org/10.1111/2041-210X.12490
work_keys_str_mv AT schneiderkevin hextasoftwaresupportingtreebasedscreensforhybridtaxainmultilocusdatasetsandanevaluationofthehomoplasyexcesstest
AT koblmullerstephan hextasoftwaresupportingtreebasedscreensforhybridtaxainmultilocusdatasetsandanevaluationofthehomoplasyexcesstest
AT sefckristinam hextasoftwaresupportingtreebasedscreensforhybridtaxainmultilocusdatasetsandanevaluationofthehomoplasyexcesstest