Cargando…
hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test
1. The homoplasy excess test (HET) is a tree‐based screen for hybrid taxa in multilocus nuclear phylogenies. Homoplasy between a hybrid taxon and the clades containing the parental taxa reduces bootstrap support in the tree. The HET is based on the expectation that excluding the hybrid taxon from th...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4824276/ https://www.ncbi.nlm.nih.gov/pubmed/27066216 http://dx.doi.org/10.1111/2041-210X.12490 |
_version_ | 1782426062441938944 |
---|---|
author | Schneider, Kevin Koblmüller, Stephan Sefc, Kristina M. |
author_facet | Schneider, Kevin Koblmüller, Stephan Sefc, Kristina M. |
author_sort | Schneider, Kevin |
collection | PubMed |
description | 1. The homoplasy excess test (HET) is a tree‐based screen for hybrid taxa in multilocus nuclear phylogenies. Homoplasy between a hybrid taxon and the clades containing the parental taxa reduces bootstrap support in the tree. The HET is based on the expectation that excluding the hybrid taxon from the data set increases the bootstrap support for the parental clades, whereas excluding non‐hybrid taxa has little effect on statistical node support. To carry out a HET, bootstrap trees are calculated with taxon‐jackknife data sets, that is excluding one taxon (species, population) at a time. Excess increase in bootstrap support for certain nodes upon exclusion of a particular taxon indicates the hybrid (the excluded taxon) and its parents (the clades with increased support). 2. We introduce a new software program, hext, which generates the taxon‐jackknife data sets, runs the bootstrap tree calculations, and identifies excess bootstrap increases as outlier values in boxplot graphs. hext is written in r language and accepts binary data (0/1; e.g. AFLP) as well as co‐dominant SNP and genotype data. 3. We demonstrate the usefulness of hext in large SNP data sets containing putative hybrids and their parents. For instance, using published data of the genus Vitis (˜6,000 SNP loci), hext output supports V. × champinii as a hybrid between V. rupestris and V. mustangensis. 4. With simulated SNP and AFLP data sets, excess increases in bootstrap support were not always connected with the hybrid taxon (false positives), whereas the expected bootstrap signal failed to appear on several occasions (false negatives). Potential causes for both types of spurious results are discussed. 5. With both empirical and simulated data sets, the taxon‐jackknife output generated by hext provided additional signatures of hybrid taxa, including changes in tree topology across trees, consistent effects of exclusions of the hybrid and the parent taxa, and moderate (rather than excessive) increases in bootstrap support. hext significantly facilitates the taxon‐jackknife approach to hybrid taxon detection, even though the simple test for excess bootstrap increase may not reliably identify hybrid taxa in all applications. |
format | Online Article Text |
id | pubmed-4824276 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-48242762016-04-07 hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test Schneider, Kevin Koblmüller, Stephan Sefc, Kristina M. Methods Ecol Evol Evolution 1. The homoplasy excess test (HET) is a tree‐based screen for hybrid taxa in multilocus nuclear phylogenies. Homoplasy between a hybrid taxon and the clades containing the parental taxa reduces bootstrap support in the tree. The HET is based on the expectation that excluding the hybrid taxon from the data set increases the bootstrap support for the parental clades, whereas excluding non‐hybrid taxa has little effect on statistical node support. To carry out a HET, bootstrap trees are calculated with taxon‐jackknife data sets, that is excluding one taxon (species, population) at a time. Excess increase in bootstrap support for certain nodes upon exclusion of a particular taxon indicates the hybrid (the excluded taxon) and its parents (the clades with increased support). 2. We introduce a new software program, hext, which generates the taxon‐jackknife data sets, runs the bootstrap tree calculations, and identifies excess bootstrap increases as outlier values in boxplot graphs. hext is written in r language and accepts binary data (0/1; e.g. AFLP) as well as co‐dominant SNP and genotype data. 3. We demonstrate the usefulness of hext in large SNP data sets containing putative hybrids and their parents. For instance, using published data of the genus Vitis (˜6,000 SNP loci), hext output supports V. × champinii as a hybrid between V. rupestris and V. mustangensis. 4. With simulated SNP and AFLP data sets, excess increases in bootstrap support were not always connected with the hybrid taxon (false positives), whereas the expected bootstrap signal failed to appear on several occasions (false negatives). Potential causes for both types of spurious results are discussed. 5. With both empirical and simulated data sets, the taxon‐jackknife output generated by hext provided additional signatures of hybrid taxa, including changes in tree topology across trees, consistent effects of exclusions of the hybrid and the parent taxa, and moderate (rather than excessive) increases in bootstrap support. hext significantly facilitates the taxon‐jackknife approach to hybrid taxon detection, even though the simple test for excess bootstrap increase may not reliably identify hybrid taxa in all applications. John Wiley and Sons Inc. 2015-11-11 2016-03 /pmc/articles/PMC4824276/ /pubmed/27066216 http://dx.doi.org/10.1111/2041-210X.12490 Text en © 2015 The Authors. Methods in Ecology and Evolution published by John Wiley & Sons Ltd on behalf of British Ecological Society This is an open access article under the terms of the Creative Commons Attribution (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Evolution Schneider, Kevin Koblmüller, Stephan Sefc, Kristina M. hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test |
title |
hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test |
title_full |
hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test |
title_fullStr |
hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test |
title_full_unstemmed |
hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test |
title_short |
hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test |
title_sort | hext, a software supporting tree‐based screens for hybrid taxa in multilocus data sets, and an evaluation of the homoplasy excess test |
topic | Evolution |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4824276/ https://www.ncbi.nlm.nih.gov/pubmed/27066216 http://dx.doi.org/10.1111/2041-210X.12490 |
work_keys_str_mv | AT schneiderkevin hextasoftwaresupportingtreebasedscreensforhybridtaxainmultilocusdatasetsandanevaluationofthehomoplasyexcesstest AT koblmullerstephan hextasoftwaresupportingtreebasedscreensforhybridtaxainmultilocusdatasetsandanevaluationofthehomoplasyexcesstest AT sefckristinam hextasoftwaresupportingtreebasedscreensforhybridtaxainmultilocusdatasetsandanevaluationofthehomoplasyexcesstest |