Cargando…

Incomplete Lineage Sorting and Hybridization Statistics for Large-Scale Retroposon Insertion Data

Ancient retroposon insertions can be used as virtually homoplasy-free markers to reconstruct the phylogenetic history of species. Inherited, orthologous insertions in related species offer reliable signals of a common origin of the given species. One prerequisite for such a phylogenetically informat...

Descripción completa

Detalles Bibliográficos
Autores principales: Kuritzin, Andrej, Kischka, Tabea, Schmitz, Jürgen, Churakov, Gennady
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4788455/
https://www.ncbi.nlm.nih.gov/pubmed/26967525
http://dx.doi.org/10.1371/journal.pcbi.1004812
_version_ 1782420735915982848
author Kuritzin, Andrej
Kischka, Tabea
Schmitz, Jürgen
Churakov, Gennady
author_facet Kuritzin, Andrej
Kischka, Tabea
Schmitz, Jürgen
Churakov, Gennady
author_sort Kuritzin, Andrej
collection PubMed
description Ancient retroposon insertions can be used as virtually homoplasy-free markers to reconstruct the phylogenetic history of species. Inherited, orthologous insertions in related species offer reliable signals of a common origin of the given species. One prerequisite for such a phylogenetically informative insertion is that the inserted element was fixed in the ancestral population before speciation; if not, polymorphically inserted elements may lead to random distributions of presence/absence states during speciation and possibly to apparently conflicting reconstructions of their ancestry. Fortunately, such misleading fixed cases are relatively rare but nevertheless, need to be considered. Here, we present novel, comprehensive statistical models applicable for (1) analyzing any pattern of rare genomic changes, (2) testing and differentiating conflicting phylogenetic reconstructions based on rare genomic changes caused by incomplete lineage sorting or/and ancestral hybridization, and (3) differentiating between search strategies involving genome information from one or several lineages. When the new statistics are applied, in non-conflicting cases a minimum of three elements present in both of two species and absent in a third group are considered significant support (p<0.05) for the branching of the third from the other two, if all three of the given species are screened equally for genome or experimental data. Five elements are necessary for significant support (p<0.05) if a diagnostic locus derived from only one of three species is screened, and no conflicting markers are detected. Most potentially conflicting patterns can be evaluated for their significance and ancestral hybridization can be distinguished from incomplete lineage sorting by considering symmetric or asymmetric distribution of rare genomic changes among possible tree configurations. Additionally, we provide an R-application to make the new KKSC insertion significance test available for the scientific community at http://retrogenomics.uni-muenster.de:3838/KKSC_significance_test/.
format Online
Article
Text
id pubmed-4788455
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-47884552016-03-23 Incomplete Lineage Sorting and Hybridization Statistics for Large-Scale Retroposon Insertion Data Kuritzin, Andrej Kischka, Tabea Schmitz, Jürgen Churakov, Gennady PLoS Comput Biol Research Article Ancient retroposon insertions can be used as virtually homoplasy-free markers to reconstruct the phylogenetic history of species. Inherited, orthologous insertions in related species offer reliable signals of a common origin of the given species. One prerequisite for such a phylogenetically informative insertion is that the inserted element was fixed in the ancestral population before speciation; if not, polymorphically inserted elements may lead to random distributions of presence/absence states during speciation and possibly to apparently conflicting reconstructions of their ancestry. Fortunately, such misleading fixed cases are relatively rare but nevertheless, need to be considered. Here, we present novel, comprehensive statistical models applicable for (1) analyzing any pattern of rare genomic changes, (2) testing and differentiating conflicting phylogenetic reconstructions based on rare genomic changes caused by incomplete lineage sorting or/and ancestral hybridization, and (3) differentiating between search strategies involving genome information from one or several lineages. When the new statistics are applied, in non-conflicting cases a minimum of three elements present in both of two species and absent in a third group are considered significant support (p<0.05) for the branching of the third from the other two, if all three of the given species are screened equally for genome or experimental data. Five elements are necessary for significant support (p<0.05) if a diagnostic locus derived from only one of three species is screened, and no conflicting markers are detected. Most potentially conflicting patterns can be evaluated for their significance and ancestral hybridization can be distinguished from incomplete lineage sorting by considering symmetric or asymmetric distribution of rare genomic changes among possible tree configurations. Additionally, we provide an R-application to make the new KKSC insertion significance test available for the scientific community at http://retrogenomics.uni-muenster.de:3838/KKSC_significance_test/. Public Library of Science 2016-03-11 /pmc/articles/PMC4788455/ /pubmed/26967525 http://dx.doi.org/10.1371/journal.pcbi.1004812 Text en © 2016 Kuritzin et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Kuritzin, Andrej
Kischka, Tabea
Schmitz, Jürgen
Churakov, Gennady
Incomplete Lineage Sorting and Hybridization Statistics for Large-Scale Retroposon Insertion Data
title Incomplete Lineage Sorting and Hybridization Statistics for Large-Scale Retroposon Insertion Data
title_full Incomplete Lineage Sorting and Hybridization Statistics for Large-Scale Retroposon Insertion Data
title_fullStr Incomplete Lineage Sorting and Hybridization Statistics for Large-Scale Retroposon Insertion Data
title_full_unstemmed Incomplete Lineage Sorting and Hybridization Statistics for Large-Scale Retroposon Insertion Data
title_short Incomplete Lineage Sorting and Hybridization Statistics for Large-Scale Retroposon Insertion Data
title_sort incomplete lineage sorting and hybridization statistics for large-scale retroposon insertion data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4788455/
https://www.ncbi.nlm.nih.gov/pubmed/26967525
http://dx.doi.org/10.1371/journal.pcbi.1004812
work_keys_str_mv AT kuritzinandrej incompletelineagesortingandhybridizationstatisticsforlargescaleretroposoninsertiondata
AT kischkatabea incompletelineagesortingandhybridizationstatisticsforlargescaleretroposoninsertiondata
AT schmitzjurgen incompletelineagesortingandhybridizationstatisticsforlargescaleretroposoninsertiondata
AT churakovgennady incompletelineagesortingandhybridizationstatisticsforlargescaleretroposoninsertiondata