Cargando…
WGDTree: a phylogenetic software tool to examine conditional probabilities of retention following whole genome duplication events
BACKGROUND: Multiple processes impact the probability of retention of individual genes following whole genome duplication (WGD) events. In analyzing two consecutive whole genome duplication events that occurred in the lineage leading to Atlantic salmon, a new phylogenetic statistical analysis was de...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9701042/ https://www.ncbi.nlm.nih.gov/pubmed/36434497 http://dx.doi.org/10.1186/s12859-022-05042-w |
_version_ | 1784839454740971520 |
---|---|
author | Henry, C. Nicholas Piper, Kathryn Wilson, Amanda E. Miraszek, John L. Probst, Claire S. Rong, Yuying Liberles, David A. |
author_facet | Henry, C. Nicholas Piper, Kathryn Wilson, Amanda E. Miraszek, John L. Probst, Claire S. Rong, Yuying Liberles, David A. |
author_sort | Henry, C. Nicholas |
collection | PubMed |
description | BACKGROUND: Multiple processes impact the probability of retention of individual genes following whole genome duplication (WGD) events. In analyzing two consecutive whole genome duplication events that occurred in the lineage leading to Atlantic salmon, a new phylogenetic statistical analysis was developed to examine the contingency of retention in one event based upon retention in a previous event. This analysis is intended to evaluate mechanisms of duplicate gene retention and to provide software to generate the test statistic for any genome with pairs of WGDs in its history. RESULTS: Here a software package written in Python, ‘WGDTree’ for the analysis of duplicate gene retention following whole genome duplication events is presented. Using gene tree-species tree reconciliation to label gene duplicate nodes and differentiate between WGD and SSD duplicates, the tool calculates a statistic based upon the conditional probability of a gene duplicate being retained after a second whole genome duplication dependent upon the retention status after the first event. The package also contains methods for the simulation of gene trees with WGD events. After running simulations, the accuracy of the placement of events has been determined to be high. The conditional probability statistic has been calculated for Phalaenopsis equestris on a monocot species tree with a pair of consecutive WGD events on its lineage, showing the applicability of the method. CONCLUSIONS: A new software tool has been created for the analysis of duplicate genes in examination of retention mechanisms. The software tool has been made available on the Python package index and the source code can be found on GitHub here: https://github.com/cnickh/wgdtree. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-05042-w. |
format | Online Article Text |
id | pubmed-9701042 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-97010422022-11-27 WGDTree: a phylogenetic software tool to examine conditional probabilities of retention following whole genome duplication events Henry, C. Nicholas Piper, Kathryn Wilson, Amanda E. Miraszek, John L. Probst, Claire S. Rong, Yuying Liberles, David A. BMC Bioinformatics Software BACKGROUND: Multiple processes impact the probability of retention of individual genes following whole genome duplication (WGD) events. In analyzing two consecutive whole genome duplication events that occurred in the lineage leading to Atlantic salmon, a new phylogenetic statistical analysis was developed to examine the contingency of retention in one event based upon retention in a previous event. This analysis is intended to evaluate mechanisms of duplicate gene retention and to provide software to generate the test statistic for any genome with pairs of WGDs in its history. RESULTS: Here a software package written in Python, ‘WGDTree’ for the analysis of duplicate gene retention following whole genome duplication events is presented. Using gene tree-species tree reconciliation to label gene duplicate nodes and differentiate between WGD and SSD duplicates, the tool calculates a statistic based upon the conditional probability of a gene duplicate being retained after a second whole genome duplication dependent upon the retention status after the first event. The package also contains methods for the simulation of gene trees with WGD events. After running simulations, the accuracy of the placement of events has been determined to be high. The conditional probability statistic has been calculated for Phalaenopsis equestris on a monocot species tree with a pair of consecutive WGD events on its lineage, showing the applicability of the method. CONCLUSIONS: A new software tool has been created for the analysis of duplicate genes in examination of retention mechanisms. The software tool has been made available on the Python package index and the source code can be found on GitHub here: https://github.com/cnickh/wgdtree. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-05042-w. BioMed Central 2022-11-24 /pmc/articles/PMC9701042/ /pubmed/36434497 http://dx.doi.org/10.1186/s12859-022-05042-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Software Henry, C. Nicholas Piper, Kathryn Wilson, Amanda E. Miraszek, John L. Probst, Claire S. Rong, Yuying Liberles, David A. WGDTree: a phylogenetic software tool to examine conditional probabilities of retention following whole genome duplication events |
title | WGDTree: a phylogenetic software tool to examine conditional probabilities of retention following whole genome duplication events |
title_full | WGDTree: a phylogenetic software tool to examine conditional probabilities of retention following whole genome duplication events |
title_fullStr | WGDTree: a phylogenetic software tool to examine conditional probabilities of retention following whole genome duplication events |
title_full_unstemmed | WGDTree: a phylogenetic software tool to examine conditional probabilities of retention following whole genome duplication events |
title_short | WGDTree: a phylogenetic software tool to examine conditional probabilities of retention following whole genome duplication events |
title_sort | wgdtree: a phylogenetic software tool to examine conditional probabilities of retention following whole genome duplication events |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9701042/ https://www.ncbi.nlm.nih.gov/pubmed/36434497 http://dx.doi.org/10.1186/s12859-022-05042-w |
work_keys_str_mv | AT henrycnicholas wgdtreeaphylogeneticsoftwaretooltoexamineconditionalprobabilitiesofretentionfollowingwholegenomeduplicationevents AT piperkathryn wgdtreeaphylogeneticsoftwaretooltoexamineconditionalprobabilitiesofretentionfollowingwholegenomeduplicationevents AT wilsonamandae wgdtreeaphylogeneticsoftwaretooltoexamineconditionalprobabilitiesofretentionfollowingwholegenomeduplicationevents AT miraszekjohnl wgdtreeaphylogeneticsoftwaretooltoexamineconditionalprobabilitiesofretentionfollowingwholegenomeduplicationevents AT probstclaires wgdtreeaphylogeneticsoftwaretooltoexamineconditionalprobabilitiesofretentionfollowingwholegenomeduplicationevents AT rongyuying wgdtreeaphylogeneticsoftwaretooltoexamineconditionalprobabilitiesofretentionfollowingwholegenomeduplicationevents AT liberlesdavida wgdtreeaphylogeneticsoftwaretooltoexamineconditionalprobabilitiesofretentionfollowingwholegenomeduplicationevents |