Cargando…
OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information
Species phylogenetic trees represent the evolutionary processes of organisms, and they are fundamental in evolutionary research. Therefore, new methods have been developed to obtain more reliable species phylogenetic trees. A highly reliable method is the construction of an ortholog data set based o...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9991595/ https://www.ncbi.nlm.nih.gov/pubmed/36799928 http://dx.doi.org/10.1093/gbe/evad026 |
_version_ | 1784902185314680832 |
---|---|
author | Watanabe, Tomoaki Kure, Akinori Horiike, Tokumasa |
author_facet | Watanabe, Tomoaki Kure, Akinori Horiike, Tokumasa |
author_sort | Watanabe, Tomoaki |
collection | PubMed |
description | Species phylogenetic trees represent the evolutionary processes of organisms, and they are fundamental in evolutionary research. Therefore, new methods have been developed to obtain more reliable species phylogenetic trees. A highly reliable method is the construction of an ortholog data set based on sequence information of genes, which is then used to infer the species phylogenetic tree. However, although methods for constructing an ortholog data set for species phylogenetic analysis have been developed, they cannot remove some paralogs, which is necessary for reliable species phylogenetic inference. To address the limitations of current methods, we developed OrthoPhy, a program that excludes paralogs and constructs highly accurate ortholog data sets using taxonomic information dividing analyzed species into monophyletic groups. OrthoPhy can remove paralogs, detecting inconsistencies between taxonomic information and phylogenetic trees of candidate ortholog groups clustered by sequence similarity. Performance tests using evolutionary simulated sequences and real sequences of 40 bacteria revealed that the precision of ortholog inference by OrthoPhy is higher than that of existing programs. Additionally, the phylogenetic analysis of species was more accurate when performed using ortholog data sets constructed by OrthoPhy than that performed using data sets constructed by existing programs. Furthermore, we performed a benchmark test of the Quest for Orthologs using real sequence data and found that the concordance rate between the phylogenetic trees of orthologs inferred by OrthoPhy and those of species was higher than the rates obtained by other ortholog inference programs. Therefore, ortholog data sets constructed using OrthoPhy enabled a more accurate phylogenetic analysis of species than those constructed using the existing programs, and OrthoPhy can be used for the phylogenetic analysis of species even for distantly related species that have experienced many evolutionary events. |
format | Online Article Text |
id | pubmed-9991595 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-99915952023-03-08 OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information Watanabe, Tomoaki Kure, Akinori Horiike, Tokumasa Genome Biol Evol Article Species phylogenetic trees represent the evolutionary processes of organisms, and they are fundamental in evolutionary research. Therefore, new methods have been developed to obtain more reliable species phylogenetic trees. A highly reliable method is the construction of an ortholog data set based on sequence information of genes, which is then used to infer the species phylogenetic tree. However, although methods for constructing an ortholog data set for species phylogenetic analysis have been developed, they cannot remove some paralogs, which is necessary for reliable species phylogenetic inference. To address the limitations of current methods, we developed OrthoPhy, a program that excludes paralogs and constructs highly accurate ortholog data sets using taxonomic information dividing analyzed species into monophyletic groups. OrthoPhy can remove paralogs, detecting inconsistencies between taxonomic information and phylogenetic trees of candidate ortholog groups clustered by sequence similarity. Performance tests using evolutionary simulated sequences and real sequences of 40 bacteria revealed that the precision of ortholog inference by OrthoPhy is higher than that of existing programs. Additionally, the phylogenetic analysis of species was more accurate when performed using ortholog data sets constructed by OrthoPhy than that performed using data sets constructed by existing programs. Furthermore, we performed a benchmark test of the Quest for Orthologs using real sequence data and found that the concordance rate between the phylogenetic trees of orthologs inferred by OrthoPhy and those of species was higher than the rates obtained by other ortholog inference programs. Therefore, ortholog data sets constructed using OrthoPhy enabled a more accurate phylogenetic analysis of species than those constructed using the existing programs, and OrthoPhy can be used for the phylogenetic analysis of species even for distantly related species that have experienced many evolutionary events. Oxford University Press 2023-02-17 /pmc/articles/PMC9991595/ /pubmed/36799928 http://dx.doi.org/10.1093/gbe/evad026 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Article Watanabe, Tomoaki Kure, Akinori Horiike, Tokumasa OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information |
title | OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information |
title_full | OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information |
title_fullStr | OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information |
title_full_unstemmed | OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information |
title_short | OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information |
title_sort | orthophy: a program to construct ortholog data sets using taxonomic information |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9991595/ https://www.ncbi.nlm.nih.gov/pubmed/36799928 http://dx.doi.org/10.1093/gbe/evad026 |
work_keys_str_mv | AT watanabetomoaki orthophyaprogramtoconstructorthologdatasetsusingtaxonomicinformation AT kureakinori orthophyaprogramtoconstructorthologdatasetsusingtaxonomicinformation AT horiiketokumasa orthophyaprogramtoconstructorthologdatasetsusingtaxonomicinformation |