Cargando…

OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information

Species phylogenetic trees represent the evolutionary processes of organisms, and they are fundamental in evolutionary research. Therefore, new methods have been developed to obtain more reliable species phylogenetic trees. A highly reliable method is the construction of an ortholog data set based o...

Descripción completa

Detalles Bibliográficos
Autores principales: Watanabe, Tomoaki, Kure, Akinori, Horiike, Tokumasa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9991595/
https://www.ncbi.nlm.nih.gov/pubmed/36799928
http://dx.doi.org/10.1093/gbe/evad026
_version_ 1784902185314680832
author Watanabe, Tomoaki
Kure, Akinori
Horiike, Tokumasa
author_facet Watanabe, Tomoaki
Kure, Akinori
Horiike, Tokumasa
author_sort Watanabe, Tomoaki
collection PubMed
description Species phylogenetic trees represent the evolutionary processes of organisms, and they are fundamental in evolutionary research. Therefore, new methods have been developed to obtain more reliable species phylogenetic trees. A highly reliable method is the construction of an ortholog data set based on sequence information of genes, which is then used to infer the species phylogenetic tree. However, although methods for constructing an ortholog data set for species phylogenetic analysis have been developed, they cannot remove some paralogs, which is necessary for reliable species phylogenetic inference. To address the limitations of current methods, we developed OrthoPhy, a program that excludes paralogs and constructs highly accurate ortholog data sets using taxonomic information dividing analyzed species into monophyletic groups. OrthoPhy can remove paralogs, detecting inconsistencies between taxonomic information and phylogenetic trees of candidate ortholog groups clustered by sequence similarity. Performance tests using evolutionary simulated sequences and real sequences of 40 bacteria revealed that the precision of ortholog inference by OrthoPhy is higher than that of existing programs. Additionally, the phylogenetic analysis of species was more accurate when performed using ortholog data sets constructed by OrthoPhy than that performed using data sets constructed by existing programs. Furthermore, we performed a benchmark test of the Quest for Orthologs using real sequence data and found that the concordance rate between the phylogenetic trees of orthologs inferred by OrthoPhy and those of species was higher than the rates obtained by other ortholog inference programs. Therefore, ortholog data sets constructed using OrthoPhy enabled a more accurate phylogenetic analysis of species than those constructed using the existing programs, and OrthoPhy can be used for the phylogenetic analysis of species even for distantly related species that have experienced many evolutionary events.
format Online
Article
Text
id pubmed-9991595
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-99915952023-03-08 OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information Watanabe, Tomoaki Kure, Akinori Horiike, Tokumasa Genome Biol Evol Article Species phylogenetic trees represent the evolutionary processes of organisms, and they are fundamental in evolutionary research. Therefore, new methods have been developed to obtain more reliable species phylogenetic trees. A highly reliable method is the construction of an ortholog data set based on sequence information of genes, which is then used to infer the species phylogenetic tree. However, although methods for constructing an ortholog data set for species phylogenetic analysis have been developed, they cannot remove some paralogs, which is necessary for reliable species phylogenetic inference. To address the limitations of current methods, we developed OrthoPhy, a program that excludes paralogs and constructs highly accurate ortholog data sets using taxonomic information dividing analyzed species into monophyletic groups. OrthoPhy can remove paralogs, detecting inconsistencies between taxonomic information and phylogenetic trees of candidate ortholog groups clustered by sequence similarity. Performance tests using evolutionary simulated sequences and real sequences of 40 bacteria revealed that the precision of ortholog inference by OrthoPhy is higher than that of existing programs. Additionally, the phylogenetic analysis of species was more accurate when performed using ortholog data sets constructed by OrthoPhy than that performed using data sets constructed by existing programs. Furthermore, we performed a benchmark test of the Quest for Orthologs using real sequence data and found that the concordance rate between the phylogenetic trees of orthologs inferred by OrthoPhy and those of species was higher than the rates obtained by other ortholog inference programs. Therefore, ortholog data sets constructed using OrthoPhy enabled a more accurate phylogenetic analysis of species than those constructed using the existing programs, and OrthoPhy can be used for the phylogenetic analysis of species even for distantly related species that have experienced many evolutionary events. Oxford University Press 2023-02-17 /pmc/articles/PMC9991595/ /pubmed/36799928 http://dx.doi.org/10.1093/gbe/evad026 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Article
Watanabe, Tomoaki
Kure, Akinori
Horiike, Tokumasa
OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information
title OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information
title_full OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information
title_fullStr OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information
title_full_unstemmed OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information
title_short OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information
title_sort orthophy: a program to construct ortholog data sets using taxonomic information
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9991595/
https://www.ncbi.nlm.nih.gov/pubmed/36799928
http://dx.doi.org/10.1093/gbe/evad026
work_keys_str_mv AT watanabetomoaki orthophyaprogramtoconstructorthologdatasetsusingtaxonomicinformation
AT kureakinori orthophyaprogramtoconstructorthologdatasetsusingtaxonomicinformation
AT horiiketokumasa orthophyaprogramtoconstructorthologdatasetsusingtaxonomicinformation