Cargando…

Systematic errors in orthology inference and their effects on evolutionary analyses

The availability of complete sets of genes from many organisms makes it possible to identify genes unique to (or lost from) certain clades. This information is used to reconstruct phylogenetic trees; identify genes involved in the evolution of clade specific novelties; and for phylostratigraphy—iden...

Descripción completa

Detalles Bibliográficos
Autores principales: Natsidis, Paschalis, Kapli, Paschalia, Schiffer, Philipp H., Telford, Maximilian J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7892920/
https://www.ncbi.nlm.nih.gov/pubmed/33659875
http://dx.doi.org/10.1016/j.isci.2021.102110
_version_ 1783652952316051456
author Natsidis, Paschalis
Kapli, Paschalia
Schiffer, Philipp H.
Telford, Maximilian J.
author_facet Natsidis, Paschalis
Kapli, Paschalia
Schiffer, Philipp H.
Telford, Maximilian J.
author_sort Natsidis, Paschalis
collection PubMed
description The availability of complete sets of genes from many organisms makes it possible to identify genes unique to (or lost from) certain clades. This information is used to reconstruct phylogenetic trees; identify genes involved in the evolution of clade specific novelties; and for phylostratigraphy—identifying ages of genes in a given species. These investigations rely on accurately predicted orthologs. Here we use simulation to produce sets of orthologs that experience no gains or losses. We show that errors in identifying orthologs increase with higher rates of evolution. We use the predicted sets of orthologs, with errors, to reconstruct phylogenetic trees; to count gains and losses; and for phylostratigraphy. Our simulated data, containing information only from errors in orthology prediction, closely recapitulate findings from empirical data. We suggest published downstream analyses must be informed to a large extent by errors in orthology prediction that mimic expected patterns of gene evolution.
format Online
Article
Text
id pubmed-7892920
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-78929202021-03-02 Systematic errors in orthology inference and their effects on evolutionary analyses Natsidis, Paschalis Kapli, Paschalia Schiffer, Philipp H. Telford, Maximilian J. iScience Article The availability of complete sets of genes from many organisms makes it possible to identify genes unique to (or lost from) certain clades. This information is used to reconstruct phylogenetic trees; identify genes involved in the evolution of clade specific novelties; and for phylostratigraphy—identifying ages of genes in a given species. These investigations rely on accurately predicted orthologs. Here we use simulation to produce sets of orthologs that experience no gains or losses. We show that errors in identifying orthologs increase with higher rates of evolution. We use the predicted sets of orthologs, with errors, to reconstruct phylogenetic trees; to count gains and losses; and for phylostratigraphy. Our simulated data, containing information only from errors in orthology prediction, closely recapitulate findings from empirical data. We suggest published downstream analyses must be informed to a large extent by errors in orthology prediction that mimic expected patterns of gene evolution. Elsevier 2021-01-28 /pmc/articles/PMC7892920/ /pubmed/33659875 http://dx.doi.org/10.1016/j.isci.2021.102110 Text en © 2021 The Author(s) http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Natsidis, Paschalis
Kapli, Paschalia
Schiffer, Philipp H.
Telford, Maximilian J.
Systematic errors in orthology inference and their effects on evolutionary analyses
title Systematic errors in orthology inference and their effects on evolutionary analyses
title_full Systematic errors in orthology inference and their effects on evolutionary analyses
title_fullStr Systematic errors in orthology inference and their effects on evolutionary analyses
title_full_unstemmed Systematic errors in orthology inference and their effects on evolutionary analyses
title_short Systematic errors in orthology inference and their effects on evolutionary analyses
title_sort systematic errors in orthology inference and their effects on evolutionary analyses
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7892920/
https://www.ncbi.nlm.nih.gov/pubmed/33659875
http://dx.doi.org/10.1016/j.isci.2021.102110
work_keys_str_mv AT natsidispaschalis systematicerrorsinorthologyinferenceandtheireffectsonevolutionaryanalyses
AT kaplipaschalia systematicerrorsinorthologyinferenceandtheireffectsonevolutionaryanalyses
AT schifferphilipph systematicerrorsinorthologyinferenceandtheireffectsonevolutionaryanalyses
AT telfordmaximilianj systematicerrorsinorthologyinferenceandtheireffectsonevolutionaryanalyses