Cargando…

Detecting macroevolutionary genotype–phenotype associations using error-corrected rates of protein convergence

On macroevolutionary timescales, extensive mutations and phylogenetic uncertainty mask the signals of genotype–phenotype associations underlying convergent evolution. To overcome this problem, we extended the widely used framework of non-synonymous to synonymous substitution rate ratios and develope...

Descripción completa

Detalles Bibliográficos
Autores principales: Fukushima, Kenji, Pollock, David D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9834058/
https://www.ncbi.nlm.nih.gov/pubmed/36604553
http://dx.doi.org/10.1038/s41559-022-01932-7
_version_ 1784868376386994176
author Fukushima, Kenji
Pollock, David D.
author_facet Fukushima, Kenji
Pollock, David D.
author_sort Fukushima, Kenji
collection PubMed
description On macroevolutionary timescales, extensive mutations and phylogenetic uncertainty mask the signals of genotype–phenotype associations underlying convergent evolution. To overcome this problem, we extended the widely used framework of non-synonymous to synonymous substitution rate ratios and developed the novel metric ω(C), which measures the error-corrected convergence rate of protein evolution. While ω(C) distinguishes natural selection from genetic noise and phylogenetic errors in simulation and real examples, its accuracy allows an exploratory genome-wide search of adaptive molecular convergence without phenotypic hypothesis or candidate genes. Using gene expression data, we explored over 20 million branch combinations in vertebrate genes and identified the joint convergence of expression patterns and protein sequences with amino acid substitutions in functionally important sites, providing hypotheses on undiscovered phenotypes. We further extended our method with a heuristic algorithm to detect highly repetitive convergence among computationally non-trivial higher-order phylogenetic combinations. Our approach allows bidirectional searches for genotype–phenotype associations, even in lineages that diverged for hundreds of millions of years.
format Online
Article
Text
id pubmed-9834058
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-98340582023-01-13 Detecting macroevolutionary genotype–phenotype associations using error-corrected rates of protein convergence Fukushima, Kenji Pollock, David D. Nat Ecol Evol Article On macroevolutionary timescales, extensive mutations and phylogenetic uncertainty mask the signals of genotype–phenotype associations underlying convergent evolution. To overcome this problem, we extended the widely used framework of non-synonymous to synonymous substitution rate ratios and developed the novel metric ω(C), which measures the error-corrected convergence rate of protein evolution. While ω(C) distinguishes natural selection from genetic noise and phylogenetic errors in simulation and real examples, its accuracy allows an exploratory genome-wide search of adaptive molecular convergence without phenotypic hypothesis or candidate genes. Using gene expression data, we explored over 20 million branch combinations in vertebrate genes and identified the joint convergence of expression patterns and protein sequences with amino acid substitutions in functionally important sites, providing hypotheses on undiscovered phenotypes. We further extended our method with a heuristic algorithm to detect highly repetitive convergence among computationally non-trivial higher-order phylogenetic combinations. Our approach allows bidirectional searches for genotype–phenotype associations, even in lineages that diverged for hundreds of millions of years. Nature Publishing Group UK 2023-01-05 2023 /pmc/articles/PMC9834058/ /pubmed/36604553 http://dx.doi.org/10.1038/s41559-022-01932-7 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Fukushima, Kenji
Pollock, David D.
Detecting macroevolutionary genotype–phenotype associations using error-corrected rates of protein convergence
title Detecting macroevolutionary genotype–phenotype associations using error-corrected rates of protein convergence
title_full Detecting macroevolutionary genotype–phenotype associations using error-corrected rates of protein convergence
title_fullStr Detecting macroevolutionary genotype–phenotype associations using error-corrected rates of protein convergence
title_full_unstemmed Detecting macroevolutionary genotype–phenotype associations using error-corrected rates of protein convergence
title_short Detecting macroevolutionary genotype–phenotype associations using error-corrected rates of protein convergence
title_sort detecting macroevolutionary genotype–phenotype associations using error-corrected rates of protein convergence
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9834058/
https://www.ncbi.nlm.nih.gov/pubmed/36604553
http://dx.doi.org/10.1038/s41559-022-01932-7
work_keys_str_mv AT fukushimakenji detectingmacroevolutionarygenotypephenotypeassociationsusingerrorcorrectedratesofproteinconvergence
AT pollockdavidd detectingmacroevolutionarygenotypephenotypeassociationsusingerrorcorrectedratesofproteinconvergence