Cargando…

An evolution-based high-fidelity method of epistasis measurement: Theory and application to influenza

Linkage effects in a multi-locus population strongly influence its evolution. The models based on the traveling wave approach enable us to predict the average speed of evolution and the statistics of phylogeny. However, predicting statistically the evolution of specific sites and pairs of sites in t...

Descripción completa

Detalles Bibliográficos
Autores principales: Pedruzzi, Gabriele, Rouzine, Igor M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8248644/
https://www.ncbi.nlm.nih.gov/pubmed/34153082
http://dx.doi.org/10.1371/journal.ppat.1009669
_version_ 1783716766031020032
author Pedruzzi, Gabriele
Rouzine, Igor M.
author_facet Pedruzzi, Gabriele
Rouzine, Igor M.
author_sort Pedruzzi, Gabriele
collection PubMed
description Linkage effects in a multi-locus population strongly influence its evolution. The models based on the traveling wave approach enable us to predict the average speed of evolution and the statistics of phylogeny. However, predicting statistically the evolution of specific sites and pairs of sites in the multi-locus context remains a mathematical challenge. In particular, the effects of epistasis, the interaction of gene regions contributing to phenotype, is difficult to predict theoretically and detect experimentally in sequence data. A large number of false-positive interactions arises from stochastic linkage effects and indirect interactions, which mask true epistatic interactions. Here we develop a proof-of-principle method to filter out false-positive interactions. We start by demonstrating that the averaging of haplotype frequencies over multiple independent populations is necessary but not sufficient for epistatic detection, because it still leaves high numbers of false-positive interactions. To compensate for the residual stochastic noise, we develop a three-way haplotype method isolating true interactions. The fidelity of the method is confirmed analytically and on simulated genetic sequences evolved with a known epistatic network. The method is then applied to a large sequence database of neurominidase protein of influenza A H1N1 obtained from various geographic locations to infer the epistatic network responsible for the difference between the pre-pandemic virus and the pandemic strain of 2009. These results present a simple and reliable technique to measure epistatic interactions of any sign from sequence data.
format Online
Article
Text
id pubmed-8248644
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-82486442021-07-09 An evolution-based high-fidelity method of epistasis measurement: Theory and application to influenza Pedruzzi, Gabriele Rouzine, Igor M. PLoS Pathog Research Article Linkage effects in a multi-locus population strongly influence its evolution. The models based on the traveling wave approach enable us to predict the average speed of evolution and the statistics of phylogeny. However, predicting statistically the evolution of specific sites and pairs of sites in the multi-locus context remains a mathematical challenge. In particular, the effects of epistasis, the interaction of gene regions contributing to phenotype, is difficult to predict theoretically and detect experimentally in sequence data. A large number of false-positive interactions arises from stochastic linkage effects and indirect interactions, which mask true epistatic interactions. Here we develop a proof-of-principle method to filter out false-positive interactions. We start by demonstrating that the averaging of haplotype frequencies over multiple independent populations is necessary but not sufficient for epistatic detection, because it still leaves high numbers of false-positive interactions. To compensate for the residual stochastic noise, we develop a three-way haplotype method isolating true interactions. The fidelity of the method is confirmed analytically and on simulated genetic sequences evolved with a known epistatic network. The method is then applied to a large sequence database of neurominidase protein of influenza A H1N1 obtained from various geographic locations to infer the epistatic network responsible for the difference between the pre-pandemic virus and the pandemic strain of 2009. These results present a simple and reliable technique to measure epistatic interactions of any sign from sequence data. Public Library of Science 2021-06-21 /pmc/articles/PMC8248644/ /pubmed/34153082 http://dx.doi.org/10.1371/journal.ppat.1009669 Text en © 2021 Pedruzzi, Rouzine https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Pedruzzi, Gabriele
Rouzine, Igor M.
An evolution-based high-fidelity method of epistasis measurement: Theory and application to influenza
title An evolution-based high-fidelity method of epistasis measurement: Theory and application to influenza
title_full An evolution-based high-fidelity method of epistasis measurement: Theory and application to influenza
title_fullStr An evolution-based high-fidelity method of epistasis measurement: Theory and application to influenza
title_full_unstemmed An evolution-based high-fidelity method of epistasis measurement: Theory and application to influenza
title_short An evolution-based high-fidelity method of epistasis measurement: Theory and application to influenza
title_sort evolution-based high-fidelity method of epistasis measurement: theory and application to influenza
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8248644/
https://www.ncbi.nlm.nih.gov/pubmed/34153082
http://dx.doi.org/10.1371/journal.ppat.1009669
work_keys_str_mv AT pedruzzigabriele anevolutionbasedhighfidelitymethodofepistasismeasurementtheoryandapplicationtoinfluenza
AT rouzineigorm anevolutionbasedhighfidelitymethodofepistasismeasurementtheoryandapplicationtoinfluenza
AT pedruzzigabriele evolutionbasedhighfidelitymethodofepistasismeasurementtheoryandapplicationtoinfluenza
AT rouzineigorm evolutionbasedhighfidelitymethodofepistasismeasurementtheoryandapplicationtoinfluenza