Cargando…

Inferring a complete genotype-phenotype map from a small number of measured phenotypes

Understanding evolution requires detailed knowledge of genotype-phenotype maps; however, it can be a herculean task to measure every phenotype in a combinatorial map. We have developed a computational strategy to predict the missing phenotypes from an incomplete, combinatorial genotype-phenotype map...

Descripción completa

Detalles Bibliográficos
Autores principales: Sailer, Zachary R., Shafik, Sarah H., Summers, Robert L., Joule, Alex, Patterson-Robert, Alice, Martin, Rowena E., Harms, Michael J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7546491/
https://www.ncbi.nlm.nih.gov/pubmed/32991585
http://dx.doi.org/10.1371/journal.pcbi.1008243
_version_ 1783592239839051776
author Sailer, Zachary R.
Shafik, Sarah H.
Summers, Robert L.
Joule, Alex
Patterson-Robert, Alice
Martin, Rowena E.
Harms, Michael J.
author_facet Sailer, Zachary R.
Shafik, Sarah H.
Summers, Robert L.
Joule, Alex
Patterson-Robert, Alice
Martin, Rowena E.
Harms, Michael J.
author_sort Sailer, Zachary R.
collection PubMed
description Understanding evolution requires detailed knowledge of genotype-phenotype maps; however, it can be a herculean task to measure every phenotype in a combinatorial map. We have developed a computational strategy to predict the missing phenotypes from an incomplete, combinatorial genotype-phenotype map. As a test case, we used an incomplete genotype-phenotype dataset previously generated for the malaria parasite’s ‘chloroquine resistance transporter’ (PfCRT). Wild-type PfCRT (PfCRT(3D7)) lacks significant chloroquine (CQ) transport activity, but the introduction of the eight mutations present in the ‘Dd2’ isoform of PfCRT (PfCRT(Dd2)) enables the protein to transport CQ away from its site of antimalarial action. This gain of a transport function imparts CQ resistance to the parasite. A combinatorial map between PfCRT(3D7) and PfCRT(Dd2) consists of 256 genotypes, of which only 52 have had their CQ transport activities measured through expression in the Xenopus laevis oocyte. We trained a statistical model with these 52 measurements to infer the CQ transport activity for the remaining 204 combinatorial genotypes between PfCRT(3D7) and PfCRT(Dd2). Our best-performing model incorporated a binary classifier, a nonlinear scale, and additive effects for each mutation. The addition of specific pairwise- and high-order-epistatic coefficients decreased the predictive power of the model. We evaluated our predictions by experimentally measuring the CQ transport activities of 24 additional PfCRT genotypes. The R(2) value between our predicted and newly-measured phenotypes was 0.90. We then used the model to probe the accessibility of evolutionary trajectories through the map. Approximately 1% of the possible trajectories between PfCRT(3D7) and PfCRT(Dd2) are accessible; however, none of the trajectories entailed eight successive increases in CQ transport activity. These results demonstrate that phenotypes can be inferred with known uncertainty from a partial genotype-phenotype dataset. We also validated our approach against a collection of previously published genotype-phenotype maps. The model therefore appears general and should be applicable to a large number of genotype-phenotype maps.
format Online
Article
Text
id pubmed-7546491
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-75464912020-10-19 Inferring a complete genotype-phenotype map from a small number of measured phenotypes Sailer, Zachary R. Shafik, Sarah H. Summers, Robert L. Joule, Alex Patterson-Robert, Alice Martin, Rowena E. Harms, Michael J. PLoS Comput Biol Research Article Understanding evolution requires detailed knowledge of genotype-phenotype maps; however, it can be a herculean task to measure every phenotype in a combinatorial map. We have developed a computational strategy to predict the missing phenotypes from an incomplete, combinatorial genotype-phenotype map. As a test case, we used an incomplete genotype-phenotype dataset previously generated for the malaria parasite’s ‘chloroquine resistance transporter’ (PfCRT). Wild-type PfCRT (PfCRT(3D7)) lacks significant chloroquine (CQ) transport activity, but the introduction of the eight mutations present in the ‘Dd2’ isoform of PfCRT (PfCRT(Dd2)) enables the protein to transport CQ away from its site of antimalarial action. This gain of a transport function imparts CQ resistance to the parasite. A combinatorial map between PfCRT(3D7) and PfCRT(Dd2) consists of 256 genotypes, of which only 52 have had their CQ transport activities measured through expression in the Xenopus laevis oocyte. We trained a statistical model with these 52 measurements to infer the CQ transport activity for the remaining 204 combinatorial genotypes between PfCRT(3D7) and PfCRT(Dd2). Our best-performing model incorporated a binary classifier, a nonlinear scale, and additive effects for each mutation. The addition of specific pairwise- and high-order-epistatic coefficients decreased the predictive power of the model. We evaluated our predictions by experimentally measuring the CQ transport activities of 24 additional PfCRT genotypes. The R(2) value between our predicted and newly-measured phenotypes was 0.90. We then used the model to probe the accessibility of evolutionary trajectories through the map. Approximately 1% of the possible trajectories between PfCRT(3D7) and PfCRT(Dd2) are accessible; however, none of the trajectories entailed eight successive increases in CQ transport activity. These results demonstrate that phenotypes can be inferred with known uncertainty from a partial genotype-phenotype dataset. We also validated our approach against a collection of previously published genotype-phenotype maps. The model therefore appears general and should be applicable to a large number of genotype-phenotype maps. Public Library of Science 2020-09-29 /pmc/articles/PMC7546491/ /pubmed/32991585 http://dx.doi.org/10.1371/journal.pcbi.1008243 Text en © 2020 Sailer et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Sailer, Zachary R.
Shafik, Sarah H.
Summers, Robert L.
Joule, Alex
Patterson-Robert, Alice
Martin, Rowena E.
Harms, Michael J.
Inferring a complete genotype-phenotype map from a small number of measured phenotypes
title Inferring a complete genotype-phenotype map from a small number of measured phenotypes
title_full Inferring a complete genotype-phenotype map from a small number of measured phenotypes
title_fullStr Inferring a complete genotype-phenotype map from a small number of measured phenotypes
title_full_unstemmed Inferring a complete genotype-phenotype map from a small number of measured phenotypes
title_short Inferring a complete genotype-phenotype map from a small number of measured phenotypes
title_sort inferring a complete genotype-phenotype map from a small number of measured phenotypes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7546491/
https://www.ncbi.nlm.nih.gov/pubmed/32991585
http://dx.doi.org/10.1371/journal.pcbi.1008243
work_keys_str_mv AT sailerzacharyr inferringacompletegenotypephenotypemapfromasmallnumberofmeasuredphenotypes
AT shafiksarahh inferringacompletegenotypephenotypemapfromasmallnumberofmeasuredphenotypes
AT summersrobertl inferringacompletegenotypephenotypemapfromasmallnumberofmeasuredphenotypes
AT joulealex inferringacompletegenotypephenotypemapfromasmallnumberofmeasuredphenotypes
AT pattersonrobertalice inferringacompletegenotypephenotypemapfromasmallnumberofmeasuredphenotypes
AT martinrowenae inferringacompletegenotypephenotypemapfromasmallnumberofmeasuredphenotypes
AT harmsmichaelj inferringacompletegenotypephenotypemapfromasmallnumberofmeasuredphenotypes