Cargando…
Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models
BACKGROUND: Sites of positive selection are identified by comparing observed evolutionary patterns to those expected under a null model for evolution in the absence of such selection. For protein-coding genes, the most common null model is that nonsynonymous and synonymous mutations fix at equal rat...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5240389/ https://www.ncbi.nlm.nih.gov/pubmed/28095902 http://dx.doi.org/10.1186/s13062-016-0172-z |
_version_ | 1782496060647669760 |
---|---|
author | Bloom, Jesse D. |
author_facet | Bloom, Jesse D. |
author_sort | Bloom, Jesse D. |
collection | PubMed |
description | BACKGROUND: Sites of positive selection are identified by comparing observed evolutionary patterns to those expected under a null model for evolution in the absence of such selection. For protein-coding genes, the most common null model is that nonsynonymous and synonymous mutations fix at equal rates; this unrealistic model has limited power to detect many interesting forms of selection. RESULTS: I describe a new approach that uses a null model based on experimental measurements of a gene’s site-specific amino-acid preferences generated by deep mutational scanning in the lab. This null model makes it possible to identify both diversifying selection for repeated amino-acid change and differential selection for mutations to amino acids that are unexpected given the measurements made in the lab. I show that this approach identifies sites of adaptive substitutions in four genes (lactamase, Gal4, influenza nucleoprotein, and influenza hemagglutinin) far better than a comparable method that simply compares the rates of nonsynonymous and synonymous substitutions. CONCLUSIONS: As rapid increases in biological data enable increasingly nuanced descriptions of the constraints on individual protein sites, approaches like the one here can improve our ability to identify many interesting forms of selection in natural sequences. REVIEWERS: This article was reviewed by Sebastian Maurer-Stroh, Olivier Tenaillon, and Tal Pupko. All three reviewers are members of the Biology Direct editorial board. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13062-016-0172-z) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5240389 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-52403892017-01-19 Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models Bloom, Jesse D. Biol Direct Research BACKGROUND: Sites of positive selection are identified by comparing observed evolutionary patterns to those expected under a null model for evolution in the absence of such selection. For protein-coding genes, the most common null model is that nonsynonymous and synonymous mutations fix at equal rates; this unrealistic model has limited power to detect many interesting forms of selection. RESULTS: I describe a new approach that uses a null model based on experimental measurements of a gene’s site-specific amino-acid preferences generated by deep mutational scanning in the lab. This null model makes it possible to identify both diversifying selection for repeated amino-acid change and differential selection for mutations to amino acids that are unexpected given the measurements made in the lab. I show that this approach identifies sites of adaptive substitutions in four genes (lactamase, Gal4, influenza nucleoprotein, and influenza hemagglutinin) far better than a comparable method that simply compares the rates of nonsynonymous and synonymous substitutions. CONCLUSIONS: As rapid increases in biological data enable increasingly nuanced descriptions of the constraints on individual protein sites, approaches like the one here can improve our ability to identify many interesting forms of selection in natural sequences. REVIEWERS: This article was reviewed by Sebastian Maurer-Stroh, Olivier Tenaillon, and Tal Pupko. All three reviewers are members of the Biology Direct editorial board. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13062-016-0172-z) contains supplementary material, which is available to authorized users. BioMed Central 2017-01-17 /pmc/articles/PMC5240389/ /pubmed/28095902 http://dx.doi.org/10.1186/s13062-016-0172-z Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Bloom, Jesse D. Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models |
title | Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models |
title_full | Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models |
title_fullStr | Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models |
title_full_unstemmed | Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models |
title_short | Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models |
title_sort | identification of positive selection in genes is greatly improved by using experimentally informed site-specific models |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5240389/ https://www.ncbi.nlm.nih.gov/pubmed/28095902 http://dx.doi.org/10.1186/s13062-016-0172-z |
work_keys_str_mv | AT bloomjessed identificationofpositiveselectioningenesisgreatlyimprovedbyusingexperimentallyinformedsitespecificmodels |