Cargando…

Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models

BACKGROUND: Sites of positive selection are identified by comparing observed evolutionary patterns to those expected under a null model for evolution in the absence of such selection. For protein-coding genes, the most common null model is that nonsynonymous and synonymous mutations fix at equal rat...

Descripción completa

Detalles Bibliográficos
Autor principal: Bloom, Jesse D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5240389/
https://www.ncbi.nlm.nih.gov/pubmed/28095902
http://dx.doi.org/10.1186/s13062-016-0172-z
_version_ 1782496060647669760
author Bloom, Jesse D.
author_facet Bloom, Jesse D.
author_sort Bloom, Jesse D.
collection PubMed
description BACKGROUND: Sites of positive selection are identified by comparing observed evolutionary patterns to those expected under a null model for evolution in the absence of such selection. For protein-coding genes, the most common null model is that nonsynonymous and synonymous mutations fix at equal rates; this unrealistic model has limited power to detect many interesting forms of selection. RESULTS: I describe a new approach that uses a null model based on experimental measurements of a gene’s site-specific amino-acid preferences generated by deep mutational scanning in the lab. This null model makes it possible to identify both diversifying selection for repeated amino-acid change and differential selection for mutations to amino acids that are unexpected given the measurements made in the lab. I show that this approach identifies sites of adaptive substitutions in four genes (lactamase, Gal4, influenza nucleoprotein, and influenza hemagglutinin) far better than a comparable method that simply compares the rates of nonsynonymous and synonymous substitutions. CONCLUSIONS: As rapid increases in biological data enable increasingly nuanced descriptions of the constraints on individual protein sites, approaches like the one here can improve our ability to identify many interesting forms of selection in natural sequences. REVIEWERS: This article was reviewed by Sebastian Maurer-Stroh, Olivier Tenaillon, and Tal Pupko. All three reviewers are members of the Biology Direct editorial board. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13062-016-0172-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5240389
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-52403892017-01-19 Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models Bloom, Jesse D. Biol Direct Research BACKGROUND: Sites of positive selection are identified by comparing observed evolutionary patterns to those expected under a null model for evolution in the absence of such selection. For protein-coding genes, the most common null model is that nonsynonymous and synonymous mutations fix at equal rates; this unrealistic model has limited power to detect many interesting forms of selection. RESULTS: I describe a new approach that uses a null model based on experimental measurements of a gene’s site-specific amino-acid preferences generated by deep mutational scanning in the lab. This null model makes it possible to identify both diversifying selection for repeated amino-acid change and differential selection for mutations to amino acids that are unexpected given the measurements made in the lab. I show that this approach identifies sites of adaptive substitutions in four genes (lactamase, Gal4, influenza nucleoprotein, and influenza hemagglutinin) far better than a comparable method that simply compares the rates of nonsynonymous and synonymous substitutions. CONCLUSIONS: As rapid increases in biological data enable increasingly nuanced descriptions of the constraints on individual protein sites, approaches like the one here can improve our ability to identify many interesting forms of selection in natural sequences. REVIEWERS: This article was reviewed by Sebastian Maurer-Stroh, Olivier Tenaillon, and Tal Pupko. All three reviewers are members of the Biology Direct editorial board. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13062-016-0172-z) contains supplementary material, which is available to authorized users. BioMed Central 2017-01-17 /pmc/articles/PMC5240389/ /pubmed/28095902 http://dx.doi.org/10.1186/s13062-016-0172-z Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Bloom, Jesse D.
Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models
title Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models
title_full Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models
title_fullStr Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models
title_full_unstemmed Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models
title_short Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models
title_sort identification of positive selection in genes is greatly improved by using experimentally informed site-specific models
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5240389/
https://www.ncbi.nlm.nih.gov/pubmed/28095902
http://dx.doi.org/10.1186/s13062-016-0172-z
work_keys_str_mv AT bloomjessed identificationofpositiveselectioningenesisgreatlyimprovedbyusingexperimentallyinformedsitespecificmodels