Cargando…
Fitness Effects of Mutations: An Assessment of PROVEAN Predictions Using Mutation Accumulation Data
Predicting fitness in natural populations is a major challenge in biology. It may be possible to leverage fast-accumulating genomic data sets to infer the fitness effects of mutant alleles, allowing evolutionary questions to be addressed in any organism. In this paper, we investigate the utility of...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8790079/ https://www.ncbi.nlm.nih.gov/pubmed/35038732 http://dx.doi.org/10.1093/gbe/evac004 |
_version_ | 1784639912716271616 |
---|---|
author | Sandell, Linnea Sharp, Nathaniel P |
author_facet | Sandell, Linnea Sharp, Nathaniel P |
author_sort | Sandell, Linnea |
collection | PubMed |
description | Predicting fitness in natural populations is a major challenge in biology. It may be possible to leverage fast-accumulating genomic data sets to infer the fitness effects of mutant alleles, allowing evolutionary questions to be addressed in any organism. In this paper, we investigate the utility of one such tool, called PROVEAN. This program compares a query sequence with existing data to provide an alignment-based score for any protein variant, with scores categorized as neutral or deleterious based on a pre-set threshold. PROVEAN has been used widely in evolutionary studies, for example, to estimate mutation load in natural populations, but has not been formally tested as a predictor of aggregate mutational effects on fitness. Using three large published data sets on the genome sequences of laboratory mutation accumulation lines, we assessed how well PROVEAN predicted the actual fitness patterns observed, relative to other metrics. In most cases, we find that a simple count of the total number of mutant proteins is a better predictor of fitness than the number of proteins with variants scored as deleterious by PROVEAN. We also find that the sum of all mutant protein scores explains variation in fitness better than the number of mutant proteins in one of the data sets. We discuss the implications of these results for studies of populations in the wild. |
format | Online Article Text |
id | pubmed-8790079 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-87900792022-01-26 Fitness Effects of Mutations: An Assessment of PROVEAN Predictions Using Mutation Accumulation Data Sandell, Linnea Sharp, Nathaniel P Genome Biol Evol Research Article Predicting fitness in natural populations is a major challenge in biology. It may be possible to leverage fast-accumulating genomic data sets to infer the fitness effects of mutant alleles, allowing evolutionary questions to be addressed in any organism. In this paper, we investigate the utility of one such tool, called PROVEAN. This program compares a query sequence with existing data to provide an alignment-based score for any protein variant, with scores categorized as neutral or deleterious based on a pre-set threshold. PROVEAN has been used widely in evolutionary studies, for example, to estimate mutation load in natural populations, but has not been formally tested as a predictor of aggregate mutational effects on fitness. Using three large published data sets on the genome sequences of laboratory mutation accumulation lines, we assessed how well PROVEAN predicted the actual fitness patterns observed, relative to other metrics. In most cases, we find that a simple count of the total number of mutant proteins is a better predictor of fitness than the number of proteins with variants scored as deleterious by PROVEAN. We also find that the sum of all mutant protein scores explains variation in fitness better than the number of mutant proteins in one of the data sets. We discuss the implications of these results for studies of populations in the wild. Oxford University Press 2022-01-17 /pmc/articles/PMC8790079/ /pubmed/35038732 http://dx.doi.org/10.1093/gbe/evac004 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Sandell, Linnea Sharp, Nathaniel P Fitness Effects of Mutations: An Assessment of PROVEAN Predictions Using Mutation Accumulation Data |
title | Fitness Effects of Mutations: An Assessment of PROVEAN Predictions Using Mutation Accumulation Data |
title_full | Fitness Effects of Mutations: An Assessment of PROVEAN Predictions Using Mutation Accumulation Data |
title_fullStr | Fitness Effects of Mutations: An Assessment of PROVEAN Predictions Using Mutation Accumulation Data |
title_full_unstemmed | Fitness Effects of Mutations: An Assessment of PROVEAN Predictions Using Mutation Accumulation Data |
title_short | Fitness Effects of Mutations: An Assessment of PROVEAN Predictions Using Mutation Accumulation Data |
title_sort | fitness effects of mutations: an assessment of provean predictions using mutation accumulation data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8790079/ https://www.ncbi.nlm.nih.gov/pubmed/35038732 http://dx.doi.org/10.1093/gbe/evac004 |
work_keys_str_mv | AT sandelllinnea fitnesseffectsofmutationsanassessmentofproveanpredictionsusingmutationaccumulationdata AT sharpnathanielp fitnesseffectsofmutationsanassessmentofproveanpredictionsusingmutationaccumulationdata |