Cargando…

Fitness Effects of Mutations: An Assessment of PROVEAN Predictions Using Mutation Accumulation Data

Predicting fitness in natural populations is a major challenge in biology. It may be possible to leverage fast-accumulating genomic data sets to infer the fitness effects of mutant alleles, allowing evolutionary questions to be addressed in any organism. In this paper, we investigate the utility of...

Descripción completa

Detalles Bibliográficos
Autores principales: Sandell, Linnea, Sharp, Nathaniel P
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8790079/
https://www.ncbi.nlm.nih.gov/pubmed/35038732
http://dx.doi.org/10.1093/gbe/evac004
_version_ 1784639912716271616
author Sandell, Linnea
Sharp, Nathaniel P
author_facet Sandell, Linnea
Sharp, Nathaniel P
author_sort Sandell, Linnea
collection PubMed
description Predicting fitness in natural populations is a major challenge in biology. It may be possible to leverage fast-accumulating genomic data sets to infer the fitness effects of mutant alleles, allowing evolutionary questions to be addressed in any organism. In this paper, we investigate the utility of one such tool, called PROVEAN. This program compares a query sequence with existing data to provide an alignment-based score for any protein variant, with scores categorized as neutral or deleterious based on a pre-set threshold. PROVEAN has been used widely in evolutionary studies, for example, to estimate mutation load in natural populations, but has not been formally tested as a predictor of aggregate mutational effects on fitness. Using three large published data sets on the genome sequences of laboratory mutation accumulation lines, we assessed how well PROVEAN predicted the actual fitness patterns observed, relative to other metrics. In most cases, we find that a simple count of the total number of mutant proteins is a better predictor of fitness than the number of proteins with variants scored as deleterious by PROVEAN. We also find that the sum of all mutant protein scores explains variation in fitness better than the number of mutant proteins in one of the data sets. We discuss the implications of these results for studies of populations in the wild.
format Online
Article
Text
id pubmed-8790079
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-87900792022-01-26 Fitness Effects of Mutations: An Assessment of PROVEAN Predictions Using Mutation Accumulation Data Sandell, Linnea Sharp, Nathaniel P Genome Biol Evol Research Article Predicting fitness in natural populations is a major challenge in biology. It may be possible to leverage fast-accumulating genomic data sets to infer the fitness effects of mutant alleles, allowing evolutionary questions to be addressed in any organism. In this paper, we investigate the utility of one such tool, called PROVEAN. This program compares a query sequence with existing data to provide an alignment-based score for any protein variant, with scores categorized as neutral or deleterious based on a pre-set threshold. PROVEAN has been used widely in evolutionary studies, for example, to estimate mutation load in natural populations, but has not been formally tested as a predictor of aggregate mutational effects on fitness. Using three large published data sets on the genome sequences of laboratory mutation accumulation lines, we assessed how well PROVEAN predicted the actual fitness patterns observed, relative to other metrics. In most cases, we find that a simple count of the total number of mutant proteins is a better predictor of fitness than the number of proteins with variants scored as deleterious by PROVEAN. We also find that the sum of all mutant protein scores explains variation in fitness better than the number of mutant proteins in one of the data sets. We discuss the implications of these results for studies of populations in the wild. Oxford University Press 2022-01-17 /pmc/articles/PMC8790079/ /pubmed/35038732 http://dx.doi.org/10.1093/gbe/evac004 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Sandell, Linnea
Sharp, Nathaniel P
Fitness Effects of Mutations: An Assessment of PROVEAN Predictions Using Mutation Accumulation Data
title Fitness Effects of Mutations: An Assessment of PROVEAN Predictions Using Mutation Accumulation Data
title_full Fitness Effects of Mutations: An Assessment of PROVEAN Predictions Using Mutation Accumulation Data
title_fullStr Fitness Effects of Mutations: An Assessment of PROVEAN Predictions Using Mutation Accumulation Data
title_full_unstemmed Fitness Effects of Mutations: An Assessment of PROVEAN Predictions Using Mutation Accumulation Data
title_short Fitness Effects of Mutations: An Assessment of PROVEAN Predictions Using Mutation Accumulation Data
title_sort fitness effects of mutations: an assessment of provean predictions using mutation accumulation data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8790079/
https://www.ncbi.nlm.nih.gov/pubmed/35038732
http://dx.doi.org/10.1093/gbe/evac004
work_keys_str_mv AT sandelllinnea fitnesseffectsofmutationsanassessmentofproveanpredictionsusingmutationaccumulationdata
AT sharpnathanielp fitnesseffectsofmutationsanassessmentofproveanpredictionsusingmutationaccumulationdata