Cargando…

Updated benchmarking of variant effect predictors using deep mutational scanning

The assessment of variant effect predictor (VEP) performance is fraught with biases introduced by benchmarking against clinical observations. In this study, building on our previous work, we use independently generated measurements of protein function from deep mutational scanning (DMS) experiments...

Descripción completa

Detalles Bibliográficos
Autores principales: Livesey, Benjamin J, Marsh, Joseph A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10407742/
https://www.ncbi.nlm.nih.gov/pubmed/37310135
http://dx.doi.org/10.15252/msb.202211474
_version_ 1785086031230402560
author Livesey, Benjamin J
Marsh, Joseph A
author_facet Livesey, Benjamin J
Marsh, Joseph A
author_sort Livesey, Benjamin J
collection PubMed
description The assessment of variant effect predictor (VEP) performance is fraught with biases introduced by benchmarking against clinical observations. In this study, building on our previous work, we use independently generated measurements of protein function from deep mutational scanning (DMS) experiments for 26 human proteins to benchmark 55 different VEPs, while introducing minimal data circularity. Many top‐performing VEPs are unsupervised methods including EVE, DeepSequence and ESM‐1v, a protein language model that ranked first overall. However, the strong performance of recent supervised VEPs, in particular VARITY, shows that developers are taking data circularity and bias issues seriously. We also assess the performance of DMS and unsupervised VEPs for discriminating between known pathogenic and putatively benign missense variants. Our findings are mixed, demonstrating that some DMS datasets perform exceptionally at variant classification, while others are poor. Notably, we observe a striking correlation between VEP agreement with DMS data and performance in identifying clinically relevant variants, strongly supporting the validity of our rankings and the utility of DMS for independent benchmarking.
format Online
Article
Text
id pubmed-10407742
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-104077422023-08-09 Updated benchmarking of variant effect predictors using deep mutational scanning Livesey, Benjamin J Marsh, Joseph A Mol Syst Biol Articles The assessment of variant effect predictor (VEP) performance is fraught with biases introduced by benchmarking against clinical observations. In this study, building on our previous work, we use independently generated measurements of protein function from deep mutational scanning (DMS) experiments for 26 human proteins to benchmark 55 different VEPs, while introducing minimal data circularity. Many top‐performing VEPs are unsupervised methods including EVE, DeepSequence and ESM‐1v, a protein language model that ranked first overall. However, the strong performance of recent supervised VEPs, in particular VARITY, shows that developers are taking data circularity and bias issues seriously. We also assess the performance of DMS and unsupervised VEPs for discriminating between known pathogenic and putatively benign missense variants. Our findings are mixed, demonstrating that some DMS datasets perform exceptionally at variant classification, while others are poor. Notably, we observe a striking correlation between VEP agreement with DMS data and performance in identifying clinically relevant variants, strongly supporting the validity of our rankings and the utility of DMS for independent benchmarking. John Wiley and Sons Inc. 2023-06-13 /pmc/articles/PMC10407742/ /pubmed/37310135 http://dx.doi.org/10.15252/msb.202211474 Text en © 2023 The Authors. Published under the terms of the CC BY 4.0 license. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Livesey, Benjamin J
Marsh, Joseph A
Updated benchmarking of variant effect predictors using deep mutational scanning
title Updated benchmarking of variant effect predictors using deep mutational scanning
title_full Updated benchmarking of variant effect predictors using deep mutational scanning
title_fullStr Updated benchmarking of variant effect predictors using deep mutational scanning
title_full_unstemmed Updated benchmarking of variant effect predictors using deep mutational scanning
title_short Updated benchmarking of variant effect predictors using deep mutational scanning
title_sort updated benchmarking of variant effect predictors using deep mutational scanning
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10407742/
https://www.ncbi.nlm.nih.gov/pubmed/37310135
http://dx.doi.org/10.15252/msb.202211474
work_keys_str_mv AT liveseybenjaminj updatedbenchmarkingofvarianteffectpredictorsusingdeepmutationalscanning
AT marshjosepha updatedbenchmarkingofvarianteffectpredictorsusingdeepmutationalscanning