Cargando…

Comparative Performance of High-Throughput Methods for Protein pK(a) Predictions

[Image: see text] The medically relevant field of protein-based therapeutics has triggered a demand for protein engineering in different pH environments of biological relevance. In silico engineering workflows typically employ high-throughput screening campaigns that require evaluating large sets of...

Descripción completa

Detalles Bibliográficos
Autores principales: Wei, Wanlei, Hogues, Hervé, Sulea, Traian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10466379/
https://www.ncbi.nlm.nih.gov/pubmed/37549424
http://dx.doi.org/10.1021/acs.jcim.3c00165
_version_ 1785098870028500992
author Wei, Wanlei
Hogues, Hervé
Sulea, Traian
author_facet Wei, Wanlei
Hogues, Hervé
Sulea, Traian
author_sort Wei, Wanlei
collection PubMed
description [Image: see text] The medically relevant field of protein-based therapeutics has triggered a demand for protein engineering in different pH environments of biological relevance. In silico engineering workflows typically employ high-throughput screening campaigns that require evaluating large sets of protein residues and point mutations by fast yet accurate computational algorithms. While several high-throughput pK(a) prediction methods exist, their accuracies are unclear due to the lack of a current comprehensive benchmarking. Here, seven fast, efficient, and accessible approaches including PROPKA3, DeepKa, PKAI, PKAI+, DelPhiPKa, MCCE2, and H++ were systematically tested on a nonredundant subset of 408 measured protein residue pK(a) shifts from the pK(a) database (PKAD). While no method outperformed the null hypotheses with confidence, as illustrated by statistical bootstrapping, DeepKa, PKAI+, PROPKA3, and H++ had utility. More specifically, DeepKa consistently performed well in tests across multiple and individual amino acid residue types, as reflected by lower errors, higher correlations, and improved classifications. Arithmetic averaging of the best empirical predictors into simple consensuses improved overall transferability and accuracy up to a root-mean-square error of 0.76 pK(a) units and a correlation coefficient (R(2)) of 0.45 to experimental pK(a) shifts. This analysis should provide a basis for further methodological developments and guide future applications, which require embedding of computationally inexpensive pK(a) prediction methods, such as the optimization of antibodies for pH-dependent antigen binding.
format Online
Article
Text
id pubmed-10466379
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-104663792023-08-31 Comparative Performance of High-Throughput Methods for Protein pK(a) Predictions Wei, Wanlei Hogues, Hervé Sulea, Traian J Chem Inf Model [Image: see text] The medically relevant field of protein-based therapeutics has triggered a demand for protein engineering in different pH environments of biological relevance. In silico engineering workflows typically employ high-throughput screening campaigns that require evaluating large sets of protein residues and point mutations by fast yet accurate computational algorithms. While several high-throughput pK(a) prediction methods exist, their accuracies are unclear due to the lack of a current comprehensive benchmarking. Here, seven fast, efficient, and accessible approaches including PROPKA3, DeepKa, PKAI, PKAI+, DelPhiPKa, MCCE2, and H++ were systematically tested on a nonredundant subset of 408 measured protein residue pK(a) shifts from the pK(a) database (PKAD). While no method outperformed the null hypotheses with confidence, as illustrated by statistical bootstrapping, DeepKa, PKAI+, PROPKA3, and H++ had utility. More specifically, DeepKa consistently performed well in tests across multiple and individual amino acid residue types, as reflected by lower errors, higher correlations, and improved classifications. Arithmetic averaging of the best empirical predictors into simple consensuses improved overall transferability and accuracy up to a root-mean-square error of 0.76 pK(a) units and a correlation coefficient (R(2)) of 0.45 to experimental pK(a) shifts. This analysis should provide a basis for further methodological developments and guide future applications, which require embedding of computationally inexpensive pK(a) prediction methods, such as the optimization of antibodies for pH-dependent antigen binding. American Chemical Society 2023-08-08 /pmc/articles/PMC10466379/ /pubmed/37549424 http://dx.doi.org/10.1021/acs.jcim.3c00165 Text en Crown © 2023. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Wei, Wanlei
Hogues, Hervé
Sulea, Traian
Comparative Performance of High-Throughput Methods for Protein pK(a) Predictions
title Comparative Performance of High-Throughput Methods for Protein pK(a) Predictions
title_full Comparative Performance of High-Throughput Methods for Protein pK(a) Predictions
title_fullStr Comparative Performance of High-Throughput Methods for Protein pK(a) Predictions
title_full_unstemmed Comparative Performance of High-Throughput Methods for Protein pK(a) Predictions
title_short Comparative Performance of High-Throughput Methods for Protein pK(a) Predictions
title_sort comparative performance of high-throughput methods for protein pk(a) predictions
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10466379/
https://www.ncbi.nlm.nih.gov/pubmed/37549424
http://dx.doi.org/10.1021/acs.jcim.3c00165
work_keys_str_mv AT weiwanlei comparativeperformanceofhighthroughputmethodsforproteinpkapredictions
AT hoguesherve comparativeperformanceofhighthroughputmethodsforproteinpkapredictions
AT suleatraian comparativeperformanceofhighthroughputmethodsforproteinpkapredictions