Cargando…

Solvent Accessibility Promotes Rotamer Errors during Protein Modeling with Major Side-Chain Prediction Programs

[Image: see text] Side-chain rotamer prediction is one of the most critical late stages in protein 3D structure building. Highly advanced and specialized algorithms (e.g., FASPR, RASP, SCWRL4, and SCWRL4v) optimize this process by use of rotamer libraries, combinatorial searches, and scoring functio...

Descripción completa

Detalles Bibliográficos
Autores principales: Hameduh, Tareq, Mokry, Michal, Miller, Andrew D., Heger, Zbynek, Haddad, Yazan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10369486/
https://www.ncbi.nlm.nih.gov/pubmed/37410883
http://dx.doi.org/10.1021/acs.jcim.3c00134
_version_ 1785077770808721408
author Hameduh, Tareq
Mokry, Michal
Miller, Andrew D.
Heger, Zbynek
Haddad, Yazan
author_facet Hameduh, Tareq
Mokry, Michal
Miller, Andrew D.
Heger, Zbynek
Haddad, Yazan
author_sort Hameduh, Tareq
collection PubMed
description [Image: see text] Side-chain rotamer prediction is one of the most critical late stages in protein 3D structure building. Highly advanced and specialized algorithms (e.g., FASPR, RASP, SCWRL4, and SCWRL4v) optimize this process by use of rotamer libraries, combinatorial searches, and scoring functions. We seek to identify the sources of key rotamer errors as a basis for correcting and improving the accuracy of protein modeling going forward. In order to evaluate the aforementioned programs, we process 2496 high-quality single-chained all-atom filtered 30% homology protein 3D structures and use discretized rotamer analysis to compare original with calculated structures. Among 513,024 filtered residue records, increased amino acid residue-dependent rotamer errors—associated in particular with polar and charged amino acid residues (ARG, LYS, and GLN)—clearly correlate with increased amino acid residue solvent accessibility and an increased residue tendency toward the adoption of non-canonical off rotamers which modeling programs struggle to predict accurately. Understanding the impact of solvent accessibility now appears key to improved side-chain prediction accuracies.
format Online
Article
Text
id pubmed-10369486
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-103694862023-07-27 Solvent Accessibility Promotes Rotamer Errors during Protein Modeling with Major Side-Chain Prediction Programs Hameduh, Tareq Mokry, Michal Miller, Andrew D. Heger, Zbynek Haddad, Yazan J Chem Inf Model [Image: see text] Side-chain rotamer prediction is one of the most critical late stages in protein 3D structure building. Highly advanced and specialized algorithms (e.g., FASPR, RASP, SCWRL4, and SCWRL4v) optimize this process by use of rotamer libraries, combinatorial searches, and scoring functions. We seek to identify the sources of key rotamer errors as a basis for correcting and improving the accuracy of protein modeling going forward. In order to evaluate the aforementioned programs, we process 2496 high-quality single-chained all-atom filtered 30% homology protein 3D structures and use discretized rotamer analysis to compare original with calculated structures. Among 513,024 filtered residue records, increased amino acid residue-dependent rotamer errors—associated in particular with polar and charged amino acid residues (ARG, LYS, and GLN)—clearly correlate with increased amino acid residue solvent accessibility and an increased residue tendency toward the adoption of non-canonical off rotamers which modeling programs struggle to predict accurately. Understanding the impact of solvent accessibility now appears key to improved side-chain prediction accuracies. American Chemical Society 2023-07-06 /pmc/articles/PMC10369486/ /pubmed/37410883 http://dx.doi.org/10.1021/acs.jcim.3c00134 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Hameduh, Tareq
Mokry, Michal
Miller, Andrew D.
Heger, Zbynek
Haddad, Yazan
Solvent Accessibility Promotes Rotamer Errors during Protein Modeling with Major Side-Chain Prediction Programs
title Solvent Accessibility Promotes Rotamer Errors during Protein Modeling with Major Side-Chain Prediction Programs
title_full Solvent Accessibility Promotes Rotamer Errors during Protein Modeling with Major Side-Chain Prediction Programs
title_fullStr Solvent Accessibility Promotes Rotamer Errors during Protein Modeling with Major Side-Chain Prediction Programs
title_full_unstemmed Solvent Accessibility Promotes Rotamer Errors during Protein Modeling with Major Side-Chain Prediction Programs
title_short Solvent Accessibility Promotes Rotamer Errors during Protein Modeling with Major Side-Chain Prediction Programs
title_sort solvent accessibility promotes rotamer errors during protein modeling with major side-chain prediction programs
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10369486/
https://www.ncbi.nlm.nih.gov/pubmed/37410883
http://dx.doi.org/10.1021/acs.jcim.3c00134
work_keys_str_mv AT hameduhtareq solventaccessibilitypromotesrotamererrorsduringproteinmodelingwithmajorsidechainpredictionprograms
AT mokrymichal solventaccessibilitypromotesrotamererrorsduringproteinmodelingwithmajorsidechainpredictionprograms
AT millerandrewd solventaccessibilitypromotesrotamererrorsduringproteinmodelingwithmajorsidechainpredictionprograms
AT hegerzbynek solventaccessibilitypromotesrotamererrorsduringproteinmodelingwithmajorsidechainpredictionprograms
AT haddadyazan solventaccessibilitypromotesrotamererrorsduringproteinmodelingwithmajorsidechainpredictionprograms