Cargando…

A high‐throughput predictive method for sequence‐similar fold switchers

Although most experimentally characterized proteins with similar sequences assume the same folds and perform similar functions, an increasing number of exceptions is emerging. One class of exceptions comprises sequence‐similar fold switchers, whose secondary structures shift from α‐helix <‐> β...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Allen K., Looger, Loren L., Porter, Lauren L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8404102/
https://www.ncbi.nlm.nih.gov/pubmed/33462801
http://dx.doi.org/10.1002/bip.23416
_version_ 1783746103306354688
author Kim, Allen K.
Looger, Loren L.
Porter, Lauren L.
author_facet Kim, Allen K.
Looger, Loren L.
Porter, Lauren L.
author_sort Kim, Allen K.
collection PubMed
description Although most experimentally characterized proteins with similar sequences assume the same folds and perform similar functions, an increasing number of exceptions is emerging. One class of exceptions comprises sequence‐similar fold switchers, whose secondary structures shift from α‐helix <‐> β‐sheet through a small number of mutations, a sequence insertion, or a deletion. Predictive methods for identifying sequence‐similar fold switchers are desirable because some are associated with disease and/or can perform different functions in cells. Here, we use homology‐based secondary structure predictions to identify sequence‐similar fold switchers from their amino acid sequences alone. To do this, we predicted the secondary structures of sequence‐similar fold switchers using three different homology‐based secondary structure predictors: PSIPRED, JPred4, and SPIDER3. We found that α‐helix <‐> β‐strand prediction discrepancies from JPred4 discriminated between the different conformations of sequence‐similar fold switchers with high statistical significance (P < 1.8*10(−19)). Thus, we used these discrepancies as a classifier and found that they can often robustly discriminate between sequence‐similar fold switchers and sequence‐similar proteins that maintain the same folds (Matthews Correlation Coefficient of 0.82). We found that JPred4 is a more robust predictor of sequence‐similar fold switchers because of (a) the curated sequence database it uses to produce multiple sequence alignments and (b) its use of sequence profiles based on Hidden Markov Models. Our results indicate that inconsistencies between JPred4 secondary structure predictions can be used to identify some sequence‐similar fold switchers from their sequences alone. Thus, the negative information from inconsistent secondary structure predictions can potentially be leveraged to identify sequence‐similar fold switchers from the broad base of genomic sequences.
format Online
Article
Text
id pubmed-8404102
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley & Sons, Inc.
record_format MEDLINE/PubMed
spelling pubmed-84041022021-11-22 A high‐throughput predictive method for sequence‐similar fold switchers Kim, Allen K. Looger, Loren L. Porter, Lauren L. Biopolymers Articles Although most experimentally characterized proteins with similar sequences assume the same folds and perform similar functions, an increasing number of exceptions is emerging. One class of exceptions comprises sequence‐similar fold switchers, whose secondary structures shift from α‐helix <‐> β‐sheet through a small number of mutations, a sequence insertion, or a deletion. Predictive methods for identifying sequence‐similar fold switchers are desirable because some are associated with disease and/or can perform different functions in cells. Here, we use homology‐based secondary structure predictions to identify sequence‐similar fold switchers from their amino acid sequences alone. To do this, we predicted the secondary structures of sequence‐similar fold switchers using three different homology‐based secondary structure predictors: PSIPRED, JPred4, and SPIDER3. We found that α‐helix <‐> β‐strand prediction discrepancies from JPred4 discriminated between the different conformations of sequence‐similar fold switchers with high statistical significance (P < 1.8*10(−19)). Thus, we used these discrepancies as a classifier and found that they can often robustly discriminate between sequence‐similar fold switchers and sequence‐similar proteins that maintain the same folds (Matthews Correlation Coefficient of 0.82). We found that JPred4 is a more robust predictor of sequence‐similar fold switchers because of (a) the curated sequence database it uses to produce multiple sequence alignments and (b) its use of sequence profiles based on Hidden Markov Models. Our results indicate that inconsistencies between JPred4 secondary structure predictions can be used to identify some sequence‐similar fold switchers from their sequences alone. Thus, the negative information from inconsistent secondary structure predictions can potentially be leveraged to identify sequence‐similar fold switchers from the broad base of genomic sequences. John Wiley & Sons, Inc. 2021-01-19 2021-10 /pmc/articles/PMC8404102/ /pubmed/33462801 http://dx.doi.org/10.1002/bip.23416 Text en © 2021 The Authors. Biopolymers published by Wiley Periodicals LLC. This article has been contributed to by US Government employees and their work is in the public domain in the USA. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Articles
Kim, Allen K.
Looger, Loren L.
Porter, Lauren L.
A high‐throughput predictive method for sequence‐similar fold switchers
title A high‐throughput predictive method for sequence‐similar fold switchers
title_full A high‐throughput predictive method for sequence‐similar fold switchers
title_fullStr A high‐throughput predictive method for sequence‐similar fold switchers
title_full_unstemmed A high‐throughput predictive method for sequence‐similar fold switchers
title_short A high‐throughput predictive method for sequence‐similar fold switchers
title_sort high‐throughput predictive method for sequence‐similar fold switchers
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8404102/
https://www.ncbi.nlm.nih.gov/pubmed/33462801
http://dx.doi.org/10.1002/bip.23416
work_keys_str_mv AT kimallenk ahighthroughputpredictivemethodforsequencesimilarfoldswitchers
AT loogerlorenl ahighthroughputpredictivemethodforsequencesimilarfoldswitchers
AT porterlaurenl ahighthroughputpredictivemethodforsequencesimilarfoldswitchers
AT kimallenk highthroughputpredictivemethodforsequencesimilarfoldswitchers
AT loogerlorenl highthroughputpredictivemethodforsequencesimilarfoldswitchers
AT porterlaurenl highthroughputpredictivemethodforsequencesimilarfoldswitchers