Cargando…

MFSPSSMpred: identifying short disorder-to-order binding regions in disordered proteins based on contextual local evolutionary conservation

BACKGROUND: Molecular recognition features (MoRFs) are short binding regions located in longer intrinsically disordered protein regions. Although these short regions lack a stable structure in the natural state, they readily undergo disorder-to-order transitions upon binding to their partner molecul...

Descripción completa

Detalles Bibliográficos
Autores principales: Fang, Chun, Noguchi, Tamotsu, Tominaga, Daisuke, Yamana, Hayato
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3853019/
https://www.ncbi.nlm.nih.gov/pubmed/24093637
http://dx.doi.org/10.1186/1471-2105-14-300
_version_ 1782478765906984960
author Fang, Chun
Noguchi, Tamotsu
Tominaga, Daisuke
Yamana, Hayato
author_facet Fang, Chun
Noguchi, Tamotsu
Tominaga, Daisuke
Yamana, Hayato
author_sort Fang, Chun
collection PubMed
description BACKGROUND: Molecular recognition features (MoRFs) are short binding regions located in longer intrinsically disordered protein regions. Although these short regions lack a stable structure in the natural state, they readily undergo disorder-to-order transitions upon binding to their partner molecules. MoRFs play critical roles in the molecular interaction network of a cell, and are associated with many human genetic diseases. Therefore, identification of MoRFs is an important step in understanding functional aspects of these proteins and in finding applications in drug design. RESULTS: Here, we propose a novel method for identifying MoRFs, named as MFSPSSMpred (Masked, Filtered and Smoothed Position-Specific Scoring Matrix-based Predictor). Firstly, a masking method is used to calculate the average local conservation scores of residues within a masking-window length in the position-specific scoring matrix (PSSM). Then, the scores below the average are filtered out. Finally, a smoothing method is used to incorporate the features of flanking regions for each residue to prepare the feature sets for prediction. Our method employs no predicted results from other classifiers as input, i.e., all features used in this method are extracted from the PSSM of sequence only. Experimental results show that, comparing with other methods tested on the same datasets, our method achieves the best performance: achieving 0.004~0.079 higher AUC than other methods when tested on TEST419, and achieving 0.045~0.212 higher AUC than other methods when tested on TEST2012. In addition, when tested on an independent membrane proteins-related dataset, MFSPSSMpred significantly outperformed the existing predictor MoRFpred. CONCLUSIONS: This study suggests that: 1) amino acid composition and physicochemical properties in the flanking regions of MoRFs are very different from those in the general non-MoRF regions; 2) MoRFs contain both highly conserved residues and highly variable residues and, on the whole, are highly locally conserved; and 3) combining contextual information with local conservation information of residues facilitates the prediction of MoRFs.
format Online
Article
Text
id pubmed-3853019
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38530192013-12-16 MFSPSSMpred: identifying short disorder-to-order binding regions in disordered proteins based on contextual local evolutionary conservation Fang, Chun Noguchi, Tamotsu Tominaga, Daisuke Yamana, Hayato BMC Bioinformatics Methodology Article BACKGROUND: Molecular recognition features (MoRFs) are short binding regions located in longer intrinsically disordered protein regions. Although these short regions lack a stable structure in the natural state, they readily undergo disorder-to-order transitions upon binding to their partner molecules. MoRFs play critical roles in the molecular interaction network of a cell, and are associated with many human genetic diseases. Therefore, identification of MoRFs is an important step in understanding functional aspects of these proteins and in finding applications in drug design. RESULTS: Here, we propose a novel method for identifying MoRFs, named as MFSPSSMpred (Masked, Filtered and Smoothed Position-Specific Scoring Matrix-based Predictor). Firstly, a masking method is used to calculate the average local conservation scores of residues within a masking-window length in the position-specific scoring matrix (PSSM). Then, the scores below the average are filtered out. Finally, a smoothing method is used to incorporate the features of flanking regions for each residue to prepare the feature sets for prediction. Our method employs no predicted results from other classifiers as input, i.e., all features used in this method are extracted from the PSSM of sequence only. Experimental results show that, comparing with other methods tested on the same datasets, our method achieves the best performance: achieving 0.004~0.079 higher AUC than other methods when tested on TEST419, and achieving 0.045~0.212 higher AUC than other methods when tested on TEST2012. In addition, when tested on an independent membrane proteins-related dataset, MFSPSSMpred significantly outperformed the existing predictor MoRFpred. CONCLUSIONS: This study suggests that: 1) amino acid composition and physicochemical properties in the flanking regions of MoRFs are very different from those in the general non-MoRF regions; 2) MoRFs contain both highly conserved residues and highly variable residues and, on the whole, are highly locally conserved; and 3) combining contextual information with local conservation information of residues facilitates the prediction of MoRFs. BioMed Central 2013-10-04 /pmc/articles/PMC3853019/ /pubmed/24093637 http://dx.doi.org/10.1186/1471-2105-14-300 Text en Copyright © 2013 Fang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Fang, Chun
Noguchi, Tamotsu
Tominaga, Daisuke
Yamana, Hayato
MFSPSSMpred: identifying short disorder-to-order binding regions in disordered proteins based on contextual local evolutionary conservation
title MFSPSSMpred: identifying short disorder-to-order binding regions in disordered proteins based on contextual local evolutionary conservation
title_full MFSPSSMpred: identifying short disorder-to-order binding regions in disordered proteins based on contextual local evolutionary conservation
title_fullStr MFSPSSMpred: identifying short disorder-to-order binding regions in disordered proteins based on contextual local evolutionary conservation
title_full_unstemmed MFSPSSMpred: identifying short disorder-to-order binding regions in disordered proteins based on contextual local evolutionary conservation
title_short MFSPSSMpred: identifying short disorder-to-order binding regions in disordered proteins based on contextual local evolutionary conservation
title_sort mfspssmpred: identifying short disorder-to-order binding regions in disordered proteins based on contextual local evolutionary conservation
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3853019/
https://www.ncbi.nlm.nih.gov/pubmed/24093637
http://dx.doi.org/10.1186/1471-2105-14-300
work_keys_str_mv AT fangchun mfspssmpredidentifyingshortdisordertoorderbindingregionsindisorderedproteinsbasedoncontextuallocalevolutionaryconservation
AT noguchitamotsu mfspssmpredidentifyingshortdisordertoorderbindingregionsindisorderedproteinsbasedoncontextuallocalevolutionaryconservation
AT tominagadaisuke mfspssmpredidentifyingshortdisordertoorderbindingregionsindisorderedproteinsbasedoncontextuallocalevolutionaryconservation
AT yamanahayato mfspssmpredidentifyingshortdisordertoorderbindingregionsindisorderedproteinsbasedoncontextuallocalevolutionaryconservation