Cargando…

Predicting MoRFs in protein sequences using HMM profiles

BACKGROUND: Intrinsically Disordered Proteins (IDPs) lack an ordered three-dimensional structure and are enriched in various biological processes. The Molecular Recognition Features (MoRFs) are functional regions within IDPs that undergo a disorder-to-order transition on binding to a partner protein...

Descripción completa

Detalles Bibliográficos
Autores principales: Sharma, Ronesh, Kumar, Shiu, Tsunoda, Tatsuhiko, Patil, Ashwini, Sharma, Alok
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5259822/
https://www.ncbi.nlm.nih.gov/pubmed/28155710
http://dx.doi.org/10.1186/s12859-016-1375-0
_version_ 1782499280643162112
author Sharma, Ronesh
Kumar, Shiu
Tsunoda, Tatsuhiko
Patil, Ashwini
Sharma, Alok
author_facet Sharma, Ronesh
Kumar, Shiu
Tsunoda, Tatsuhiko
Patil, Ashwini
Sharma, Alok
author_sort Sharma, Ronesh
collection PubMed
description BACKGROUND: Intrinsically Disordered Proteins (IDPs) lack an ordered three-dimensional structure and are enriched in various biological processes. The Molecular Recognition Features (MoRFs) are functional regions within IDPs that undergo a disorder-to-order transition on binding to a partner protein. Identifying MoRFs in IDPs using computational methods is a challenging task. METHODS: In this study, we introduce hidden Markov model (HMM) profiles to accurately identify the location of MoRFs in disordered protein sequences. Using windowing technique, HMM profiles are utilised to extract features from protein sequences and support vector machines (SVM) are used to calculate a propensity score for each residue. Two different SVM kernels with high noise tolerance are evaluated with a varying window size and the scores of the SVM models are combined to generate the final propensity score to predict MoRF residues. The SVM models are designed to extract maximal information between MoRF residues, its neighboring regions (Flanks) and the remainder of the sequence (Others). RESULTS: To evaluate the proposed method, its performance was compared to that of other MoRF predictors; MoRFpred and ANCHOR. The results show that the proposed method outperforms these two predictors. CONCLUSIONS: Using HMM profile as a source of feature extraction, the proposed method indicates improvement in predicting MoRFs in disordered protein sequences.
format Online
Article
Text
id pubmed-5259822
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-52598222017-01-26 Predicting MoRFs in protein sequences using HMM profiles Sharma, Ronesh Kumar, Shiu Tsunoda, Tatsuhiko Patil, Ashwini Sharma, Alok BMC Bioinformatics Research BACKGROUND: Intrinsically Disordered Proteins (IDPs) lack an ordered three-dimensional structure and are enriched in various biological processes. The Molecular Recognition Features (MoRFs) are functional regions within IDPs that undergo a disorder-to-order transition on binding to a partner protein. Identifying MoRFs in IDPs using computational methods is a challenging task. METHODS: In this study, we introduce hidden Markov model (HMM) profiles to accurately identify the location of MoRFs in disordered protein sequences. Using windowing technique, HMM profiles are utilised to extract features from protein sequences and support vector machines (SVM) are used to calculate a propensity score for each residue. Two different SVM kernels with high noise tolerance are evaluated with a varying window size and the scores of the SVM models are combined to generate the final propensity score to predict MoRF residues. The SVM models are designed to extract maximal information between MoRF residues, its neighboring regions (Flanks) and the remainder of the sequence (Others). RESULTS: To evaluate the proposed method, its performance was compared to that of other MoRF predictors; MoRFpred and ANCHOR. The results show that the proposed method outperforms these two predictors. CONCLUSIONS: Using HMM profile as a source of feature extraction, the proposed method indicates improvement in predicting MoRFs in disordered protein sequences. BioMed Central 2016-12-22 /pmc/articles/PMC5259822/ /pubmed/28155710 http://dx.doi.org/10.1186/s12859-016-1375-0 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Sharma, Ronesh
Kumar, Shiu
Tsunoda, Tatsuhiko
Patil, Ashwini
Sharma, Alok
Predicting MoRFs in protein sequences using HMM profiles
title Predicting MoRFs in protein sequences using HMM profiles
title_full Predicting MoRFs in protein sequences using HMM profiles
title_fullStr Predicting MoRFs in protein sequences using HMM profiles
title_full_unstemmed Predicting MoRFs in protein sequences using HMM profiles
title_short Predicting MoRFs in protein sequences using HMM profiles
title_sort predicting morfs in protein sequences using hmm profiles
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5259822/
https://www.ncbi.nlm.nih.gov/pubmed/28155710
http://dx.doi.org/10.1186/s12859-016-1375-0
work_keys_str_mv AT sharmaronesh predictingmorfsinproteinsequencesusinghmmprofiles
AT kumarshiu predictingmorfsinproteinsequencesusinghmmprofiles
AT tsunodatatsuhiko predictingmorfsinproteinsequencesusinghmmprofiles
AT patilashwini predictingmorfsinproteinsequencesusinghmmprofiles
AT sharmaalok predictingmorfsinproteinsequencesusinghmmprofiles