Cargando…
Predicting MoRFs in protein sequences using HMM profiles
BACKGROUND: Intrinsically Disordered Proteins (IDPs) lack an ordered three-dimensional structure and are enriched in various biological processes. The Molecular Recognition Features (MoRFs) are functional regions within IDPs that undergo a disorder-to-order transition on binding to a partner protein...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5259822/ https://www.ncbi.nlm.nih.gov/pubmed/28155710 http://dx.doi.org/10.1186/s12859-016-1375-0 |
_version_ | 1782499280643162112 |
---|---|
author | Sharma, Ronesh Kumar, Shiu Tsunoda, Tatsuhiko Patil, Ashwini Sharma, Alok |
author_facet | Sharma, Ronesh Kumar, Shiu Tsunoda, Tatsuhiko Patil, Ashwini Sharma, Alok |
author_sort | Sharma, Ronesh |
collection | PubMed |
description | BACKGROUND: Intrinsically Disordered Proteins (IDPs) lack an ordered three-dimensional structure and are enriched in various biological processes. The Molecular Recognition Features (MoRFs) are functional regions within IDPs that undergo a disorder-to-order transition on binding to a partner protein. Identifying MoRFs in IDPs using computational methods is a challenging task. METHODS: In this study, we introduce hidden Markov model (HMM) profiles to accurately identify the location of MoRFs in disordered protein sequences. Using windowing technique, HMM profiles are utilised to extract features from protein sequences and support vector machines (SVM) are used to calculate a propensity score for each residue. Two different SVM kernels with high noise tolerance are evaluated with a varying window size and the scores of the SVM models are combined to generate the final propensity score to predict MoRF residues. The SVM models are designed to extract maximal information between MoRF residues, its neighboring regions (Flanks) and the remainder of the sequence (Others). RESULTS: To evaluate the proposed method, its performance was compared to that of other MoRF predictors; MoRFpred and ANCHOR. The results show that the proposed method outperforms these two predictors. CONCLUSIONS: Using HMM profile as a source of feature extraction, the proposed method indicates improvement in predicting MoRFs in disordered protein sequences. |
format | Online Article Text |
id | pubmed-5259822 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-52598222017-01-26 Predicting MoRFs in protein sequences using HMM profiles Sharma, Ronesh Kumar, Shiu Tsunoda, Tatsuhiko Patil, Ashwini Sharma, Alok BMC Bioinformatics Research BACKGROUND: Intrinsically Disordered Proteins (IDPs) lack an ordered three-dimensional structure and are enriched in various biological processes. The Molecular Recognition Features (MoRFs) are functional regions within IDPs that undergo a disorder-to-order transition on binding to a partner protein. Identifying MoRFs in IDPs using computational methods is a challenging task. METHODS: In this study, we introduce hidden Markov model (HMM) profiles to accurately identify the location of MoRFs in disordered protein sequences. Using windowing technique, HMM profiles are utilised to extract features from protein sequences and support vector machines (SVM) are used to calculate a propensity score for each residue. Two different SVM kernels with high noise tolerance are evaluated with a varying window size and the scores of the SVM models are combined to generate the final propensity score to predict MoRF residues. The SVM models are designed to extract maximal information between MoRF residues, its neighboring regions (Flanks) and the remainder of the sequence (Others). RESULTS: To evaluate the proposed method, its performance was compared to that of other MoRF predictors; MoRFpred and ANCHOR. The results show that the proposed method outperforms these two predictors. CONCLUSIONS: Using HMM profile as a source of feature extraction, the proposed method indicates improvement in predicting MoRFs in disordered protein sequences. BioMed Central 2016-12-22 /pmc/articles/PMC5259822/ /pubmed/28155710 http://dx.doi.org/10.1186/s12859-016-1375-0 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Sharma, Ronesh Kumar, Shiu Tsunoda, Tatsuhiko Patil, Ashwini Sharma, Alok Predicting MoRFs in protein sequences using HMM profiles |
title | Predicting MoRFs in protein sequences using HMM profiles |
title_full | Predicting MoRFs in protein sequences using HMM profiles |
title_fullStr | Predicting MoRFs in protein sequences using HMM profiles |
title_full_unstemmed | Predicting MoRFs in protein sequences using HMM profiles |
title_short | Predicting MoRFs in protein sequences using HMM profiles |
title_sort | predicting morfs in protein sequences using hmm profiles |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5259822/ https://www.ncbi.nlm.nih.gov/pubmed/28155710 http://dx.doi.org/10.1186/s12859-016-1375-0 |
work_keys_str_mv | AT sharmaronesh predictingmorfsinproteinsequencesusinghmmprofiles AT kumarshiu predictingmorfsinproteinsequencesusinghmmprofiles AT tsunodatatsuhiko predictingmorfsinproteinsequencesusinghmmprofiles AT patilashwini predictingmorfsinproteinsequencesusinghmmprofiles AT sharmaalok predictingmorfsinproteinsequencesusinghmmprofiles |