Cargando…

Computational prediction of MoRFs based on protein sequences and minimax probability machine

BACKGROUND: Molecular recognition features (MoRFs) are one important type of disordered segments that can promote specific protein-protein interactions. They are located within longer intrinsically disordered regions (IDRs), and undergo disorder-to-order transitions upon binding to their interaction...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Hao, Zhao, Jiaxiang, Sun, Guiling
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6819637/
https://www.ncbi.nlm.nih.gov/pubmed/31660849
http://dx.doi.org/10.1186/s12859-019-3111-z
_version_ 1783463780664999936
author He, Hao
Zhao, Jiaxiang
Sun, Guiling
author_facet He, Hao
Zhao, Jiaxiang
Sun, Guiling
author_sort He, Hao
collection PubMed
description BACKGROUND: Molecular recognition features (MoRFs) are one important type of disordered segments that can promote specific protein-protein interactions. They are located within longer intrinsically disordered regions (IDRs), and undergo disorder-to-order transitions upon binding to their interaction partners. The functional importance of MoRFs and the limitation of experimental identification make it necessary to predict MoRFs accurately with computational methods. RESULTS: In this study, a new sequence-based method, named as MoRF(MPM), is proposed for predicting MoRFs. MoRF(MPM) uses minimax probability machine (MPM) to predict MoRFs based on 16 features and 3 different windows, which neither relying on other predictors nor calculating the properties of the surrounding regions of MoRFs separately. Comparing with ANCHOR, MoRFpred and MoRF(CHiBi) on the same test sets, MoRF(MPM) not only obtains higher AUC, but also obtains higher TPR at low FPR. CONCLUSIONS: The features used in MoRF(MPM) can effectively predict MoRFs, especially after preprocessing. Besides, MoRF(MPM) uses a linear classification algorithm and does not rely on results of other predictors which makes it accessible and repeatable.
format Online
Article
Text
id pubmed-6819637
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68196372019-10-31 Computational prediction of MoRFs based on protein sequences and minimax probability machine He, Hao Zhao, Jiaxiang Sun, Guiling BMC Bioinformatics Methodology Article BACKGROUND: Molecular recognition features (MoRFs) are one important type of disordered segments that can promote specific protein-protein interactions. They are located within longer intrinsically disordered regions (IDRs), and undergo disorder-to-order transitions upon binding to their interaction partners. The functional importance of MoRFs and the limitation of experimental identification make it necessary to predict MoRFs accurately with computational methods. RESULTS: In this study, a new sequence-based method, named as MoRF(MPM), is proposed for predicting MoRFs. MoRF(MPM) uses minimax probability machine (MPM) to predict MoRFs based on 16 features and 3 different windows, which neither relying on other predictors nor calculating the properties of the surrounding regions of MoRFs separately. Comparing with ANCHOR, MoRFpred and MoRF(CHiBi) on the same test sets, MoRF(MPM) not only obtains higher AUC, but also obtains higher TPR at low FPR. CONCLUSIONS: The features used in MoRF(MPM) can effectively predict MoRFs, especially after preprocessing. Besides, MoRF(MPM) uses a linear classification algorithm and does not rely on results of other predictors which makes it accessible and repeatable. BioMed Central 2019-10-28 /pmc/articles/PMC6819637/ /pubmed/31660849 http://dx.doi.org/10.1186/s12859-019-3111-z Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
He, Hao
Zhao, Jiaxiang
Sun, Guiling
Computational prediction of MoRFs based on protein sequences and minimax probability machine
title Computational prediction of MoRFs based on protein sequences and minimax probability machine
title_full Computational prediction of MoRFs based on protein sequences and minimax probability machine
title_fullStr Computational prediction of MoRFs based on protein sequences and minimax probability machine
title_full_unstemmed Computational prediction of MoRFs based on protein sequences and minimax probability machine
title_short Computational prediction of MoRFs based on protein sequences and minimax probability machine
title_sort computational prediction of morfs based on protein sequences and minimax probability machine
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6819637/
https://www.ncbi.nlm.nih.gov/pubmed/31660849
http://dx.doi.org/10.1186/s12859-019-3111-z
work_keys_str_mv AT hehao computationalpredictionofmorfsbasedonproteinsequencesandminimaxprobabilitymachine
AT zhaojiaxiang computationalpredictionofmorfsbasedonproteinsequencesandminimaxprobabilitymachine
AT sunguiling computationalpredictionofmorfsbasedonproteinsequencesandminimaxprobabilitymachine