Cargando…

Hot spot prediction in protein-protein interactions by an ensemble system

BACKGROUND: Hot spot residues are functional sites in protein interaction interfaces. The identification of hot spot residues is time-consuming and laborious using experimental methods. In order to address the issue, many computational methods have been developed to predict hot spot residues. Moreov...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Quanya, Chen, Peng, Wang, Bing, Zhang, Jun, Li, Jinyan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6311905/
https://www.ncbi.nlm.nih.gov/pubmed/30598091
http://dx.doi.org/10.1186/s12918-018-0665-8
_version_ 1783383698049073152
author Liu, Quanya
Chen, Peng
Wang, Bing
Zhang, Jun
Li, Jinyan
author_facet Liu, Quanya
Chen, Peng
Wang, Bing
Zhang, Jun
Li, Jinyan
author_sort Liu, Quanya
collection PubMed
description BACKGROUND: Hot spot residues are functional sites in protein interaction interfaces. The identification of hot spot residues is time-consuming and laborious using experimental methods. In order to address the issue, many computational methods have been developed to predict hot spot residues. Moreover, most prediction methods are based on structural features, sequence characteristics, and/or other protein features. RESULTS: This paper proposed an ensemble learning method to predict hot spot residues that only uses sequence features and the relative accessible surface area of amino acid sequences. In this work, a novel feature selection technique was developed, an auto-correlation function combined with a sliding window technique was applied to obtain the characteristics of amino acid residues in protein sequence, and an ensemble classifier with SVM and KNN base classifiers was built to achieve the best classification performance. CONCLUSION: The experimental results showed that our model yields the highest F1 score of 0.92 and an MCC value of 0.87 on ASEdb dataset. Compared with other machine learning methods, our model achieves a big improvement in hot spot prediction. AVAILABILITY: http://deeplearner.ahu.edu.cn/web/HotspotEL.htm. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12918-018-0665-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6311905
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63119052019-01-07 Hot spot prediction in protein-protein interactions by an ensemble system Liu, Quanya Chen, Peng Wang, Bing Zhang, Jun Li, Jinyan BMC Syst Biol Research BACKGROUND: Hot spot residues are functional sites in protein interaction interfaces. The identification of hot spot residues is time-consuming and laborious using experimental methods. In order to address the issue, many computational methods have been developed to predict hot spot residues. Moreover, most prediction methods are based on structural features, sequence characteristics, and/or other protein features. RESULTS: This paper proposed an ensemble learning method to predict hot spot residues that only uses sequence features and the relative accessible surface area of amino acid sequences. In this work, a novel feature selection technique was developed, an auto-correlation function combined with a sliding window technique was applied to obtain the characteristics of amino acid residues in protein sequence, and an ensemble classifier with SVM and KNN base classifiers was built to achieve the best classification performance. CONCLUSION: The experimental results showed that our model yields the highest F1 score of 0.92 and an MCC value of 0.87 on ASEdb dataset. Compared with other machine learning methods, our model achieves a big improvement in hot spot prediction. AVAILABILITY: http://deeplearner.ahu.edu.cn/web/HotspotEL.htm. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12918-018-0665-8) contains supplementary material, which is available to authorized users. BioMed Central 2018-12-31 /pmc/articles/PMC6311905/ /pubmed/30598091 http://dx.doi.org/10.1186/s12918-018-0665-8 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Liu, Quanya
Chen, Peng
Wang, Bing
Zhang, Jun
Li, Jinyan
Hot spot prediction in protein-protein interactions by an ensemble system
title Hot spot prediction in protein-protein interactions by an ensemble system
title_full Hot spot prediction in protein-protein interactions by an ensemble system
title_fullStr Hot spot prediction in protein-protein interactions by an ensemble system
title_full_unstemmed Hot spot prediction in protein-protein interactions by an ensemble system
title_short Hot spot prediction in protein-protein interactions by an ensemble system
title_sort hot spot prediction in protein-protein interactions by an ensemble system
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6311905/
https://www.ncbi.nlm.nih.gov/pubmed/30598091
http://dx.doi.org/10.1186/s12918-018-0665-8
work_keys_str_mv AT liuquanya hotspotpredictioninproteinproteininteractionsbyanensemblesystem
AT chenpeng hotspotpredictioninproteinproteininteractionsbyanensemblesystem
AT wangbing hotspotpredictioninproteinproteininteractionsbyanensemblesystem
AT zhangjun hotspotpredictioninproteinproteininteractionsbyanensemblesystem
AT lijinyan hotspotpredictioninproteinproteininteractionsbyanensemblesystem