Cargando…

Prediction of protein self-interactions using stacked long short-term memory from protein sequences information

BACKGROUND: Self-interacting Proteins (SIPs) plays a critical role in a series of life function in most living cells. Researches on SIPs are important part of molecular biology. Although numerous SIPs data be provided, traditional experimental methods are labor-intensive, time-consuming and costly a...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yan-Bin, You, Zhu-Hong, Li, Xiao, Jiang, Tong-Hai, Cheng, Li, Chen, Zhan-Heng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6302371/
https://www.ncbi.nlm.nih.gov/pubmed/30577794
http://dx.doi.org/10.1186/s12918-018-0647-x
_version_ 1783381963284938752
author Wang, Yan-Bin
You, Zhu-Hong
Li, Xiao
Jiang, Tong-Hai
Cheng, Li
Chen, Zhan-Heng
author_facet Wang, Yan-Bin
You, Zhu-Hong
Li, Xiao
Jiang, Tong-Hai
Cheng, Li
Chen, Zhan-Heng
author_sort Wang, Yan-Bin
collection PubMed
description BACKGROUND: Self-interacting Proteins (SIPs) plays a critical role in a series of life function in most living cells. Researches on SIPs are important part of molecular biology. Although numerous SIPs data be provided, traditional experimental methods are labor-intensive, time-consuming and costly and can only yield limited results in real-world needs. Hence,it’s urgent to develop an efficient computational SIPs prediction method to fill the gap. Deep learning technologies have proven to produce subversive performance improvements in many areas, but the effectiveness of deep learning methods for SIPs prediction has not been verified. RESULTS: We developed a deep learning model for predicting SIPs by constructing a Stacked Long Short-Term Memory (SLSTM) neural network that contains “dropout”. We extracted features from protein sequences using a novel feature extraction scheme that combined Zernike Moments (ZMs) with Position Specific Weight Matrix (PSWM). The capability of the proposed approach was assessed on S.erevisiae and Human SIPs datasets. The result indicates that the approach based on deep learning can effectively resist data skew and achieve good accuracies of 95.69 and 97.88%, respectively. To demonstrate the progressiveness of deep learning, we compared the results of the SLSTM-based method and the celebrated Support Vector Machine (SVM) method and several other well-known methods on the same datasets. CONCLUSION: The results show that our method is overall superior to any of the other existing state-of-the-art techniques. As far as we know, this study first applies deep learning method to predict SIPs, and practical experimental results reveal its potential in SIPs identification.
format Online
Article
Text
id pubmed-6302371
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63023712018-12-31 Prediction of protein self-interactions using stacked long short-term memory from protein sequences information Wang, Yan-Bin You, Zhu-Hong Li, Xiao Jiang, Tong-Hai Cheng, Li Chen, Zhan-Heng BMC Syst Biol Research BACKGROUND: Self-interacting Proteins (SIPs) plays a critical role in a series of life function in most living cells. Researches on SIPs are important part of molecular biology. Although numerous SIPs data be provided, traditional experimental methods are labor-intensive, time-consuming and costly and can only yield limited results in real-world needs. Hence,it’s urgent to develop an efficient computational SIPs prediction method to fill the gap. Deep learning technologies have proven to produce subversive performance improvements in many areas, but the effectiveness of deep learning methods for SIPs prediction has not been verified. RESULTS: We developed a deep learning model for predicting SIPs by constructing a Stacked Long Short-Term Memory (SLSTM) neural network that contains “dropout”. We extracted features from protein sequences using a novel feature extraction scheme that combined Zernike Moments (ZMs) with Position Specific Weight Matrix (PSWM). The capability of the proposed approach was assessed on S.erevisiae and Human SIPs datasets. The result indicates that the approach based on deep learning can effectively resist data skew and achieve good accuracies of 95.69 and 97.88%, respectively. To demonstrate the progressiveness of deep learning, we compared the results of the SLSTM-based method and the celebrated Support Vector Machine (SVM) method and several other well-known methods on the same datasets. CONCLUSION: The results show that our method is overall superior to any of the other existing state-of-the-art techniques. As far as we know, this study first applies deep learning method to predict SIPs, and practical experimental results reveal its potential in SIPs identification. BioMed Central 2018-12-21 /pmc/articles/PMC6302371/ /pubmed/30577794 http://dx.doi.org/10.1186/s12918-018-0647-x Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Wang, Yan-Bin
You, Zhu-Hong
Li, Xiao
Jiang, Tong-Hai
Cheng, Li
Chen, Zhan-Heng
Prediction of protein self-interactions using stacked long short-term memory from protein sequences information
title Prediction of protein self-interactions using stacked long short-term memory from protein sequences information
title_full Prediction of protein self-interactions using stacked long short-term memory from protein sequences information
title_fullStr Prediction of protein self-interactions using stacked long short-term memory from protein sequences information
title_full_unstemmed Prediction of protein self-interactions using stacked long short-term memory from protein sequences information
title_short Prediction of protein self-interactions using stacked long short-term memory from protein sequences information
title_sort prediction of protein self-interactions using stacked long short-term memory from protein sequences information
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6302371/
https://www.ncbi.nlm.nih.gov/pubmed/30577794
http://dx.doi.org/10.1186/s12918-018-0647-x
work_keys_str_mv AT wangyanbin predictionofproteinselfinteractionsusingstackedlongshorttermmemoryfromproteinsequencesinformation
AT youzhuhong predictionofproteinselfinteractionsusingstackedlongshorttermmemoryfromproteinsequencesinformation
AT lixiao predictionofproteinselfinteractionsusingstackedlongshorttermmemoryfromproteinsequencesinformation
AT jiangtonghai predictionofproteinselfinteractionsusingstackedlongshorttermmemoryfromproteinsequencesinformation
AT chengli predictionofproteinselfinteractionsusingstackedlongshorttermmemoryfromproteinsequencesinformation
AT chenzhanheng predictionofproteinselfinteractionsusingstackedlongshorttermmemoryfromproteinsequencesinformation