Cargando…
Prediction of protein self-interactions using stacked long short-term memory from protein sequences information
BACKGROUND: Self-interacting Proteins (SIPs) plays a critical role in a series of life function in most living cells. Researches on SIPs are important part of molecular biology. Although numerous SIPs data be provided, traditional experimental methods are labor-intensive, time-consuming and costly a...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6302371/ https://www.ncbi.nlm.nih.gov/pubmed/30577794 http://dx.doi.org/10.1186/s12918-018-0647-x |
_version_ | 1783381963284938752 |
---|---|
author | Wang, Yan-Bin You, Zhu-Hong Li, Xiao Jiang, Tong-Hai Cheng, Li Chen, Zhan-Heng |
author_facet | Wang, Yan-Bin You, Zhu-Hong Li, Xiao Jiang, Tong-Hai Cheng, Li Chen, Zhan-Heng |
author_sort | Wang, Yan-Bin |
collection | PubMed |
description | BACKGROUND: Self-interacting Proteins (SIPs) plays a critical role in a series of life function in most living cells. Researches on SIPs are important part of molecular biology. Although numerous SIPs data be provided, traditional experimental methods are labor-intensive, time-consuming and costly and can only yield limited results in real-world needs. Hence,it’s urgent to develop an efficient computational SIPs prediction method to fill the gap. Deep learning technologies have proven to produce subversive performance improvements in many areas, but the effectiveness of deep learning methods for SIPs prediction has not been verified. RESULTS: We developed a deep learning model for predicting SIPs by constructing a Stacked Long Short-Term Memory (SLSTM) neural network that contains “dropout”. We extracted features from protein sequences using a novel feature extraction scheme that combined Zernike Moments (ZMs) with Position Specific Weight Matrix (PSWM). The capability of the proposed approach was assessed on S.erevisiae and Human SIPs datasets. The result indicates that the approach based on deep learning can effectively resist data skew and achieve good accuracies of 95.69 and 97.88%, respectively. To demonstrate the progressiveness of deep learning, we compared the results of the SLSTM-based method and the celebrated Support Vector Machine (SVM) method and several other well-known methods on the same datasets. CONCLUSION: The results show that our method is overall superior to any of the other existing state-of-the-art techniques. As far as we know, this study first applies deep learning method to predict SIPs, and practical experimental results reveal its potential in SIPs identification. |
format | Online Article Text |
id | pubmed-6302371 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-63023712018-12-31 Prediction of protein self-interactions using stacked long short-term memory from protein sequences information Wang, Yan-Bin You, Zhu-Hong Li, Xiao Jiang, Tong-Hai Cheng, Li Chen, Zhan-Heng BMC Syst Biol Research BACKGROUND: Self-interacting Proteins (SIPs) plays a critical role in a series of life function in most living cells. Researches on SIPs are important part of molecular biology. Although numerous SIPs data be provided, traditional experimental methods are labor-intensive, time-consuming and costly and can only yield limited results in real-world needs. Hence,it’s urgent to develop an efficient computational SIPs prediction method to fill the gap. Deep learning technologies have proven to produce subversive performance improvements in many areas, but the effectiveness of deep learning methods for SIPs prediction has not been verified. RESULTS: We developed a deep learning model for predicting SIPs by constructing a Stacked Long Short-Term Memory (SLSTM) neural network that contains “dropout”. We extracted features from protein sequences using a novel feature extraction scheme that combined Zernike Moments (ZMs) with Position Specific Weight Matrix (PSWM). The capability of the proposed approach was assessed on S.erevisiae and Human SIPs datasets. The result indicates that the approach based on deep learning can effectively resist data skew and achieve good accuracies of 95.69 and 97.88%, respectively. To demonstrate the progressiveness of deep learning, we compared the results of the SLSTM-based method and the celebrated Support Vector Machine (SVM) method and several other well-known methods on the same datasets. CONCLUSION: The results show that our method is overall superior to any of the other existing state-of-the-art techniques. As far as we know, this study first applies deep learning method to predict SIPs, and practical experimental results reveal its potential in SIPs identification. BioMed Central 2018-12-21 /pmc/articles/PMC6302371/ /pubmed/30577794 http://dx.doi.org/10.1186/s12918-018-0647-x Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Wang, Yan-Bin You, Zhu-Hong Li, Xiao Jiang, Tong-Hai Cheng, Li Chen, Zhan-Heng Prediction of protein self-interactions using stacked long short-term memory from protein sequences information |
title | Prediction of protein self-interactions using stacked long short-term memory from protein sequences information |
title_full | Prediction of protein self-interactions using stacked long short-term memory from protein sequences information |
title_fullStr | Prediction of protein self-interactions using stacked long short-term memory from protein sequences information |
title_full_unstemmed | Prediction of protein self-interactions using stacked long short-term memory from protein sequences information |
title_short | Prediction of protein self-interactions using stacked long short-term memory from protein sequences information |
title_sort | prediction of protein self-interactions using stacked long short-term memory from protein sequences information |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6302371/ https://www.ncbi.nlm.nih.gov/pubmed/30577794 http://dx.doi.org/10.1186/s12918-018-0647-x |
work_keys_str_mv | AT wangyanbin predictionofproteinselfinteractionsusingstackedlongshorttermmemoryfromproteinsequencesinformation AT youzhuhong predictionofproteinselfinteractionsusingstackedlongshorttermmemoryfromproteinsequencesinformation AT lixiao predictionofproteinselfinteractionsusingstackedlongshorttermmemoryfromproteinsequencesinformation AT jiangtonghai predictionofproteinselfinteractionsusingstackedlongshorttermmemoryfromproteinsequencesinformation AT chengli predictionofproteinselfinteractionsusingstackedlongshorttermmemoryfromproteinsequencesinformation AT chenzhanheng predictionofproteinselfinteractionsusingstackedlongshorttermmemoryfromproteinsequencesinformation |