Cargando…
A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences
Protein-protein interactions (PPIs) play an important role in the life activities of organisms. With the availability of large amounts of protein sequence data, PPIs prediction methods have attracted increasing attention. A variety of protein sequence coding methods have emerged, but the training of...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6555512/ https://www.ncbi.nlm.nih.gov/pubmed/31173605 http://dx.doi.org/10.1371/journal.pone.0217312 |
_version_ | 1783425167162081280 |
---|---|
author | Wang, Xue Wu, Yuejin Wang, Rujing Wei, Yuanyuan Gui, Yuanmiao |
author_facet | Wang, Xue Wu, Yuejin Wang, Rujing Wei, Yuanyuan Gui, Yuanmiao |
author_sort | Wang, Xue |
collection | PubMed |
description | Protein-protein interactions (PPIs) play an important role in the life activities of organisms. With the availability of large amounts of protein sequence data, PPIs prediction methods have attracted increasing attention. A variety of protein sequence coding methods have emerged, but the training of these methods is particularly time consuming. To solve this issue, we have proposed a novel matrix sequence coding method. Based on deep neural network (DNN) and a novel matrix protein sequence descriptor, we constructed a protein interaction prediction model for predicting PPIs. When performed on human PPIs data, the method achieved an accuracy of 94.34%, a recall of 98.28%, an area under the curve (AUC) of 97.79% and a loss of 23.25%. A non-redundant dataset was used to evaluate this prediction model, and the prediction accuracy is 88.29%. These results indicate that the matrix of sequence (MOS) descriptor can enhance the predictive power of PPIs and reduce training time, which can be a useful complement for future proteomics research. The experimental code and experimental results can be found at https://github.com/smalltalkman/hppi-tensorflow. |
format | Online Article Text |
id | pubmed-6555512 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-65555122019-06-17 A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences Wang, Xue Wu, Yuejin Wang, Rujing Wei, Yuanyuan Gui, Yuanmiao PLoS One Research Article Protein-protein interactions (PPIs) play an important role in the life activities of organisms. With the availability of large amounts of protein sequence data, PPIs prediction methods have attracted increasing attention. A variety of protein sequence coding methods have emerged, but the training of these methods is particularly time consuming. To solve this issue, we have proposed a novel matrix sequence coding method. Based on deep neural network (DNN) and a novel matrix protein sequence descriptor, we constructed a protein interaction prediction model for predicting PPIs. When performed on human PPIs data, the method achieved an accuracy of 94.34%, a recall of 98.28%, an area under the curve (AUC) of 97.79% and a loss of 23.25%. A non-redundant dataset was used to evaluate this prediction model, and the prediction accuracy is 88.29%. These results indicate that the matrix of sequence (MOS) descriptor can enhance the predictive power of PPIs and reduce training time, which can be a useful complement for future proteomics research. The experimental code and experimental results can be found at https://github.com/smalltalkman/hppi-tensorflow. Public Library of Science 2019-06-07 /pmc/articles/PMC6555512/ /pubmed/31173605 http://dx.doi.org/10.1371/journal.pone.0217312 Text en © 2019 Wang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Wang, Xue Wu, Yuejin Wang, Rujing Wei, Yuanyuan Gui, Yuanmiao A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences |
title | A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences |
title_full | A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences |
title_fullStr | A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences |
title_full_unstemmed | A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences |
title_short | A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences |
title_sort | novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6555512/ https://www.ncbi.nlm.nih.gov/pubmed/31173605 http://dx.doi.org/10.1371/journal.pone.0217312 |
work_keys_str_mv | AT wangxue anovelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences AT wuyuejin anovelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences AT wangrujing anovelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences AT weiyuanyuan anovelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences AT guiyuanmiao anovelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences AT wangxue novelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences AT wuyuejin novelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences AT wangrujing novelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences AT weiyuanyuan novelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences AT guiyuanmiao novelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences |