Cargando…

A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences

Protein-protein interactions (PPIs) play an important role in the life activities of organisms. With the availability of large amounts of protein sequence data, PPIs prediction methods have attracted increasing attention. A variety of protein sequence coding methods have emerged, but the training of...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Xue, Wu, Yuejin, Wang, Rujing, Wei, Yuanyuan, Gui, Yuanmiao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6555512/
https://www.ncbi.nlm.nih.gov/pubmed/31173605
http://dx.doi.org/10.1371/journal.pone.0217312
_version_ 1783425167162081280
author Wang, Xue
Wu, Yuejin
Wang, Rujing
Wei, Yuanyuan
Gui, Yuanmiao
author_facet Wang, Xue
Wu, Yuejin
Wang, Rujing
Wei, Yuanyuan
Gui, Yuanmiao
author_sort Wang, Xue
collection PubMed
description Protein-protein interactions (PPIs) play an important role in the life activities of organisms. With the availability of large amounts of protein sequence data, PPIs prediction methods have attracted increasing attention. A variety of protein sequence coding methods have emerged, but the training of these methods is particularly time consuming. To solve this issue, we have proposed a novel matrix sequence coding method. Based on deep neural network (DNN) and a novel matrix protein sequence descriptor, we constructed a protein interaction prediction model for predicting PPIs. When performed on human PPIs data, the method achieved an accuracy of 94.34%, a recall of 98.28%, an area under the curve (AUC) of 97.79% and a loss of 23.25%. A non-redundant dataset was used to evaluate this prediction model, and the prediction accuracy is 88.29%. These results indicate that the matrix of sequence (MOS) descriptor can enhance the predictive power of PPIs and reduce training time, which can be a useful complement for future proteomics research. The experimental code and experimental results can be found at https://github.com/smalltalkman/hppi-tensorflow.
format Online
Article
Text
id pubmed-6555512
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-65555122019-06-17 A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences Wang, Xue Wu, Yuejin Wang, Rujing Wei, Yuanyuan Gui, Yuanmiao PLoS One Research Article Protein-protein interactions (PPIs) play an important role in the life activities of organisms. With the availability of large amounts of protein sequence data, PPIs prediction methods have attracted increasing attention. A variety of protein sequence coding methods have emerged, but the training of these methods is particularly time consuming. To solve this issue, we have proposed a novel matrix sequence coding method. Based on deep neural network (DNN) and a novel matrix protein sequence descriptor, we constructed a protein interaction prediction model for predicting PPIs. When performed on human PPIs data, the method achieved an accuracy of 94.34%, a recall of 98.28%, an area under the curve (AUC) of 97.79% and a loss of 23.25%. A non-redundant dataset was used to evaluate this prediction model, and the prediction accuracy is 88.29%. These results indicate that the matrix of sequence (MOS) descriptor can enhance the predictive power of PPIs and reduce training time, which can be a useful complement for future proteomics research. The experimental code and experimental results can be found at https://github.com/smalltalkman/hppi-tensorflow. Public Library of Science 2019-06-07 /pmc/articles/PMC6555512/ /pubmed/31173605 http://dx.doi.org/10.1371/journal.pone.0217312 Text en © 2019 Wang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wang, Xue
Wu, Yuejin
Wang, Rujing
Wei, Yuanyuan
Gui, Yuanmiao
A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences
title A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences
title_full A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences
title_fullStr A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences
title_full_unstemmed A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences
title_short A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences
title_sort novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6555512/
https://www.ncbi.nlm.nih.gov/pubmed/31173605
http://dx.doi.org/10.1371/journal.pone.0217312
work_keys_str_mv AT wangxue anovelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences
AT wuyuejin anovelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences
AT wangrujing anovelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences
AT weiyuanyuan anovelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences
AT guiyuanmiao anovelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences
AT wangxue novelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences
AT wuyuejin novelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences
AT wangrujing novelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences
AT weiyuanyuan novelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences
AT guiyuanmiao novelmatrixofsequencedescriptorsforpredictingproteinproteininteractionsfromaminoacidsequences