Cargando…

PhosIDN: an integrated deep neural network for improving protein phosphorylation site prediction by combining sequence and protein–protein interaction information

MOTIVATION: Phosphorylation is one of the most studied post-translational modifications, which plays a pivotal role in various cellular processes. Recently, deep learning methods have achieved great success in prediction of phosphorylation sites, but most of them are based on convolutional neural ne...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Hangyuan, Wang, Minghui, Liu, Xia, Zhao, Xing-Ming, Li, Ao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8665744/
https://www.ncbi.nlm.nih.gov/pubmed/34320631
http://dx.doi.org/10.1093/bioinformatics/btab551
_version_ 1784614072308727808
author Yang, Hangyuan
Wang, Minghui
Liu, Xia
Zhao, Xing-Ming
Li, Ao
author_facet Yang, Hangyuan
Wang, Minghui
Liu, Xia
Zhao, Xing-Ming
Li, Ao
author_sort Yang, Hangyuan
collection PubMed
description MOTIVATION: Phosphorylation is one of the most studied post-translational modifications, which plays a pivotal role in various cellular processes. Recently, deep learning methods have achieved great success in prediction of phosphorylation sites, but most of them are based on convolutional neural network that may not capture enough information about long-range dependencies between residues in a protein sequence. In addition, existing deep learning methods only make use of sequence information for predicting phosphorylation sites, and it is highly desirable to develop a deep learning architecture that can combine heterogeneous sequence and protein–protein interaction (PPI) information for more accurate phosphorylation site prediction. RESULTS: We present a novel integrated deep neural network named PhosIDN, for phosphorylation site prediction by extracting and combining sequence and PPI information. In PhosIDN, a sequence feature encoding sub-network is proposed to capture not only local patterns but also long-range dependencies from protein sequences. Meanwhile, useful PPI features are also extracted in PhosIDN by a PPI feature encoding sub-network adopting a multi-layer deep neural network. Moreover, to effectively combine sequence and PPI information, a heterogeneous feature combination sub-network is introduced to fully exploit the complex associations between sequence and PPI features, and their combined features are used for final prediction. Comprehensive experiment results demonstrate that the proposed PhosIDN significantly improves the prediction performance of phosphorylation sites and compares favorably with existing general and kinase-specific phosphorylation site prediction methods. AVAILABILITY AND IMPLEMENTATION: PhosIDN is freely available at https://github.com/ustchangyuanyang/PhosIDN. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8665744
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-86657442021-12-13 PhosIDN: an integrated deep neural network for improving protein phosphorylation site prediction by combining sequence and protein–protein interaction information Yang, Hangyuan Wang, Minghui Liu, Xia Zhao, Xing-Ming Li, Ao Bioinformatics Original Papers MOTIVATION: Phosphorylation is one of the most studied post-translational modifications, which plays a pivotal role in various cellular processes. Recently, deep learning methods have achieved great success in prediction of phosphorylation sites, but most of them are based on convolutional neural network that may not capture enough information about long-range dependencies between residues in a protein sequence. In addition, existing deep learning methods only make use of sequence information for predicting phosphorylation sites, and it is highly desirable to develop a deep learning architecture that can combine heterogeneous sequence and protein–protein interaction (PPI) information for more accurate phosphorylation site prediction. RESULTS: We present a novel integrated deep neural network named PhosIDN, for phosphorylation site prediction by extracting and combining sequence and PPI information. In PhosIDN, a sequence feature encoding sub-network is proposed to capture not only local patterns but also long-range dependencies from protein sequences. Meanwhile, useful PPI features are also extracted in PhosIDN by a PPI feature encoding sub-network adopting a multi-layer deep neural network. Moreover, to effectively combine sequence and PPI information, a heterogeneous feature combination sub-network is introduced to fully exploit the complex associations between sequence and PPI features, and their combined features are used for final prediction. Comprehensive experiment results demonstrate that the proposed PhosIDN significantly improves the prediction performance of phosphorylation sites and compares favorably with existing general and kinase-specific phosphorylation site prediction methods. AVAILABILITY AND IMPLEMENTATION: PhosIDN is freely available at https://github.com/ustchangyuanyang/PhosIDN. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-07-28 /pmc/articles/PMC8665744/ /pubmed/34320631 http://dx.doi.org/10.1093/bioinformatics/btab551 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Yang, Hangyuan
Wang, Minghui
Liu, Xia
Zhao, Xing-Ming
Li, Ao
PhosIDN: an integrated deep neural network for improving protein phosphorylation site prediction by combining sequence and protein–protein interaction information
title PhosIDN: an integrated deep neural network for improving protein phosphorylation site prediction by combining sequence and protein–protein interaction information
title_full PhosIDN: an integrated deep neural network for improving protein phosphorylation site prediction by combining sequence and protein–protein interaction information
title_fullStr PhosIDN: an integrated deep neural network for improving protein phosphorylation site prediction by combining sequence and protein–protein interaction information
title_full_unstemmed PhosIDN: an integrated deep neural network for improving protein phosphorylation site prediction by combining sequence and protein–protein interaction information
title_short PhosIDN: an integrated deep neural network for improving protein phosphorylation site prediction by combining sequence and protein–protein interaction information
title_sort phosidn: an integrated deep neural network for improving protein phosphorylation site prediction by combining sequence and protein–protein interaction information
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8665744/
https://www.ncbi.nlm.nih.gov/pubmed/34320631
http://dx.doi.org/10.1093/bioinformatics/btab551
work_keys_str_mv AT yanghangyuan phosidnanintegrateddeepneuralnetworkforimprovingproteinphosphorylationsitepredictionbycombiningsequenceandproteinproteininteractioninformation
AT wangminghui phosidnanintegrateddeepneuralnetworkforimprovingproteinphosphorylationsitepredictionbycombiningsequenceandproteinproteininteractioninformation
AT liuxia phosidnanintegrateddeepneuralnetworkforimprovingproteinphosphorylationsitepredictionbycombiningsequenceandproteinproteininteractioninformation
AT zhaoxingming phosidnanintegrateddeepneuralnetworkforimprovingproteinphosphorylationsitepredictionbycombiningsequenceandproteinproteininteractioninformation
AT liao phosidnanintegrateddeepneuralnetworkforimprovingproteinphosphorylationsitepredictionbycombiningsequenceandproteinproteininteractioninformation