Cargando…

An Improved Deep Forest Model for Predicting Self-Interacting Proteins From Protein Sequence Using Wavelet Transformation

Self-interacting proteins (SIPs), whose more than two identities can interact with each other, play significant roles in the understanding of cellular process and cell functions. Although a number of experimental methods have been designed to detect the SIPs, they remain to be extremely time-consumi...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Zhan-Heng, Li, Li-Ping, He, Zhou, Zhou, Ji-Ren, Li, Yangming, Wong, Leon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6405691/
https://www.ncbi.nlm.nih.gov/pubmed/30881376
http://dx.doi.org/10.3389/fgene.2019.00090
_version_ 1783401136692133888
author Chen, Zhan-Heng
Li, Li-Ping
He, Zhou
Zhou, Ji-Ren
Li, Yangming
Wong, Leon
author_facet Chen, Zhan-Heng
Li, Li-Ping
He, Zhou
Zhou, Ji-Ren
Li, Yangming
Wong, Leon
author_sort Chen, Zhan-Heng
collection PubMed
description Self-interacting proteins (SIPs), whose more than two identities can interact with each other, play significant roles in the understanding of cellular process and cell functions. Although a number of experimental methods have been designed to detect the SIPs, they remain to be extremely time-consuming, expensive, and challenging even nowadays. Therefore, there is an urgent need to develop the computational methods for predicting SIPs. In this study, we propose a deep forest based predictor for accurate prediction of SIPs using protein sequence information. More specifically, a novel feature representation method, which integrate position-specific scoring matrix (PSSM) with wavelet transform, is introduced. To evaluate the performance of the proposed method, cross-validation tests are performed on two widely used benchmark datasets. The experimental results show that the proposed model achieved high accuracies of 95.43 and 93.65% on human and yeast datasets, respectively. The AUC value for evaluating the performance of the proposed method was also reported. The AUC value for yeast and human datasets are 0.9203 and 0.9586, respectively. To further show the advantage of the proposed method, it is compared with several existing methods. The results demonstrate that the proposed model is better than other SIPs prediction methods. This work can offer an effective architecture to biologists in detecting new SIPs.
format Online
Article
Text
id pubmed-6405691
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-64056912019-03-15 An Improved Deep Forest Model for Predicting Self-Interacting Proteins From Protein Sequence Using Wavelet Transformation Chen, Zhan-Heng Li, Li-Ping He, Zhou Zhou, Ji-Ren Li, Yangming Wong, Leon Front Genet Genetics Self-interacting proteins (SIPs), whose more than two identities can interact with each other, play significant roles in the understanding of cellular process and cell functions. Although a number of experimental methods have been designed to detect the SIPs, they remain to be extremely time-consuming, expensive, and challenging even nowadays. Therefore, there is an urgent need to develop the computational methods for predicting SIPs. In this study, we propose a deep forest based predictor for accurate prediction of SIPs using protein sequence information. More specifically, a novel feature representation method, which integrate position-specific scoring matrix (PSSM) with wavelet transform, is introduced. To evaluate the performance of the proposed method, cross-validation tests are performed on two widely used benchmark datasets. The experimental results show that the proposed model achieved high accuracies of 95.43 and 93.65% on human and yeast datasets, respectively. The AUC value for evaluating the performance of the proposed method was also reported. The AUC value for yeast and human datasets are 0.9203 and 0.9586, respectively. To further show the advantage of the proposed method, it is compared with several existing methods. The results demonstrate that the proposed model is better than other SIPs prediction methods. This work can offer an effective architecture to biologists in detecting new SIPs. Frontiers Media S.A. 2019-03-01 /pmc/articles/PMC6405691/ /pubmed/30881376 http://dx.doi.org/10.3389/fgene.2019.00090 Text en Copyright © 2019 Chen, Li, He, Zhou, Li and Wong. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Chen, Zhan-Heng
Li, Li-Ping
He, Zhou
Zhou, Ji-Ren
Li, Yangming
Wong, Leon
An Improved Deep Forest Model for Predicting Self-Interacting Proteins From Protein Sequence Using Wavelet Transformation
title An Improved Deep Forest Model for Predicting Self-Interacting Proteins From Protein Sequence Using Wavelet Transformation
title_full An Improved Deep Forest Model for Predicting Self-Interacting Proteins From Protein Sequence Using Wavelet Transformation
title_fullStr An Improved Deep Forest Model for Predicting Self-Interacting Proteins From Protein Sequence Using Wavelet Transformation
title_full_unstemmed An Improved Deep Forest Model for Predicting Self-Interacting Proteins From Protein Sequence Using Wavelet Transformation
title_short An Improved Deep Forest Model for Predicting Self-Interacting Proteins From Protein Sequence Using Wavelet Transformation
title_sort improved deep forest model for predicting self-interacting proteins from protein sequence using wavelet transformation
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6405691/
https://www.ncbi.nlm.nih.gov/pubmed/30881376
http://dx.doi.org/10.3389/fgene.2019.00090
work_keys_str_mv AT chenzhanheng animproveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation
AT liliping animproveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation
AT hezhou animproveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation
AT zhoujiren animproveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation
AT liyangming animproveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation
AT wongleon animproveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation
AT chenzhanheng improveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation
AT liliping improveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation
AT hezhou improveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation
AT zhoujiren improveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation
AT liyangming improveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation
AT wongleon improveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation