Cargando…
An Improved Deep Forest Model for Predicting Self-Interacting Proteins From Protein Sequence Using Wavelet Transformation
Self-interacting proteins (SIPs), whose more than two identities can interact with each other, play significant roles in the understanding of cellular process and cell functions. Although a number of experimental methods have been designed to detect the SIPs, they remain to be extremely time-consumi...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6405691/ https://www.ncbi.nlm.nih.gov/pubmed/30881376 http://dx.doi.org/10.3389/fgene.2019.00090 |
_version_ | 1783401136692133888 |
---|---|
author | Chen, Zhan-Heng Li, Li-Ping He, Zhou Zhou, Ji-Ren Li, Yangming Wong, Leon |
author_facet | Chen, Zhan-Heng Li, Li-Ping He, Zhou Zhou, Ji-Ren Li, Yangming Wong, Leon |
author_sort | Chen, Zhan-Heng |
collection | PubMed |
description | Self-interacting proteins (SIPs), whose more than two identities can interact with each other, play significant roles in the understanding of cellular process and cell functions. Although a number of experimental methods have been designed to detect the SIPs, they remain to be extremely time-consuming, expensive, and challenging even nowadays. Therefore, there is an urgent need to develop the computational methods for predicting SIPs. In this study, we propose a deep forest based predictor for accurate prediction of SIPs using protein sequence information. More specifically, a novel feature representation method, which integrate position-specific scoring matrix (PSSM) with wavelet transform, is introduced. To evaluate the performance of the proposed method, cross-validation tests are performed on two widely used benchmark datasets. The experimental results show that the proposed model achieved high accuracies of 95.43 and 93.65% on human and yeast datasets, respectively. The AUC value for evaluating the performance of the proposed method was also reported. The AUC value for yeast and human datasets are 0.9203 and 0.9586, respectively. To further show the advantage of the proposed method, it is compared with several existing methods. The results demonstrate that the proposed model is better than other SIPs prediction methods. This work can offer an effective architecture to biologists in detecting new SIPs. |
format | Online Article Text |
id | pubmed-6405691 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-64056912019-03-15 An Improved Deep Forest Model for Predicting Self-Interacting Proteins From Protein Sequence Using Wavelet Transformation Chen, Zhan-Heng Li, Li-Ping He, Zhou Zhou, Ji-Ren Li, Yangming Wong, Leon Front Genet Genetics Self-interacting proteins (SIPs), whose more than two identities can interact with each other, play significant roles in the understanding of cellular process and cell functions. Although a number of experimental methods have been designed to detect the SIPs, they remain to be extremely time-consuming, expensive, and challenging even nowadays. Therefore, there is an urgent need to develop the computational methods for predicting SIPs. In this study, we propose a deep forest based predictor for accurate prediction of SIPs using protein sequence information. More specifically, a novel feature representation method, which integrate position-specific scoring matrix (PSSM) with wavelet transform, is introduced. To evaluate the performance of the proposed method, cross-validation tests are performed on two widely used benchmark datasets. The experimental results show that the proposed model achieved high accuracies of 95.43 and 93.65% on human and yeast datasets, respectively. The AUC value for evaluating the performance of the proposed method was also reported. The AUC value for yeast and human datasets are 0.9203 and 0.9586, respectively. To further show the advantage of the proposed method, it is compared with several existing methods. The results demonstrate that the proposed model is better than other SIPs prediction methods. This work can offer an effective architecture to biologists in detecting new SIPs. Frontiers Media S.A. 2019-03-01 /pmc/articles/PMC6405691/ /pubmed/30881376 http://dx.doi.org/10.3389/fgene.2019.00090 Text en Copyright © 2019 Chen, Li, He, Zhou, Li and Wong. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Chen, Zhan-Heng Li, Li-Ping He, Zhou Zhou, Ji-Ren Li, Yangming Wong, Leon An Improved Deep Forest Model for Predicting Self-Interacting Proteins From Protein Sequence Using Wavelet Transformation |
title | An Improved Deep Forest Model for Predicting Self-Interacting Proteins From Protein Sequence Using Wavelet Transformation |
title_full | An Improved Deep Forest Model for Predicting Self-Interacting Proteins From Protein Sequence Using Wavelet Transformation |
title_fullStr | An Improved Deep Forest Model for Predicting Self-Interacting Proteins From Protein Sequence Using Wavelet Transformation |
title_full_unstemmed | An Improved Deep Forest Model for Predicting Self-Interacting Proteins From Protein Sequence Using Wavelet Transformation |
title_short | An Improved Deep Forest Model for Predicting Self-Interacting Proteins From Protein Sequence Using Wavelet Transformation |
title_sort | improved deep forest model for predicting self-interacting proteins from protein sequence using wavelet transformation |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6405691/ https://www.ncbi.nlm.nih.gov/pubmed/30881376 http://dx.doi.org/10.3389/fgene.2019.00090 |
work_keys_str_mv | AT chenzhanheng animproveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation AT liliping animproveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation AT hezhou animproveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation AT zhoujiren animproveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation AT liyangming animproveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation AT wongleon animproveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation AT chenzhanheng improveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation AT liliping improveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation AT hezhou improveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation AT zhoujiren improveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation AT liyangming improveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation AT wongleon improveddeepforestmodelforpredictingselfinteractingproteinsfromproteinsequenceusingwavelettransformation |