Cargando…

A novel framework for the identification of drug target proteins: Combining stacked auto-encoders with a biased support vector machine

The identification of drug target proteins (IDTP) plays a critical role in biometrics. The aim of this study was to retrieve potential drug target proteins (DTPs) from a collected protein dataset, which represents an overwhelming task of great significance. Previously reported methodologies for this...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Qi, Feng, YangHe, Huang, JinCai, Wang, TengJiao, Cheng, GuangQuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5409512/
https://www.ncbi.nlm.nih.gov/pubmed/28453576
http://dx.doi.org/10.1371/journal.pone.0176486
_version_ 1783232481431912448
author Wang, Qi
Feng, YangHe
Huang, JinCai
Wang, TengJiao
Cheng, GuangQuan
author_facet Wang, Qi
Feng, YangHe
Huang, JinCai
Wang, TengJiao
Cheng, GuangQuan
author_sort Wang, Qi
collection PubMed
description The identification of drug target proteins (IDTP) plays a critical role in biometrics. The aim of this study was to retrieve potential drug target proteins (DTPs) from a collected protein dataset, which represents an overwhelming task of great significance. Previously reported methodologies for this task generally employ protein-protein interactive networks but neglect informative biochemical attributes. We formulated a novel framework utilizing biochemical attributes to address this problem. In the framework, a biased support vector machine (BSVM) was combined with the deep embedded representation extracted using a deep learning model, stacked auto-encoders (SAEs). In cases of non-drug target proteins (NDTPs) contaminated by DTPs, the framework is beneficial due to the efficient representation of the SAE and relief of the imbalance effect by the BSVM. The experimental results demonstrated the effectiveness of our framework, and the generalization capability was confirmed via comparisons to other models. This study is the first to exploit a deep learning model for IDTP. In summary, nearly 23% of the NDTPs were predicted as likely DTPs, which are awaiting further verification based on biomedical experiments.
format Online
Article
Text
id pubmed-5409512
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-54095122017-05-12 A novel framework for the identification of drug target proteins: Combining stacked auto-encoders with a biased support vector machine Wang, Qi Feng, YangHe Huang, JinCai Wang, TengJiao Cheng, GuangQuan PLoS One Research Article The identification of drug target proteins (IDTP) plays a critical role in biometrics. The aim of this study was to retrieve potential drug target proteins (DTPs) from a collected protein dataset, which represents an overwhelming task of great significance. Previously reported methodologies for this task generally employ protein-protein interactive networks but neglect informative biochemical attributes. We formulated a novel framework utilizing biochemical attributes to address this problem. In the framework, a biased support vector machine (BSVM) was combined with the deep embedded representation extracted using a deep learning model, stacked auto-encoders (SAEs). In cases of non-drug target proteins (NDTPs) contaminated by DTPs, the framework is beneficial due to the efficient representation of the SAE and relief of the imbalance effect by the BSVM. The experimental results demonstrated the effectiveness of our framework, and the generalization capability was confirmed via comparisons to other models. This study is the first to exploit a deep learning model for IDTP. In summary, nearly 23% of the NDTPs were predicted as likely DTPs, which are awaiting further verification based on biomedical experiments. Public Library of Science 2017-04-28 /pmc/articles/PMC5409512/ /pubmed/28453576 http://dx.doi.org/10.1371/journal.pone.0176486 Text en © 2017 Wang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wang, Qi
Feng, YangHe
Huang, JinCai
Wang, TengJiao
Cheng, GuangQuan
A novel framework for the identification of drug target proteins: Combining stacked auto-encoders with a biased support vector machine
title A novel framework for the identification of drug target proteins: Combining stacked auto-encoders with a biased support vector machine
title_full A novel framework for the identification of drug target proteins: Combining stacked auto-encoders with a biased support vector machine
title_fullStr A novel framework for the identification of drug target proteins: Combining stacked auto-encoders with a biased support vector machine
title_full_unstemmed A novel framework for the identification of drug target proteins: Combining stacked auto-encoders with a biased support vector machine
title_short A novel framework for the identification of drug target proteins: Combining stacked auto-encoders with a biased support vector machine
title_sort novel framework for the identification of drug target proteins: combining stacked auto-encoders with a biased support vector machine
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5409512/
https://www.ncbi.nlm.nih.gov/pubmed/28453576
http://dx.doi.org/10.1371/journal.pone.0176486
work_keys_str_mv AT wangqi anovelframeworkfortheidentificationofdrugtargetproteinscombiningstackedautoencoderswithabiasedsupportvectormachine
AT fengyanghe anovelframeworkfortheidentificationofdrugtargetproteinscombiningstackedautoencoderswithabiasedsupportvectormachine
AT huangjincai anovelframeworkfortheidentificationofdrugtargetproteinscombiningstackedautoencoderswithabiasedsupportvectormachine
AT wangtengjiao anovelframeworkfortheidentificationofdrugtargetproteinscombiningstackedautoencoderswithabiasedsupportvectormachine
AT chengguangquan anovelframeworkfortheidentificationofdrugtargetproteinscombiningstackedautoencoderswithabiasedsupportvectormachine
AT wangqi novelframeworkfortheidentificationofdrugtargetproteinscombiningstackedautoencoderswithabiasedsupportvectormachine
AT fengyanghe novelframeworkfortheidentificationofdrugtargetproteinscombiningstackedautoencoderswithabiasedsupportvectormachine
AT huangjincai novelframeworkfortheidentificationofdrugtargetproteinscombiningstackedautoencoderswithabiasedsupportvectormachine
AT wangtengjiao novelframeworkfortheidentificationofdrugtargetproteinscombiningstackedautoencoderswithabiasedsupportvectormachine
AT chengguangquan novelframeworkfortheidentificationofdrugtargetproteinscombiningstackedautoencoderswithabiasedsupportvectormachine