Cargando…
PTPD: predicting therapeutic peptides by deep learning and word2vec
*: Background In the search for therapeutic peptides for disease treatments, many efforts have been made to identify various functional peptides from large numbers of peptide sequence databases. In this paper, we propose an effective computational model that uses deep learning and word2vec to predic...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6728961/ https://www.ncbi.nlm.nih.gov/pubmed/31492094 http://dx.doi.org/10.1186/s12859-019-3006-z |
_version_ | 1783449513262841856 |
---|---|
author | Wu, Chuanyan Gao, Rui Zhang, Yusen De Marinis, Yang |
author_facet | Wu, Chuanyan Gao, Rui Zhang, Yusen De Marinis, Yang |
author_sort | Wu, Chuanyan |
collection | PubMed |
description | *: Background In the search for therapeutic peptides for disease treatments, many efforts have been made to identify various functional peptides from large numbers of peptide sequence databases. In this paper, we propose an effective computational model that uses deep learning and word2vec to predict therapeutic peptides (PTPD). *: Results Representation vectors of all k-mers were obtained through word2vec based on k-mer co-existence information. The original peptide sequences were then divided into k-mers using the windowing method. The peptide sequences were mapped to the input layer by the embedding vector obtained by word2vec. Three types of filters in the convolutional layers, as well as dropout and max-pooling operations, were applied to construct feature maps. These feature maps were concatenated into a fully connected dense layer, and rectified linear units (ReLU) and dropout operations were included to avoid over-fitting of PTPD. The classification probabilities were generated by a sigmoid function. PTPD was then validated using two datasets: an independent anticancer peptide dataset and a virulent protein dataset, on which it achieved accuracies of 96% and 94%, respectively. *: Conclusions PTPD identified novel therapeutic peptides efficiently, and it is suitable for application as a useful tool in therapeutic peptide design. |
format | Online Article Text |
id | pubmed-6728961 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-67289612019-09-12 PTPD: predicting therapeutic peptides by deep learning and word2vec Wu, Chuanyan Gao, Rui Zhang, Yusen De Marinis, Yang BMC Bioinformatics Methodology Article *: Background In the search for therapeutic peptides for disease treatments, many efforts have been made to identify various functional peptides from large numbers of peptide sequence databases. In this paper, we propose an effective computational model that uses deep learning and word2vec to predict therapeutic peptides (PTPD). *: Results Representation vectors of all k-mers were obtained through word2vec based on k-mer co-existence information. The original peptide sequences were then divided into k-mers using the windowing method. The peptide sequences were mapped to the input layer by the embedding vector obtained by word2vec. Three types of filters in the convolutional layers, as well as dropout and max-pooling operations, were applied to construct feature maps. These feature maps were concatenated into a fully connected dense layer, and rectified linear units (ReLU) and dropout operations were included to avoid over-fitting of PTPD. The classification probabilities were generated by a sigmoid function. PTPD was then validated using two datasets: an independent anticancer peptide dataset and a virulent protein dataset, on which it achieved accuracies of 96% and 94%, respectively. *: Conclusions PTPD identified novel therapeutic peptides efficiently, and it is suitable for application as a useful tool in therapeutic peptide design. BioMed Central 2019-09-06 /pmc/articles/PMC6728961/ /pubmed/31492094 http://dx.doi.org/10.1186/s12859-019-3006-z Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Wu, Chuanyan Gao, Rui Zhang, Yusen De Marinis, Yang PTPD: predicting therapeutic peptides by deep learning and word2vec |
title | PTPD: predicting therapeutic peptides by deep learning and word2vec |
title_full | PTPD: predicting therapeutic peptides by deep learning and word2vec |
title_fullStr | PTPD: predicting therapeutic peptides by deep learning and word2vec |
title_full_unstemmed | PTPD: predicting therapeutic peptides by deep learning and word2vec |
title_short | PTPD: predicting therapeutic peptides by deep learning and word2vec |
title_sort | ptpd: predicting therapeutic peptides by deep learning and word2vec |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6728961/ https://www.ncbi.nlm.nih.gov/pubmed/31492094 http://dx.doi.org/10.1186/s12859-019-3006-z |
work_keys_str_mv | AT wuchuanyan ptpdpredictingtherapeuticpeptidesbydeeplearningandword2vec AT gaorui ptpdpredictingtherapeuticpeptidesbydeeplearningandword2vec AT zhangyusen ptpdpredictingtherapeuticpeptidesbydeeplearningandword2vec AT demarinisyang ptpdpredictingtherapeuticpeptidesbydeeplearningandword2vec |