Cargando…

PTPD: predicting therapeutic peptides by deep learning and word2vec

*: Background In the search for therapeutic peptides for disease treatments, many efforts have been made to identify various functional peptides from large numbers of peptide sequence databases. In this paper, we propose an effective computational model that uses deep learning and word2vec to predic...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Chuanyan, Gao, Rui, Zhang, Yusen, De Marinis, Yang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6728961/
https://www.ncbi.nlm.nih.gov/pubmed/31492094
http://dx.doi.org/10.1186/s12859-019-3006-z
_version_ 1783449513262841856
author Wu, Chuanyan
Gao, Rui
Zhang, Yusen
De Marinis, Yang
author_facet Wu, Chuanyan
Gao, Rui
Zhang, Yusen
De Marinis, Yang
author_sort Wu, Chuanyan
collection PubMed
description *: Background In the search for therapeutic peptides for disease treatments, many efforts have been made to identify various functional peptides from large numbers of peptide sequence databases. In this paper, we propose an effective computational model that uses deep learning and word2vec to predict therapeutic peptides (PTPD). *: Results Representation vectors of all k-mers were obtained through word2vec based on k-mer co-existence information. The original peptide sequences were then divided into k-mers using the windowing method. The peptide sequences were mapped to the input layer by the embedding vector obtained by word2vec. Three types of filters in the convolutional layers, as well as dropout and max-pooling operations, were applied to construct feature maps. These feature maps were concatenated into a fully connected dense layer, and rectified linear units (ReLU) and dropout operations were included to avoid over-fitting of PTPD. The classification probabilities were generated by a sigmoid function. PTPD was then validated using two datasets: an independent anticancer peptide dataset and a virulent protein dataset, on which it achieved accuracies of 96% and 94%, respectively. *: Conclusions PTPD identified novel therapeutic peptides efficiently, and it is suitable for application as a useful tool in therapeutic peptide design.
format Online
Article
Text
id pubmed-6728961
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-67289612019-09-12 PTPD: predicting therapeutic peptides by deep learning and word2vec Wu, Chuanyan Gao, Rui Zhang, Yusen De Marinis, Yang BMC Bioinformatics Methodology Article *: Background In the search for therapeutic peptides for disease treatments, many efforts have been made to identify various functional peptides from large numbers of peptide sequence databases. In this paper, we propose an effective computational model that uses deep learning and word2vec to predict therapeutic peptides (PTPD). *: Results Representation vectors of all k-mers were obtained through word2vec based on k-mer co-existence information. The original peptide sequences were then divided into k-mers using the windowing method. The peptide sequences were mapped to the input layer by the embedding vector obtained by word2vec. Three types of filters in the convolutional layers, as well as dropout and max-pooling operations, were applied to construct feature maps. These feature maps were concatenated into a fully connected dense layer, and rectified linear units (ReLU) and dropout operations were included to avoid over-fitting of PTPD. The classification probabilities were generated by a sigmoid function. PTPD was then validated using two datasets: an independent anticancer peptide dataset and a virulent protein dataset, on which it achieved accuracies of 96% and 94%, respectively. *: Conclusions PTPD identified novel therapeutic peptides efficiently, and it is suitable for application as a useful tool in therapeutic peptide design. BioMed Central 2019-09-06 /pmc/articles/PMC6728961/ /pubmed/31492094 http://dx.doi.org/10.1186/s12859-019-3006-z Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Wu, Chuanyan
Gao, Rui
Zhang, Yusen
De Marinis, Yang
PTPD: predicting therapeutic peptides by deep learning and word2vec
title PTPD: predicting therapeutic peptides by deep learning and word2vec
title_full PTPD: predicting therapeutic peptides by deep learning and word2vec
title_fullStr PTPD: predicting therapeutic peptides by deep learning and word2vec
title_full_unstemmed PTPD: predicting therapeutic peptides by deep learning and word2vec
title_short PTPD: predicting therapeutic peptides by deep learning and word2vec
title_sort ptpd: predicting therapeutic peptides by deep learning and word2vec
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6728961/
https://www.ncbi.nlm.nih.gov/pubmed/31492094
http://dx.doi.org/10.1186/s12859-019-3006-z
work_keys_str_mv AT wuchuanyan ptpdpredictingtherapeuticpeptidesbydeeplearningandword2vec
AT gaorui ptpdpredictingtherapeuticpeptidesbydeeplearningandword2vec
AT zhangyusen ptpdpredictingtherapeuticpeptidesbydeeplearningandword2vec
AT demarinisyang ptpdpredictingtherapeuticpeptidesbydeeplearningandword2vec