Cargando…

Positive-Unlabeled Learning for Pupylation Sites Prediction

Pupylation plays a key role in regulating various protein functions as a crucial posttranslational modification of prokaryotes. In order to understand the molecular mechanism of pupylation, it is important to identify pupylation substrates and sites accurately. Several computational methods have bee...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Ming, Cao, Jun-Zhe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4992543/
https://www.ncbi.nlm.nih.gov/pubmed/27579315
http://dx.doi.org/10.1155/2016/4525786
_version_ 1782449025840054272
author Jiang, Ming
Cao, Jun-Zhe
author_facet Jiang, Ming
Cao, Jun-Zhe
author_sort Jiang, Ming
collection PubMed
description Pupylation plays a key role in regulating various protein functions as a crucial posttranslational modification of prokaryotes. In order to understand the molecular mechanism of pupylation, it is important to identify pupylation substrates and sites accurately. Several computational methods have been developed to identify pupylation sites because the traditional experimental methods are time-consuming and labor-sensitive. With the existing computational methods, the experimentally annotated pupylation sites are used as the positive training set and the remaining nonannotated lysine residues as the negative training set to build classifiers to predict new pupylation sites from the unknown proteins. However, the remaining nonannotated lysine residues may contain pupylation sites which have not been experimentally validated yet. Unlike previous methods, in this study, the experimentally annotated pupylation sites were used as the positive training set whereas the remaining nonannotated lysine residues were used as the unlabeled training set. A novel method named PUL-PUP was proposed to predict pupylation sites by using positive-unlabeled learning technique. Our experimental results indicated that PUL-PUP outperforms the other methods significantly for the prediction of pupylation sites. As an application, PUL-PUP was also used to predict the most likely pupylation sites in nonannotated lysine sites.
format Online
Article
Text
id pubmed-4992543
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-49925432016-08-30 Positive-Unlabeled Learning for Pupylation Sites Prediction Jiang, Ming Cao, Jun-Zhe Biomed Res Int Research Article Pupylation plays a key role in regulating various protein functions as a crucial posttranslational modification of prokaryotes. In order to understand the molecular mechanism of pupylation, it is important to identify pupylation substrates and sites accurately. Several computational methods have been developed to identify pupylation sites because the traditional experimental methods are time-consuming and labor-sensitive. With the existing computational methods, the experimentally annotated pupylation sites are used as the positive training set and the remaining nonannotated lysine residues as the negative training set to build classifiers to predict new pupylation sites from the unknown proteins. However, the remaining nonannotated lysine residues may contain pupylation sites which have not been experimentally validated yet. Unlike previous methods, in this study, the experimentally annotated pupylation sites were used as the positive training set whereas the remaining nonannotated lysine residues were used as the unlabeled training set. A novel method named PUL-PUP was proposed to predict pupylation sites by using positive-unlabeled learning technique. Our experimental results indicated that PUL-PUP outperforms the other methods significantly for the prediction of pupylation sites. As an application, PUL-PUP was also used to predict the most likely pupylation sites in nonannotated lysine sites. Hindawi Publishing Corporation 2016 2016-08-07 /pmc/articles/PMC4992543/ /pubmed/27579315 http://dx.doi.org/10.1155/2016/4525786 Text en Copyright © 2016 M. Jiang and J.-Z. Cao. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Jiang, Ming
Cao, Jun-Zhe
Positive-Unlabeled Learning for Pupylation Sites Prediction
title Positive-Unlabeled Learning for Pupylation Sites Prediction
title_full Positive-Unlabeled Learning for Pupylation Sites Prediction
title_fullStr Positive-Unlabeled Learning for Pupylation Sites Prediction
title_full_unstemmed Positive-Unlabeled Learning for Pupylation Sites Prediction
title_short Positive-Unlabeled Learning for Pupylation Sites Prediction
title_sort positive-unlabeled learning for pupylation sites prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4992543/
https://www.ncbi.nlm.nih.gov/pubmed/27579315
http://dx.doi.org/10.1155/2016/4525786
work_keys_str_mv AT jiangming positiveunlabeledlearningforpupylationsitesprediction
AT caojunzhe positiveunlabeledlearningforpupylationsitesprediction