Cargando…
Positive-Unlabeled Learning for Pupylation Sites Prediction
Pupylation plays a key role in regulating various protein functions as a crucial posttranslational modification of prokaryotes. In order to understand the molecular mechanism of pupylation, it is important to identify pupylation substrates and sites accurately. Several computational methods have bee...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4992543/ https://www.ncbi.nlm.nih.gov/pubmed/27579315 http://dx.doi.org/10.1155/2016/4525786 |
_version_ | 1782449025840054272 |
---|---|
author | Jiang, Ming Cao, Jun-Zhe |
author_facet | Jiang, Ming Cao, Jun-Zhe |
author_sort | Jiang, Ming |
collection | PubMed |
description | Pupylation plays a key role in regulating various protein functions as a crucial posttranslational modification of prokaryotes. In order to understand the molecular mechanism of pupylation, it is important to identify pupylation substrates and sites accurately. Several computational methods have been developed to identify pupylation sites because the traditional experimental methods are time-consuming and labor-sensitive. With the existing computational methods, the experimentally annotated pupylation sites are used as the positive training set and the remaining nonannotated lysine residues as the negative training set to build classifiers to predict new pupylation sites from the unknown proteins. However, the remaining nonannotated lysine residues may contain pupylation sites which have not been experimentally validated yet. Unlike previous methods, in this study, the experimentally annotated pupylation sites were used as the positive training set whereas the remaining nonannotated lysine residues were used as the unlabeled training set. A novel method named PUL-PUP was proposed to predict pupylation sites by using positive-unlabeled learning technique. Our experimental results indicated that PUL-PUP outperforms the other methods significantly for the prediction of pupylation sites. As an application, PUL-PUP was also used to predict the most likely pupylation sites in nonannotated lysine sites. |
format | Online Article Text |
id | pubmed-4992543 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-49925432016-08-30 Positive-Unlabeled Learning for Pupylation Sites Prediction Jiang, Ming Cao, Jun-Zhe Biomed Res Int Research Article Pupylation plays a key role in regulating various protein functions as a crucial posttranslational modification of prokaryotes. In order to understand the molecular mechanism of pupylation, it is important to identify pupylation substrates and sites accurately. Several computational methods have been developed to identify pupylation sites because the traditional experimental methods are time-consuming and labor-sensitive. With the existing computational methods, the experimentally annotated pupylation sites are used as the positive training set and the remaining nonannotated lysine residues as the negative training set to build classifiers to predict new pupylation sites from the unknown proteins. However, the remaining nonannotated lysine residues may contain pupylation sites which have not been experimentally validated yet. Unlike previous methods, in this study, the experimentally annotated pupylation sites were used as the positive training set whereas the remaining nonannotated lysine residues were used as the unlabeled training set. A novel method named PUL-PUP was proposed to predict pupylation sites by using positive-unlabeled learning technique. Our experimental results indicated that PUL-PUP outperforms the other methods significantly for the prediction of pupylation sites. As an application, PUL-PUP was also used to predict the most likely pupylation sites in nonannotated lysine sites. Hindawi Publishing Corporation 2016 2016-08-07 /pmc/articles/PMC4992543/ /pubmed/27579315 http://dx.doi.org/10.1155/2016/4525786 Text en Copyright © 2016 M. Jiang and J.-Z. Cao. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Jiang, Ming Cao, Jun-Zhe Positive-Unlabeled Learning for Pupylation Sites Prediction |
title | Positive-Unlabeled Learning for Pupylation Sites Prediction |
title_full | Positive-Unlabeled Learning for Pupylation Sites Prediction |
title_fullStr | Positive-Unlabeled Learning for Pupylation Sites Prediction |
title_full_unstemmed | Positive-Unlabeled Learning for Pupylation Sites Prediction |
title_short | Positive-Unlabeled Learning for Pupylation Sites Prediction |
title_sort | positive-unlabeled learning for pupylation sites prediction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4992543/ https://www.ncbi.nlm.nih.gov/pubmed/27579315 http://dx.doi.org/10.1155/2016/4525786 |
work_keys_str_mv | AT jiangming positiveunlabeledlearningforpupylationsitesprediction AT caojunzhe positiveunlabeledlearningforpupylationsitesprediction |