Cargando…
predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance
Post-translational modification (PTM) involves covalent modification after the biosynthesis process and plays an essential role in the study of cell biology. Lysine phosphoglycerylation, a newly discovered reversible type of PTM that affects glycolytic enzyme activities, and is responsible for a wid...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8016359/ https://www.ncbi.nlm.nih.gov/pubmed/33793659 http://dx.doi.org/10.1371/journal.pone.0249396 |
_version_ | 1783673845004107776 |
---|---|
author | Ahmed, Sabit Rahman, Afrida Hasan, Md. Al Mehedi Islam, Md Khaled Ben Rahman, Julia Ahmad, Shamim |
author_facet | Ahmed, Sabit Rahman, Afrida Hasan, Md. Al Mehedi Islam, Md Khaled Ben Rahman, Julia Ahmad, Shamim |
author_sort | Ahmed, Sabit |
collection | PubMed |
description | Post-translational modification (PTM) involves covalent modification after the biosynthesis process and plays an essential role in the study of cell biology. Lysine phosphoglycerylation, a newly discovered reversible type of PTM that affects glycolytic enzyme activities, and is responsible for a wide variety of diseases, such as heart failure, arthritis, and degeneration of the nervous system. Our goal is to computationally characterize potential phosphoglycerylation sites to understand the functionality and causality more accurately. In this study, a novel computational tool, referred to as predPhogly-Site, has been developed to predict phosphoglycerylation sites in the protein. It has effectively utilized the probabilistic sequence-coupling information among the nearby amino acid residues of phosphoglycerylation sites along with a variable cost adjustment for the skewed training dataset to enhance the prediction characteristics. It has achieved around 99% accuracy with more than 0.96 MCC and 0.97 AUC in both 10-fold cross-validation and independent test. Even, the standard deviation in 10-fold cross-validation is almost negligible. This performance indicates that predPhogly-Site remarkably outperformed the existing prediction tools and can be used as a promising predictor, preferably with its web interface at http://103.99.176.239/predPhogly-Site. |
format | Online Article Text |
id | pubmed-8016359 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-80163592021-04-08 predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance Ahmed, Sabit Rahman, Afrida Hasan, Md. Al Mehedi Islam, Md Khaled Ben Rahman, Julia Ahmad, Shamim PLoS One Research Article Post-translational modification (PTM) involves covalent modification after the biosynthesis process and plays an essential role in the study of cell biology. Lysine phosphoglycerylation, a newly discovered reversible type of PTM that affects glycolytic enzyme activities, and is responsible for a wide variety of diseases, such as heart failure, arthritis, and degeneration of the nervous system. Our goal is to computationally characterize potential phosphoglycerylation sites to understand the functionality and causality more accurately. In this study, a novel computational tool, referred to as predPhogly-Site, has been developed to predict phosphoglycerylation sites in the protein. It has effectively utilized the probabilistic sequence-coupling information among the nearby amino acid residues of phosphoglycerylation sites along with a variable cost adjustment for the skewed training dataset to enhance the prediction characteristics. It has achieved around 99% accuracy with more than 0.96 MCC and 0.97 AUC in both 10-fold cross-validation and independent test. Even, the standard deviation in 10-fold cross-validation is almost negligible. This performance indicates that predPhogly-Site remarkably outperformed the existing prediction tools and can be used as a promising predictor, preferably with its web interface at http://103.99.176.239/predPhogly-Site. Public Library of Science 2021-04-01 /pmc/articles/PMC8016359/ /pubmed/33793659 http://dx.doi.org/10.1371/journal.pone.0249396 Text en © 2021 Ahmed et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Ahmed, Sabit Rahman, Afrida Hasan, Md. Al Mehedi Islam, Md Khaled Ben Rahman, Julia Ahmad, Shamim predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance |
title | predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance |
title_full | predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance |
title_fullStr | predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance |
title_full_unstemmed | predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance |
title_short | predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance |
title_sort | predphogly-site: predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into pseaac and addressing data imbalance |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8016359/ https://www.ncbi.nlm.nih.gov/pubmed/33793659 http://dx.doi.org/10.1371/journal.pone.0249396 |
work_keys_str_mv | AT ahmedsabit predphoglysitepredictingphosphoglycerylationsitesbyincorporatingprobabilisticsequencecouplinginformationintopseaacandaddressingdataimbalance AT rahmanafrida predphoglysitepredictingphosphoglycerylationsitesbyincorporatingprobabilisticsequencecouplinginformationintopseaacandaddressingdataimbalance AT hasanmdalmehedi predphoglysitepredictingphosphoglycerylationsitesbyincorporatingprobabilisticsequencecouplinginformationintopseaacandaddressingdataimbalance AT islammdkhaledben predphoglysitepredictingphosphoglycerylationsitesbyincorporatingprobabilisticsequencecouplinginformationintopseaacandaddressingdataimbalance AT rahmanjulia predphoglysitepredictingphosphoglycerylationsitesbyincorporatingprobabilisticsequencecouplinginformationintopseaacandaddressingdataimbalance AT ahmadshamim predphoglysitepredictingphosphoglycerylationsitesbyincorporatingprobabilisticsequencecouplinginformationintopseaacandaddressingdataimbalance |