Cargando…

predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance

Post-translational modification (PTM) involves covalent modification after the biosynthesis process and plays an essential role in the study of cell biology. Lysine phosphoglycerylation, a newly discovered reversible type of PTM that affects glycolytic enzyme activities, and is responsible for a wid...

Descripción completa

Detalles Bibliográficos
Autores principales: Ahmed, Sabit, Rahman, Afrida, Hasan, Md. Al Mehedi, Islam, Md Khaled Ben, Rahman, Julia, Ahmad, Shamim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8016359/
https://www.ncbi.nlm.nih.gov/pubmed/33793659
http://dx.doi.org/10.1371/journal.pone.0249396
_version_ 1783673845004107776
author Ahmed, Sabit
Rahman, Afrida
Hasan, Md. Al Mehedi
Islam, Md Khaled Ben
Rahman, Julia
Ahmad, Shamim
author_facet Ahmed, Sabit
Rahman, Afrida
Hasan, Md. Al Mehedi
Islam, Md Khaled Ben
Rahman, Julia
Ahmad, Shamim
author_sort Ahmed, Sabit
collection PubMed
description Post-translational modification (PTM) involves covalent modification after the biosynthesis process and plays an essential role in the study of cell biology. Lysine phosphoglycerylation, a newly discovered reversible type of PTM that affects glycolytic enzyme activities, and is responsible for a wide variety of diseases, such as heart failure, arthritis, and degeneration of the nervous system. Our goal is to computationally characterize potential phosphoglycerylation sites to understand the functionality and causality more accurately. In this study, a novel computational tool, referred to as predPhogly-Site, has been developed to predict phosphoglycerylation sites in the protein. It has effectively utilized the probabilistic sequence-coupling information among the nearby amino acid residues of phosphoglycerylation sites along with a variable cost adjustment for the skewed training dataset to enhance the prediction characteristics. It has achieved around 99% accuracy with more than 0.96 MCC and 0.97 AUC in both 10-fold cross-validation and independent test. Even, the standard deviation in 10-fold cross-validation is almost negligible. This performance indicates that predPhogly-Site remarkably outperformed the existing prediction tools and can be used as a promising predictor, preferably with its web interface at http://103.99.176.239/predPhogly-Site.
format Online
Article
Text
id pubmed-8016359
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-80163592021-04-08 predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance Ahmed, Sabit Rahman, Afrida Hasan, Md. Al Mehedi Islam, Md Khaled Ben Rahman, Julia Ahmad, Shamim PLoS One Research Article Post-translational modification (PTM) involves covalent modification after the biosynthesis process and plays an essential role in the study of cell biology. Lysine phosphoglycerylation, a newly discovered reversible type of PTM that affects glycolytic enzyme activities, and is responsible for a wide variety of diseases, such as heart failure, arthritis, and degeneration of the nervous system. Our goal is to computationally characterize potential phosphoglycerylation sites to understand the functionality and causality more accurately. In this study, a novel computational tool, referred to as predPhogly-Site, has been developed to predict phosphoglycerylation sites in the protein. It has effectively utilized the probabilistic sequence-coupling information among the nearby amino acid residues of phosphoglycerylation sites along with a variable cost adjustment for the skewed training dataset to enhance the prediction characteristics. It has achieved around 99% accuracy with more than 0.96 MCC and 0.97 AUC in both 10-fold cross-validation and independent test. Even, the standard deviation in 10-fold cross-validation is almost negligible. This performance indicates that predPhogly-Site remarkably outperformed the existing prediction tools and can be used as a promising predictor, preferably with its web interface at http://103.99.176.239/predPhogly-Site. Public Library of Science 2021-04-01 /pmc/articles/PMC8016359/ /pubmed/33793659 http://dx.doi.org/10.1371/journal.pone.0249396 Text en © 2021 Ahmed et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Ahmed, Sabit
Rahman, Afrida
Hasan, Md. Al Mehedi
Islam, Md Khaled Ben
Rahman, Julia
Ahmad, Shamim
predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance
title predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance
title_full predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance
title_fullStr predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance
title_full_unstemmed predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance
title_short predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance
title_sort predphogly-site: predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into pseaac and addressing data imbalance
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8016359/
https://www.ncbi.nlm.nih.gov/pubmed/33793659
http://dx.doi.org/10.1371/journal.pone.0249396
work_keys_str_mv AT ahmedsabit predphoglysitepredictingphosphoglycerylationsitesbyincorporatingprobabilisticsequencecouplinginformationintopseaacandaddressingdataimbalance
AT rahmanafrida predphoglysitepredictingphosphoglycerylationsitesbyincorporatingprobabilisticsequencecouplinginformationintopseaacandaddressingdataimbalance
AT hasanmdalmehedi predphoglysitepredictingphosphoglycerylationsitesbyincorporatingprobabilisticsequencecouplinginformationintopseaacandaddressingdataimbalance
AT islammdkhaledben predphoglysitepredictingphosphoglycerylationsitesbyincorporatingprobabilisticsequencecouplinginformationintopseaacandaddressingdataimbalance
AT rahmanjulia predphoglysitepredictingphosphoglycerylationsitesbyincorporatingprobabilisticsequencecouplinginformationintopseaacandaddressingdataimbalance
AT ahmadshamim predphoglysitepredictingphosphoglycerylationsitesbyincorporatingprobabilisticsequencecouplinginformationintopseaacandaddressingdataimbalance