Cargando…

A deep learning based approach for prediction of Chlamydomonas reinhardtii phosphorylation sites

Protein phosphorylation, which is one of the most important post-translational modifications (PTMs), is involved in regulating myriad cellular processes. Herein, we present a novel deep learning based approach for organism-specific protein phosphorylation site prediction in Chlamydomonas reinhardtii...

Descripción completa

Detalles Bibliográficos
Autores principales: Thapa, Niraj, Chaudhari, Meenal, Iannetta, Anthony A., White, Clarence, Roy, Kaushik, Newman, Robert H., Hicks, Leslie M., KC, Dukka B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8206365/
https://www.ncbi.nlm.nih.gov/pubmed/34131195
http://dx.doi.org/10.1038/s41598-021-91840-w
_version_ 1783708621424558080
author Thapa, Niraj
Chaudhari, Meenal
Iannetta, Anthony A.
White, Clarence
Roy, Kaushik
Newman, Robert H.
Hicks, Leslie M.
KC, Dukka B.
author_facet Thapa, Niraj
Chaudhari, Meenal
Iannetta, Anthony A.
White, Clarence
Roy, Kaushik
Newman, Robert H.
Hicks, Leslie M.
KC, Dukka B.
author_sort Thapa, Niraj
collection PubMed
description Protein phosphorylation, which is one of the most important post-translational modifications (PTMs), is involved in regulating myriad cellular processes. Herein, we present a novel deep learning based approach for organism-specific protein phosphorylation site prediction in Chlamydomonas reinhardtii, a model algal phototroph. An ensemble model combining convolutional neural networks and long short-term memory (LSTM) achieves the best performance in predicting phosphorylation sites in C. reinhardtii. Deemed Chlamy-EnPhosSite, the measured best AUC and MCC are 0.90 and 0.64 respectively for a combined dataset of serine (S) and threonine (T) in independent testing higher than those measures for other predictors. When applied to the entire C. reinhardtii proteome (totaling 1,809,304 S and T sites), Chlamy-EnPhosSite yielded 499,411 phosphorylated sites with a cut-off value of 0.5 and 237,949 phosphorylated sites with a cut-off value of 0.7. These predictions were compared to an experimental dataset of phosphosites identified by liquid chromatography-tandem mass spectrometry (LC–MS/MS) in a blinded study and approximately 89.69% of 2,663 C. reinhardtii S and T phosphorylation sites were successfully predicted by Chlamy-EnPhosSite at a probability cut-off of 0.5 and 76.83% of sites were successfully identified at a more stringent 0.7 cut-off. Interestingly, Chlamy-EnPhosSite also successfully predicted experimentally confirmed phosphorylation sites in a protein sequence (e.g., RPS6 S245) which did not appear in the training dataset, highlighting prediction accuracy and the power of leveraging predictions to identify biologically relevant PTM sites. These results demonstrate that our method represents a robust and complementary technique for high-throughput phosphorylation site prediction in C. reinhardtii. It has potential to serve as a useful tool to the community. Chlamy-EnPhosSite will contribute to the understanding of how protein phosphorylation influences various biological processes in this important model microalga.
format Online
Article
Text
id pubmed-8206365
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-82063652021-06-17 A deep learning based approach for prediction of Chlamydomonas reinhardtii phosphorylation sites Thapa, Niraj Chaudhari, Meenal Iannetta, Anthony A. White, Clarence Roy, Kaushik Newman, Robert H. Hicks, Leslie M. KC, Dukka B. Sci Rep Article Protein phosphorylation, which is one of the most important post-translational modifications (PTMs), is involved in regulating myriad cellular processes. Herein, we present a novel deep learning based approach for organism-specific protein phosphorylation site prediction in Chlamydomonas reinhardtii, a model algal phototroph. An ensemble model combining convolutional neural networks and long short-term memory (LSTM) achieves the best performance in predicting phosphorylation sites in C. reinhardtii. Deemed Chlamy-EnPhosSite, the measured best AUC and MCC are 0.90 and 0.64 respectively for a combined dataset of serine (S) and threonine (T) in independent testing higher than those measures for other predictors. When applied to the entire C. reinhardtii proteome (totaling 1,809,304 S and T sites), Chlamy-EnPhosSite yielded 499,411 phosphorylated sites with a cut-off value of 0.5 and 237,949 phosphorylated sites with a cut-off value of 0.7. These predictions were compared to an experimental dataset of phosphosites identified by liquid chromatography-tandem mass spectrometry (LC–MS/MS) in a blinded study and approximately 89.69% of 2,663 C. reinhardtii S and T phosphorylation sites were successfully predicted by Chlamy-EnPhosSite at a probability cut-off of 0.5 and 76.83% of sites were successfully identified at a more stringent 0.7 cut-off. Interestingly, Chlamy-EnPhosSite also successfully predicted experimentally confirmed phosphorylation sites in a protein sequence (e.g., RPS6 S245) which did not appear in the training dataset, highlighting prediction accuracy and the power of leveraging predictions to identify biologically relevant PTM sites. These results demonstrate that our method represents a robust and complementary technique for high-throughput phosphorylation site prediction in C. reinhardtii. It has potential to serve as a useful tool to the community. Chlamy-EnPhosSite will contribute to the understanding of how protein phosphorylation influences various biological processes in this important model microalga. Nature Publishing Group UK 2021-06-15 /pmc/articles/PMC8206365/ /pubmed/34131195 http://dx.doi.org/10.1038/s41598-021-91840-w Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Thapa, Niraj
Chaudhari, Meenal
Iannetta, Anthony A.
White, Clarence
Roy, Kaushik
Newman, Robert H.
Hicks, Leslie M.
KC, Dukka B.
A deep learning based approach for prediction of Chlamydomonas reinhardtii phosphorylation sites
title A deep learning based approach for prediction of Chlamydomonas reinhardtii phosphorylation sites
title_full A deep learning based approach for prediction of Chlamydomonas reinhardtii phosphorylation sites
title_fullStr A deep learning based approach for prediction of Chlamydomonas reinhardtii phosphorylation sites
title_full_unstemmed A deep learning based approach for prediction of Chlamydomonas reinhardtii phosphorylation sites
title_short A deep learning based approach for prediction of Chlamydomonas reinhardtii phosphorylation sites
title_sort deep learning based approach for prediction of chlamydomonas reinhardtii phosphorylation sites
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8206365/
https://www.ncbi.nlm.nih.gov/pubmed/34131195
http://dx.doi.org/10.1038/s41598-021-91840-w
work_keys_str_mv AT thapaniraj adeeplearningbasedapproachforpredictionofchlamydomonasreinhardtiiphosphorylationsites
AT chaudharimeenal adeeplearningbasedapproachforpredictionofchlamydomonasreinhardtiiphosphorylationsites
AT iannettaanthonya adeeplearningbasedapproachforpredictionofchlamydomonasreinhardtiiphosphorylationsites
AT whiteclarence adeeplearningbasedapproachforpredictionofchlamydomonasreinhardtiiphosphorylationsites
AT roykaushik adeeplearningbasedapproachforpredictionofchlamydomonasreinhardtiiphosphorylationsites
AT newmanroberth adeeplearningbasedapproachforpredictionofchlamydomonasreinhardtiiphosphorylationsites
AT hickslesliem adeeplearningbasedapproachforpredictionofchlamydomonasreinhardtiiphosphorylationsites
AT kcdukkab adeeplearningbasedapproachforpredictionofchlamydomonasreinhardtiiphosphorylationsites
AT thapaniraj deeplearningbasedapproachforpredictionofchlamydomonasreinhardtiiphosphorylationsites
AT chaudharimeenal deeplearningbasedapproachforpredictionofchlamydomonasreinhardtiiphosphorylationsites
AT iannettaanthonya deeplearningbasedapproachforpredictionofchlamydomonasreinhardtiiphosphorylationsites
AT whiteclarence deeplearningbasedapproachforpredictionofchlamydomonasreinhardtiiphosphorylationsites
AT roykaushik deeplearningbasedapproachforpredictionofchlamydomonasreinhardtiiphosphorylationsites
AT newmanroberth deeplearningbasedapproachforpredictionofchlamydomonasreinhardtiiphosphorylationsites
AT hickslesliem deeplearningbasedapproachforpredictionofchlamydomonasreinhardtiiphosphorylationsites
AT kcdukkab deeplearningbasedapproachforpredictionofchlamydomonasreinhardtiiphosphorylationsites