Cargando…

An application of topological data analysis in predicting sumoylation sites

Sumoylation is a reversible post-translational modification that regulates certain significant biochemical functions in proteins. The protein alterations caused by sumoylation are associated with the incidence of some human diseases. Therefore, identifying the sites of sumoylation in proteins may pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Xiaoxi, Gao, Yaru, Lei, Fengchun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10576966/
https://www.ncbi.nlm.nih.gov/pubmed/37846308
http://dx.doi.org/10.7717/peerj.16204
_version_ 1785121227382194176
author Lin, Xiaoxi
Gao, Yaru
Lei, Fengchun
author_facet Lin, Xiaoxi
Gao, Yaru
Lei, Fengchun
author_sort Lin, Xiaoxi
collection PubMed
description Sumoylation is a reversible post-translational modification that regulates certain significant biochemical functions in proteins. The protein alterations caused by sumoylation are associated with the incidence of some human diseases. Therefore, identifying the sites of sumoylation in proteins may provide a direction for mechanistic research and drug development. Here, we propose a new computational approach for identifying sumoylation sites using an encoding method based on topological data analysis. The features of our model captured the key physical and biological properties of proteins at multiple scales. In a 10-fold cross validation, the outcomes of our model showed 96.45% of sensitivity (Sn), 94.65% of accuracy (Acc), 0.8946 of Matthew’s correlation coefficient (MCC), and 0.99 of area under curve (AUC). The proposed predictor with only topological features achieves the best MCC and AUC in comparison to the other released methods. Our results suggest that topological information is an additional parameter that can assist in the prediction of sumoylation sites and provide a novel perspective for further research in protein sumoylation.
format Online
Article
Text
id pubmed-10576966
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-105769662023-10-16 An application of topological data analysis in predicting sumoylation sites Lin, Xiaoxi Gao, Yaru Lei, Fengchun PeerJ Biochemistry Sumoylation is a reversible post-translational modification that regulates certain significant biochemical functions in proteins. The protein alterations caused by sumoylation are associated with the incidence of some human diseases. Therefore, identifying the sites of sumoylation in proteins may provide a direction for mechanistic research and drug development. Here, we propose a new computational approach for identifying sumoylation sites using an encoding method based on topological data analysis. The features of our model captured the key physical and biological properties of proteins at multiple scales. In a 10-fold cross validation, the outcomes of our model showed 96.45% of sensitivity (Sn), 94.65% of accuracy (Acc), 0.8946 of Matthew’s correlation coefficient (MCC), and 0.99 of area under curve (AUC). The proposed predictor with only topological features achieves the best MCC and AUC in comparison to the other released methods. Our results suggest that topological information is an additional parameter that can assist in the prediction of sumoylation sites and provide a novel perspective for further research in protein sumoylation. PeerJ Inc. 2023-10-12 /pmc/articles/PMC10576966/ /pubmed/37846308 http://dx.doi.org/10.7717/peerj.16204 Text en ©2023 Lin et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Biochemistry
Lin, Xiaoxi
Gao, Yaru
Lei, Fengchun
An application of topological data analysis in predicting sumoylation sites
title An application of topological data analysis in predicting sumoylation sites
title_full An application of topological data analysis in predicting sumoylation sites
title_fullStr An application of topological data analysis in predicting sumoylation sites
title_full_unstemmed An application of topological data analysis in predicting sumoylation sites
title_short An application of topological data analysis in predicting sumoylation sites
title_sort application of topological data analysis in predicting sumoylation sites
topic Biochemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10576966/
https://www.ncbi.nlm.nih.gov/pubmed/37846308
http://dx.doi.org/10.7717/peerj.16204
work_keys_str_mv AT linxiaoxi anapplicationoftopologicaldataanalysisinpredictingsumoylationsites
AT gaoyaru anapplicationoftopologicaldataanalysisinpredictingsumoylationsites
AT leifengchun anapplicationoftopologicaldataanalysisinpredictingsumoylationsites
AT linxiaoxi applicationoftopologicaldataanalysisinpredictingsumoylationsites
AT gaoyaru applicationoftopologicaldataanalysisinpredictingsumoylationsites
AT leifengchun applicationoftopologicaldataanalysisinpredictingsumoylationsites