Cargando…
An application of topological data analysis in predicting sumoylation sites
Sumoylation is a reversible post-translational modification that regulates certain significant biochemical functions in proteins. The protein alterations caused by sumoylation are associated with the incidence of some human diseases. Therefore, identifying the sites of sumoylation in proteins may pr...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10576966/ https://www.ncbi.nlm.nih.gov/pubmed/37846308 http://dx.doi.org/10.7717/peerj.16204 |
_version_ | 1785121227382194176 |
---|---|
author | Lin, Xiaoxi Gao, Yaru Lei, Fengchun |
author_facet | Lin, Xiaoxi Gao, Yaru Lei, Fengchun |
author_sort | Lin, Xiaoxi |
collection | PubMed |
description | Sumoylation is a reversible post-translational modification that regulates certain significant biochemical functions in proteins. The protein alterations caused by sumoylation are associated with the incidence of some human diseases. Therefore, identifying the sites of sumoylation in proteins may provide a direction for mechanistic research and drug development. Here, we propose a new computational approach for identifying sumoylation sites using an encoding method based on topological data analysis. The features of our model captured the key physical and biological properties of proteins at multiple scales. In a 10-fold cross validation, the outcomes of our model showed 96.45% of sensitivity (Sn), 94.65% of accuracy (Acc), 0.8946 of Matthew’s correlation coefficient (MCC), and 0.99 of area under curve (AUC). The proposed predictor with only topological features achieves the best MCC and AUC in comparison to the other released methods. Our results suggest that topological information is an additional parameter that can assist in the prediction of sumoylation sites and provide a novel perspective for further research in protein sumoylation. |
format | Online Article Text |
id | pubmed-10576966 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-105769662023-10-16 An application of topological data analysis in predicting sumoylation sites Lin, Xiaoxi Gao, Yaru Lei, Fengchun PeerJ Biochemistry Sumoylation is a reversible post-translational modification that regulates certain significant biochemical functions in proteins. The protein alterations caused by sumoylation are associated with the incidence of some human diseases. Therefore, identifying the sites of sumoylation in proteins may provide a direction for mechanistic research and drug development. Here, we propose a new computational approach for identifying sumoylation sites using an encoding method based on topological data analysis. The features of our model captured the key physical and biological properties of proteins at multiple scales. In a 10-fold cross validation, the outcomes of our model showed 96.45% of sensitivity (Sn), 94.65% of accuracy (Acc), 0.8946 of Matthew’s correlation coefficient (MCC), and 0.99 of area under curve (AUC). The proposed predictor with only topological features achieves the best MCC and AUC in comparison to the other released methods. Our results suggest that topological information is an additional parameter that can assist in the prediction of sumoylation sites and provide a novel perspective for further research in protein sumoylation. PeerJ Inc. 2023-10-12 /pmc/articles/PMC10576966/ /pubmed/37846308 http://dx.doi.org/10.7717/peerj.16204 Text en ©2023 Lin et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Biochemistry Lin, Xiaoxi Gao, Yaru Lei, Fengchun An application of topological data analysis in predicting sumoylation sites |
title | An application of topological data analysis in predicting sumoylation sites |
title_full | An application of topological data analysis in predicting sumoylation sites |
title_fullStr | An application of topological data analysis in predicting sumoylation sites |
title_full_unstemmed | An application of topological data analysis in predicting sumoylation sites |
title_short | An application of topological data analysis in predicting sumoylation sites |
title_sort | application of topological data analysis in predicting sumoylation sites |
topic | Biochemistry |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10576966/ https://www.ncbi.nlm.nih.gov/pubmed/37846308 http://dx.doi.org/10.7717/peerj.16204 |
work_keys_str_mv | AT linxiaoxi anapplicationoftopologicaldataanalysisinpredictingsumoylationsites AT gaoyaru anapplicationoftopologicaldataanalysisinpredictingsumoylationsites AT leifengchun anapplicationoftopologicaldataanalysisinpredictingsumoylationsites AT linxiaoxi applicationoftopologicaldataanalysisinpredictingsumoylationsites AT gaoyaru applicationoftopologicaldataanalysisinpredictingsumoylationsites AT leifengchun applicationoftopologicaldataanalysisinpredictingsumoylationsites |