Cargando…
An Ensemble Deep Learning based Predictor for Simultaneously Identifying Protein Ubiquitylation and SUMOylation Sites
BACKGROUND: Several computational tools for predicting protein Ubiquitylation and SUMOylation sites have been proposed to study their regulatory roles in gene location, gene expression, and genome replication. However, existing methods generally rely on feature engineering, and ignore the natural si...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8543953/ https://www.ncbi.nlm.nih.gov/pubmed/34689734 http://dx.doi.org/10.1186/s12859-021-04445-5 |
_version_ | 1784589718203138048 |
---|---|
author | He, Fei Li, Jingyi Wang, Rui Zhao, Xiaowei Han, Ye |
author_facet | He, Fei Li, Jingyi Wang, Rui Zhao, Xiaowei Han, Ye |
author_sort | He, Fei |
collection | PubMed |
description | BACKGROUND: Several computational tools for predicting protein Ubiquitylation and SUMOylation sites have been proposed to study their regulatory roles in gene location, gene expression, and genome replication. However, existing methods generally rely on feature engineering, and ignore the natural similarity between the two types of protein translational modification. This study is the first all-in-one deep network to predict protein Ubiquitylation and SUMOylation sites from protein sequences as well as their crosstalk sites simultaneously. Our deep learning architecture integrates several meta classifiers that apply deep neural networks to protein sequence information and physico-chemical properties, which were trained on multi-label classification mode for simultaneously identifying protein Ubiquitylation and SUMOylation as well as their crosstalk sites. RESULTS: The promising AUCs of our method on Ubiquitylation, SUMOylation and crosstalk sites achieved 0.838, 0.888, and 0.862 respectively on tenfold cross-validation. The corresponding APs reached 0.683, 0.804 and 0.552, which also validated our effectiveness. CONCLUSIONS: The proposed architecture managed to classify ubiquitylated and SUMOylated lysine residues along with their crosstalk sites, and outperformed other well-known Ubiquitylation and SUMOylation site prediction tools. |
format | Online Article Text |
id | pubmed-8543953 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-85439532021-10-26 An Ensemble Deep Learning based Predictor for Simultaneously Identifying Protein Ubiquitylation and SUMOylation Sites He, Fei Li, Jingyi Wang, Rui Zhao, Xiaowei Han, Ye BMC Bioinformatics Methodology Article BACKGROUND: Several computational tools for predicting protein Ubiquitylation and SUMOylation sites have been proposed to study their regulatory roles in gene location, gene expression, and genome replication. However, existing methods generally rely on feature engineering, and ignore the natural similarity between the two types of protein translational modification. This study is the first all-in-one deep network to predict protein Ubiquitylation and SUMOylation sites from protein sequences as well as their crosstalk sites simultaneously. Our deep learning architecture integrates several meta classifiers that apply deep neural networks to protein sequence information and physico-chemical properties, which were trained on multi-label classification mode for simultaneously identifying protein Ubiquitylation and SUMOylation as well as their crosstalk sites. RESULTS: The promising AUCs of our method on Ubiquitylation, SUMOylation and crosstalk sites achieved 0.838, 0.888, and 0.862 respectively on tenfold cross-validation. The corresponding APs reached 0.683, 0.804 and 0.552, which also validated our effectiveness. CONCLUSIONS: The proposed architecture managed to classify ubiquitylated and SUMOylated lysine residues along with their crosstalk sites, and outperformed other well-known Ubiquitylation and SUMOylation site prediction tools. BioMed Central 2021-10-24 /pmc/articles/PMC8543953/ /pubmed/34689734 http://dx.doi.org/10.1186/s12859-021-04445-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Article He, Fei Li, Jingyi Wang, Rui Zhao, Xiaowei Han, Ye An Ensemble Deep Learning based Predictor for Simultaneously Identifying Protein Ubiquitylation and SUMOylation Sites |
title | An Ensemble Deep Learning based Predictor for Simultaneously Identifying Protein Ubiquitylation and SUMOylation Sites |
title_full | An Ensemble Deep Learning based Predictor for Simultaneously Identifying Protein Ubiquitylation and SUMOylation Sites |
title_fullStr | An Ensemble Deep Learning based Predictor for Simultaneously Identifying Protein Ubiquitylation and SUMOylation Sites |
title_full_unstemmed | An Ensemble Deep Learning based Predictor for Simultaneously Identifying Protein Ubiquitylation and SUMOylation Sites |
title_short | An Ensemble Deep Learning based Predictor for Simultaneously Identifying Protein Ubiquitylation and SUMOylation Sites |
title_sort | ensemble deep learning based predictor for simultaneously identifying protein ubiquitylation and sumoylation sites |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8543953/ https://www.ncbi.nlm.nih.gov/pubmed/34689734 http://dx.doi.org/10.1186/s12859-021-04445-5 |
work_keys_str_mv | AT hefei anensembledeeplearningbasedpredictorforsimultaneouslyidentifyingproteinubiquitylationandsumoylationsites AT lijingyi anensembledeeplearningbasedpredictorforsimultaneouslyidentifyingproteinubiquitylationandsumoylationsites AT wangrui anensembledeeplearningbasedpredictorforsimultaneouslyidentifyingproteinubiquitylationandsumoylationsites AT zhaoxiaowei anensembledeeplearningbasedpredictorforsimultaneouslyidentifyingproteinubiquitylationandsumoylationsites AT hanye anensembledeeplearningbasedpredictorforsimultaneouslyidentifyingproteinubiquitylationandsumoylationsites AT hefei ensembledeeplearningbasedpredictorforsimultaneouslyidentifyingproteinubiquitylationandsumoylationsites AT lijingyi ensembledeeplearningbasedpredictorforsimultaneouslyidentifyingproteinubiquitylationandsumoylationsites AT wangrui ensembledeeplearningbasedpredictorforsimultaneouslyidentifyingproteinubiquitylationandsumoylationsites AT zhaoxiaowei ensembledeeplearningbasedpredictorforsimultaneouslyidentifyingproteinubiquitylationandsumoylationsites AT hanye ensembledeeplearningbasedpredictorforsimultaneouslyidentifyingproteinubiquitylationandsumoylationsites |