Cargando…

Identification of Disease-Related 2-Oxoglutarate/Fe (II)-Dependent Oxygenase Based on Reduced Amino Acid Cluster Strategy

The 2-oxoglutarate/Fe (II)-dependent (2OG) oxygenase superfamily is mainly responsible for protein modification, nucleic acid repair and/or modification, and fatty acid metabolism and plays important roles in cancer, cardiovascular disease, and other diseases. They are likely to become new targets f...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Jian, Bo, Suling, Wang, Hao, Zheng, Lei, Liang, Pengfei, Zuo, Yongchun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8323781/
https://www.ncbi.nlm.nih.gov/pubmed/34336861
http://dx.doi.org/10.3389/fcell.2021.707938
_version_ 1783731306957373440
author Zhou, Jian
Bo, Suling
Wang, Hao
Zheng, Lei
Liang, Pengfei
Zuo, Yongchun
author_facet Zhou, Jian
Bo, Suling
Wang, Hao
Zheng, Lei
Liang, Pengfei
Zuo, Yongchun
author_sort Zhou, Jian
collection PubMed
description The 2-oxoglutarate/Fe (II)-dependent (2OG) oxygenase superfamily is mainly responsible for protein modification, nucleic acid repair and/or modification, and fatty acid metabolism and plays important roles in cancer, cardiovascular disease, and other diseases. They are likely to become new targets for the treatment of cancer and other diseases, so the accurate identification of 2OG oxygenases is of great significance. Many computational methods have been proposed to predict functional proteins to compensate for the time-consuming and expensive experimental identification. However, machine learning has not been applied to the study of 2OG oxygenases. In this study, we developed OGFE_RAAC, a prediction model to identify whether a protein is a 2OG oxygenase. To improve the performance of OGFE_RAAC, 673 amino acid reduction alphabets were used to determine the optimal feature representation scheme by recoding the protein sequence. The 10-fold cross-validation test showed that the accuracy of the model in identifying 2OG oxygenases is 91.04%. Besides, the independent dataset results also proved that the model has excellent generalization and robustness. It is expected to become an effective tool for the identification of 2OG oxygenases. With further research, we have also found that the function of 2OG oxygenases may be related to their polarity and hydrophobicity, which will help the follow-up study on the catalytic mechanism of 2OG oxygenases and the way they interact with the substrate. Based on the model we built, a user-friendly web server was established and can be friendly accessed at http://bioinfor.imu.edu.cn/ogferaac.
format Online
Article
Text
id pubmed-8323781
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-83237812021-07-31 Identification of Disease-Related 2-Oxoglutarate/Fe (II)-Dependent Oxygenase Based on Reduced Amino Acid Cluster Strategy Zhou, Jian Bo, Suling Wang, Hao Zheng, Lei Liang, Pengfei Zuo, Yongchun Front Cell Dev Biol Cell and Developmental Biology The 2-oxoglutarate/Fe (II)-dependent (2OG) oxygenase superfamily is mainly responsible for protein modification, nucleic acid repair and/or modification, and fatty acid metabolism and plays important roles in cancer, cardiovascular disease, and other diseases. They are likely to become new targets for the treatment of cancer and other diseases, so the accurate identification of 2OG oxygenases is of great significance. Many computational methods have been proposed to predict functional proteins to compensate for the time-consuming and expensive experimental identification. However, machine learning has not been applied to the study of 2OG oxygenases. In this study, we developed OGFE_RAAC, a prediction model to identify whether a protein is a 2OG oxygenase. To improve the performance of OGFE_RAAC, 673 amino acid reduction alphabets were used to determine the optimal feature representation scheme by recoding the protein sequence. The 10-fold cross-validation test showed that the accuracy of the model in identifying 2OG oxygenases is 91.04%. Besides, the independent dataset results also proved that the model has excellent generalization and robustness. It is expected to become an effective tool for the identification of 2OG oxygenases. With further research, we have also found that the function of 2OG oxygenases may be related to their polarity and hydrophobicity, which will help the follow-up study on the catalytic mechanism of 2OG oxygenases and the way they interact with the substrate. Based on the model we built, a user-friendly web server was established and can be friendly accessed at http://bioinfor.imu.edu.cn/ogferaac. Frontiers Media S.A. 2021-07-16 /pmc/articles/PMC8323781/ /pubmed/34336861 http://dx.doi.org/10.3389/fcell.2021.707938 Text en Copyright © 2021 Zhou, Bo, Wang, Zheng, Liang and Zuo. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Cell and Developmental Biology
Zhou, Jian
Bo, Suling
Wang, Hao
Zheng, Lei
Liang, Pengfei
Zuo, Yongchun
Identification of Disease-Related 2-Oxoglutarate/Fe (II)-Dependent Oxygenase Based on Reduced Amino Acid Cluster Strategy
title Identification of Disease-Related 2-Oxoglutarate/Fe (II)-Dependent Oxygenase Based on Reduced Amino Acid Cluster Strategy
title_full Identification of Disease-Related 2-Oxoglutarate/Fe (II)-Dependent Oxygenase Based on Reduced Amino Acid Cluster Strategy
title_fullStr Identification of Disease-Related 2-Oxoglutarate/Fe (II)-Dependent Oxygenase Based on Reduced Amino Acid Cluster Strategy
title_full_unstemmed Identification of Disease-Related 2-Oxoglutarate/Fe (II)-Dependent Oxygenase Based on Reduced Amino Acid Cluster Strategy
title_short Identification of Disease-Related 2-Oxoglutarate/Fe (II)-Dependent Oxygenase Based on Reduced Amino Acid Cluster Strategy
title_sort identification of disease-related 2-oxoglutarate/fe (ii)-dependent oxygenase based on reduced amino acid cluster strategy
topic Cell and Developmental Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8323781/
https://www.ncbi.nlm.nih.gov/pubmed/34336861
http://dx.doi.org/10.3389/fcell.2021.707938
work_keys_str_mv AT zhoujian identificationofdiseaserelated2oxoglutaratefeiidependentoxygenasebasedonreducedaminoacidclusterstrategy
AT bosuling identificationofdiseaserelated2oxoglutaratefeiidependentoxygenasebasedonreducedaminoacidclusterstrategy
AT wanghao identificationofdiseaserelated2oxoglutaratefeiidependentoxygenasebasedonreducedaminoacidclusterstrategy
AT zhenglei identificationofdiseaserelated2oxoglutaratefeiidependentoxygenasebasedonreducedaminoacidclusterstrategy
AT liangpengfei identificationofdiseaserelated2oxoglutaratefeiidependentoxygenasebasedonreducedaminoacidclusterstrategy
AT zuoyongchun identificationofdiseaserelated2oxoglutaratefeiidependentoxygenasebasedonreducedaminoacidclusterstrategy