Cargando…

iAcety–SmRF: Identification of Acetylation Protein by Using Statistical Moments and Random Forest

Acetylation is the most important post-translation modification (PTM) in eukaryotes; it has manifold effects on the level of protein that transform an acetyl group from an acetyl coenzyme to a specific site on a polypeptide chain. Acetylation sites play many important roles, including regulating mem...

Descripción completa

Detalles Bibliográficos
Autores principales: Malebary, Sharaf, Rahman, Shaista, Barukab, Omar, Ash’ari, Rehab, Khan, Sher Afzal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8955084/
https://www.ncbi.nlm.nih.gov/pubmed/35323738
http://dx.doi.org/10.3390/membranes12030265
_version_ 1784676251608285184
author Malebary, Sharaf
Rahman, Shaista
Barukab, Omar
Ash’ari, Rehab
Khan, Sher Afzal
author_facet Malebary, Sharaf
Rahman, Shaista
Barukab, Omar
Ash’ari, Rehab
Khan, Sher Afzal
author_sort Malebary, Sharaf
collection PubMed
description Acetylation is the most important post-translation modification (PTM) in eukaryotes; it has manifold effects on the level of protein that transform an acetyl group from an acetyl coenzyme to a specific site on a polypeptide chain. Acetylation sites play many important roles, including regulating membrane protein functions and strongly affecting the membrane interaction of proteins and membrane remodeling. Because of these properties, its correct identification is essential to understand its mechanism in biological systems. As such, some traditional methods, such as mass spectrometry and site-directed mutagenesis, are used, but they are tedious and time-consuming. To overcome such limitations, many computer models are being developed to correctly identify their sequences from non-acetyl sequences, but they have poor efficiency in terms of accuracy, sensitivity, and specificity. This work proposes an efficient and accurate computational model for predicting Acetylation using machine learning approaches. The proposed model achieved an accuracy of 100 percent with the 10-fold cross-validation test based on the Random Forest classifier, along with a feature extraction approach using statistical moments. The model is also validated by the jackknife, self-consistency, and independent test, which achieved an accuracy of 100, 100, and 97, respectively, results far better as compared to the already existing models available in the literature.
format Online
Article
Text
id pubmed-8955084
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-89550842022-03-26 iAcety–SmRF: Identification of Acetylation Protein by Using Statistical Moments and Random Forest Malebary, Sharaf Rahman, Shaista Barukab, Omar Ash’ari, Rehab Khan, Sher Afzal Membranes (Basel) Article Acetylation is the most important post-translation modification (PTM) in eukaryotes; it has manifold effects on the level of protein that transform an acetyl group from an acetyl coenzyme to a specific site on a polypeptide chain. Acetylation sites play many important roles, including regulating membrane protein functions and strongly affecting the membrane interaction of proteins and membrane remodeling. Because of these properties, its correct identification is essential to understand its mechanism in biological systems. As such, some traditional methods, such as mass spectrometry and site-directed mutagenesis, are used, but they are tedious and time-consuming. To overcome such limitations, many computer models are being developed to correctly identify their sequences from non-acetyl sequences, but they have poor efficiency in terms of accuracy, sensitivity, and specificity. This work proposes an efficient and accurate computational model for predicting Acetylation using machine learning approaches. The proposed model achieved an accuracy of 100 percent with the 10-fold cross-validation test based on the Random Forest classifier, along with a feature extraction approach using statistical moments. The model is also validated by the jackknife, self-consistency, and independent test, which achieved an accuracy of 100, 100, and 97, respectively, results far better as compared to the already existing models available in the literature. MDPI 2022-02-25 /pmc/articles/PMC8955084/ /pubmed/35323738 http://dx.doi.org/10.3390/membranes12030265 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Malebary, Sharaf
Rahman, Shaista
Barukab, Omar
Ash’ari, Rehab
Khan, Sher Afzal
iAcety–SmRF: Identification of Acetylation Protein by Using Statistical Moments and Random Forest
title iAcety–SmRF: Identification of Acetylation Protein by Using Statistical Moments and Random Forest
title_full iAcety–SmRF: Identification of Acetylation Protein by Using Statistical Moments and Random Forest
title_fullStr iAcety–SmRF: Identification of Acetylation Protein by Using Statistical Moments and Random Forest
title_full_unstemmed iAcety–SmRF: Identification of Acetylation Protein by Using Statistical Moments and Random Forest
title_short iAcety–SmRF: Identification of Acetylation Protein by Using Statistical Moments and Random Forest
title_sort iacety–smrf: identification of acetylation protein by using statistical moments and random forest
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8955084/
https://www.ncbi.nlm.nih.gov/pubmed/35323738
http://dx.doi.org/10.3390/membranes12030265
work_keys_str_mv AT malebarysharaf iacetysmrfidentificationofacetylationproteinbyusingstatisticalmomentsandrandomforest
AT rahmanshaista iacetysmrfidentificationofacetylationproteinbyusingstatisticalmomentsandrandomforest
AT barukabomar iacetysmrfidentificationofacetylationproteinbyusingstatisticalmomentsandrandomforest
AT asharirehab iacetysmrfidentificationofacetylationproteinbyusingstatisticalmomentsandrandomforest
AT khansherafzal iacetysmrfidentificationofacetylationproteinbyusingstatisticalmomentsandrandomforest