Cargando…
iAcety–SmRF: Identification of Acetylation Protein by Using Statistical Moments and Random Forest
Acetylation is the most important post-translation modification (PTM) in eukaryotes; it has manifold effects on the level of protein that transform an acetyl group from an acetyl coenzyme to a specific site on a polypeptide chain. Acetylation sites play many important roles, including regulating mem...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8955084/ https://www.ncbi.nlm.nih.gov/pubmed/35323738 http://dx.doi.org/10.3390/membranes12030265 |
_version_ | 1784676251608285184 |
---|---|
author | Malebary, Sharaf Rahman, Shaista Barukab, Omar Ash’ari, Rehab Khan, Sher Afzal |
author_facet | Malebary, Sharaf Rahman, Shaista Barukab, Omar Ash’ari, Rehab Khan, Sher Afzal |
author_sort | Malebary, Sharaf |
collection | PubMed |
description | Acetylation is the most important post-translation modification (PTM) in eukaryotes; it has manifold effects on the level of protein that transform an acetyl group from an acetyl coenzyme to a specific site on a polypeptide chain. Acetylation sites play many important roles, including regulating membrane protein functions and strongly affecting the membrane interaction of proteins and membrane remodeling. Because of these properties, its correct identification is essential to understand its mechanism in biological systems. As such, some traditional methods, such as mass spectrometry and site-directed mutagenesis, are used, but they are tedious and time-consuming. To overcome such limitations, many computer models are being developed to correctly identify their sequences from non-acetyl sequences, but they have poor efficiency in terms of accuracy, sensitivity, and specificity. This work proposes an efficient and accurate computational model for predicting Acetylation using machine learning approaches. The proposed model achieved an accuracy of 100 percent with the 10-fold cross-validation test based on the Random Forest classifier, along with a feature extraction approach using statistical moments. The model is also validated by the jackknife, self-consistency, and independent test, which achieved an accuracy of 100, 100, and 97, respectively, results far better as compared to the already existing models available in the literature. |
format | Online Article Text |
id | pubmed-8955084 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-89550842022-03-26 iAcety–SmRF: Identification of Acetylation Protein by Using Statistical Moments and Random Forest Malebary, Sharaf Rahman, Shaista Barukab, Omar Ash’ari, Rehab Khan, Sher Afzal Membranes (Basel) Article Acetylation is the most important post-translation modification (PTM) in eukaryotes; it has manifold effects on the level of protein that transform an acetyl group from an acetyl coenzyme to a specific site on a polypeptide chain. Acetylation sites play many important roles, including regulating membrane protein functions and strongly affecting the membrane interaction of proteins and membrane remodeling. Because of these properties, its correct identification is essential to understand its mechanism in biological systems. As such, some traditional methods, such as mass spectrometry and site-directed mutagenesis, are used, but they are tedious and time-consuming. To overcome such limitations, many computer models are being developed to correctly identify their sequences from non-acetyl sequences, but they have poor efficiency in terms of accuracy, sensitivity, and specificity. This work proposes an efficient and accurate computational model for predicting Acetylation using machine learning approaches. The proposed model achieved an accuracy of 100 percent with the 10-fold cross-validation test based on the Random Forest classifier, along with a feature extraction approach using statistical moments. The model is also validated by the jackknife, self-consistency, and independent test, which achieved an accuracy of 100, 100, and 97, respectively, results far better as compared to the already existing models available in the literature. MDPI 2022-02-25 /pmc/articles/PMC8955084/ /pubmed/35323738 http://dx.doi.org/10.3390/membranes12030265 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Malebary, Sharaf Rahman, Shaista Barukab, Omar Ash’ari, Rehab Khan, Sher Afzal iAcety–SmRF: Identification of Acetylation Protein by Using Statistical Moments and Random Forest |
title | iAcety–SmRF: Identification of Acetylation Protein by Using Statistical Moments and Random Forest |
title_full | iAcety–SmRF: Identification of Acetylation Protein by Using Statistical Moments and Random Forest |
title_fullStr | iAcety–SmRF: Identification of Acetylation Protein by Using Statistical Moments and Random Forest |
title_full_unstemmed | iAcety–SmRF: Identification of Acetylation Protein by Using Statistical Moments and Random Forest |
title_short | iAcety–SmRF: Identification of Acetylation Protein by Using Statistical Moments and Random Forest |
title_sort | iacety–smrf: identification of acetylation protein by using statistical moments and random forest |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8955084/ https://www.ncbi.nlm.nih.gov/pubmed/35323738 http://dx.doi.org/10.3390/membranes12030265 |
work_keys_str_mv | AT malebarysharaf iacetysmrfidentificationofacetylationproteinbyusingstatisticalmomentsandrandomforest AT rahmanshaista iacetysmrfidentificationofacetylationproteinbyusingstatisticalmomentsandrandomforest AT barukabomar iacetysmrfidentificationofacetylationproteinbyusingstatisticalmomentsandrandomforest AT asharirehab iacetysmrfidentificationofacetylationproteinbyusingstatisticalmomentsandrandomforest AT khansherafzal iacetysmrfidentificationofacetylationproteinbyusingstatisticalmomentsandrandomforest |