Cargando…

Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases

Phospholipases, as important lipolytic enzymes, have diverse industrial applications. Regarding the stability of extremophilic archaea’s proteins in harsh conditions, analyses of unusual features of their proteins are significantly important for their utilization. This research was accomplished to i...

Descripción completa

Detalles Bibliográficos
Autores principales: Samman, Nour, Mohabatkar, Hassan, Rabiei, Parisa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Shiraz University 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10387176/
https://www.ncbi.nlm.nih.gov/pubmed/37525666
http://dx.doi.org/10.22099/mbrc.2023.47756.1845
_version_ 1785081825873362944
author Samman, Nour
Mohabatkar, Hassan
Rabiei, Parisa
author_facet Samman, Nour
Mohabatkar, Hassan
Rabiei, Parisa
author_sort Samman, Nour
collection PubMed
description Phospholipases, as important lipolytic enzymes, have diverse industrial applications. Regarding the stability of extremophilic archaea’s proteins in harsh conditions, analyses of unusual features of their proteins are significantly important for their utilization. This research was accomplished to in silico study of archaeal phospholipases’ properties and to develop a pioneering method for distinguishing these enzymes from other archaeal enzymes via machine learning algorithms and Chou’s pseudo-amino acid composition concept. The non-redundant sequences of archaeal phospholipases were collected. BioSeq-Analysis sever was used with Support Vector Machine (SVM), Random Forests (RF), Covariance Discrimination (CD), and Optimized Evidence-Theoretic K-nearest Neighbor (OET-KNN) as powerful machine learnings algorithms. Also, different Chou’s pseudo-amino acid composition modes were performed and then, 5-fold cross-validation was applied to the sequences. Based on our results, the OET-KNN predictor, with 96% accuracy, yields the best performance in SC-PseAAC mode by 5-fold cross-validation. This predictor also achieved very high values of specificity (95%), sensitivity (96%), Matthews’s correlation coefficient (0.92), and accuracy (96%). The present investigation yielded a robust anticipatory model for the archaeal phospholipase prediction utilizing the tenets PseAAC and OET-KNN machine learning algorithm.
format Online
Article
Text
id pubmed-10387176
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Shiraz University
record_format MEDLINE/PubMed
spelling pubmed-103871762023-07-31 Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases Samman, Nour Mohabatkar, Hassan Rabiei, Parisa Mol Biol Res Commun Original Article Phospholipases, as important lipolytic enzymes, have diverse industrial applications. Regarding the stability of extremophilic archaea’s proteins in harsh conditions, analyses of unusual features of their proteins are significantly important for their utilization. This research was accomplished to in silico study of archaeal phospholipases’ properties and to develop a pioneering method for distinguishing these enzymes from other archaeal enzymes via machine learning algorithms and Chou’s pseudo-amino acid composition concept. The non-redundant sequences of archaeal phospholipases were collected. BioSeq-Analysis sever was used with Support Vector Machine (SVM), Random Forests (RF), Covariance Discrimination (CD), and Optimized Evidence-Theoretic K-nearest Neighbor (OET-KNN) as powerful machine learnings algorithms. Also, different Chou’s pseudo-amino acid composition modes were performed and then, 5-fold cross-validation was applied to the sequences. Based on our results, the OET-KNN predictor, with 96% accuracy, yields the best performance in SC-PseAAC mode by 5-fold cross-validation. This predictor also achieved very high values of specificity (95%), sensitivity (96%), Matthews’s correlation coefficient (0.92), and accuracy (96%). The present investigation yielded a robust anticipatory model for the archaeal phospholipase prediction utilizing the tenets PseAAC and OET-KNN machine learning algorithm. Shiraz University 2023 /pmc/articles/PMC10387176/ /pubmed/37525666 http://dx.doi.org/10.22099/mbrc.2023.47756.1845 Text en https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Original Article
Samman, Nour
Mohabatkar, Hassan
Rabiei, Parisa
Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases
title Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases
title_full Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases
title_fullStr Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases
title_full_unstemmed Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases
title_short Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases
title_sort using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10387176/
https://www.ncbi.nlm.nih.gov/pubmed/37525666
http://dx.doi.org/10.22099/mbrc.2023.47756.1845
work_keys_str_mv AT sammannour usingseveralpseudoaminoacidcompositiontypesanddifferentmachinelearningalgorithmstoclassifyandpredictarchaealphospholipases
AT mohabatkarhassan usingseveralpseudoaminoacidcompositiontypesanddifferentmachinelearningalgorithmstoclassifyandpredictarchaealphospholipases
AT rabieiparisa usingseveralpseudoaminoacidcompositiontypesanddifferentmachinelearningalgorithmstoclassifyandpredictarchaealphospholipases