Cargando…
Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases
Phospholipases, as important lipolytic enzymes, have diverse industrial applications. Regarding the stability of extremophilic archaea’s proteins in harsh conditions, analyses of unusual features of their proteins are significantly important for their utilization. This research was accomplished to i...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Shiraz University
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10387176/ https://www.ncbi.nlm.nih.gov/pubmed/37525666 http://dx.doi.org/10.22099/mbrc.2023.47756.1845 |
_version_ | 1785081825873362944 |
---|---|
author | Samman, Nour Mohabatkar, Hassan Rabiei, Parisa |
author_facet | Samman, Nour Mohabatkar, Hassan Rabiei, Parisa |
author_sort | Samman, Nour |
collection | PubMed |
description | Phospholipases, as important lipolytic enzymes, have diverse industrial applications. Regarding the stability of extremophilic archaea’s proteins in harsh conditions, analyses of unusual features of their proteins are significantly important for their utilization. This research was accomplished to in silico study of archaeal phospholipases’ properties and to develop a pioneering method for distinguishing these enzymes from other archaeal enzymes via machine learning algorithms and Chou’s pseudo-amino acid composition concept. The non-redundant sequences of archaeal phospholipases were collected. BioSeq-Analysis sever was used with Support Vector Machine (SVM), Random Forests (RF), Covariance Discrimination (CD), and Optimized Evidence-Theoretic K-nearest Neighbor (OET-KNN) as powerful machine learnings algorithms. Also, different Chou’s pseudo-amino acid composition modes were performed and then, 5-fold cross-validation was applied to the sequences. Based on our results, the OET-KNN predictor, with 96% accuracy, yields the best performance in SC-PseAAC mode by 5-fold cross-validation. This predictor also achieved very high values of specificity (95%), sensitivity (96%), Matthews’s correlation coefficient (0.92), and accuracy (96%). The present investigation yielded a robust anticipatory model for the archaeal phospholipase prediction utilizing the tenets PseAAC and OET-KNN machine learning algorithm. |
format | Online Article Text |
id | pubmed-10387176 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Shiraz University |
record_format | MEDLINE/PubMed |
spelling | pubmed-103871762023-07-31 Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases Samman, Nour Mohabatkar, Hassan Rabiei, Parisa Mol Biol Res Commun Original Article Phospholipases, as important lipolytic enzymes, have diverse industrial applications. Regarding the stability of extremophilic archaea’s proteins in harsh conditions, analyses of unusual features of their proteins are significantly important for their utilization. This research was accomplished to in silico study of archaeal phospholipases’ properties and to develop a pioneering method for distinguishing these enzymes from other archaeal enzymes via machine learning algorithms and Chou’s pseudo-amino acid composition concept. The non-redundant sequences of archaeal phospholipases were collected. BioSeq-Analysis sever was used with Support Vector Machine (SVM), Random Forests (RF), Covariance Discrimination (CD), and Optimized Evidence-Theoretic K-nearest Neighbor (OET-KNN) as powerful machine learnings algorithms. Also, different Chou’s pseudo-amino acid composition modes were performed and then, 5-fold cross-validation was applied to the sequences. Based on our results, the OET-KNN predictor, with 96% accuracy, yields the best performance in SC-PseAAC mode by 5-fold cross-validation. This predictor also achieved very high values of specificity (95%), sensitivity (96%), Matthews’s correlation coefficient (0.92), and accuracy (96%). The present investigation yielded a robust anticipatory model for the archaeal phospholipase prediction utilizing the tenets PseAAC and OET-KNN machine learning algorithm. Shiraz University 2023 /pmc/articles/PMC10387176/ /pubmed/37525666 http://dx.doi.org/10.22099/mbrc.2023.47756.1845 Text en https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Original Article Samman, Nour Mohabatkar, Hassan Rabiei, Parisa Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases |
title | Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases |
title_full | Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases |
title_fullStr | Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases |
title_full_unstemmed | Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases |
title_short | Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases |
title_sort | using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10387176/ https://www.ncbi.nlm.nih.gov/pubmed/37525666 http://dx.doi.org/10.22099/mbrc.2023.47756.1845 |
work_keys_str_mv | AT sammannour usingseveralpseudoaminoacidcompositiontypesanddifferentmachinelearningalgorithmstoclassifyandpredictarchaealphospholipases AT mohabatkarhassan usingseveralpseudoaminoacidcompositiontypesanddifferentmachinelearningalgorithmstoclassifyandpredictarchaealphospholipases AT rabieiparisa usingseveralpseudoaminoacidcompositiontypesanddifferentmachinelearningalgorithmstoclassifyandpredictarchaealphospholipases |