Cargando…
Hemolytic-Pred: A machine learning-based predictor for hemolytic proteins using position and composition-based features
OBJECTIVE: The objective of this study is to propose a novel in-silico method called Hemolytic-Pred for identifying hemolytic proteins based on their sequences, using statistical moment-based features, along with position-relative and frequency-relative information. METHODS: Primary sequences were t...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10331097/ https://www.ncbi.nlm.nih.gov/pubmed/37434723 http://dx.doi.org/10.1177/20552076231180739 |
_version_ | 1785070190952710144 |
---|---|
author | Perveen, Gulnaz Alturise, Fahad Alkhalifah, Tamim Daanial Khan, Yaser |
author_facet | Perveen, Gulnaz Alturise, Fahad Alkhalifah, Tamim Daanial Khan, Yaser |
author_sort | Perveen, Gulnaz |
collection | PubMed |
description | OBJECTIVE: The objective of this study is to propose a novel in-silico method called Hemolytic-Pred for identifying hemolytic proteins based on their sequences, using statistical moment-based features, along with position-relative and frequency-relative information. METHODS: Primary sequences were transformed into feature vectors using statistical and position-relative moment-based features. Varying machine learning algorithms were employed for classification. Computational models were rigorously evaluated using four different validation. The Hemolytic-Pred webserver is available for further analysis at http://ec2-54-160-229-10.compute-1.amazonaws.com/. RESULTS: XGBoost outperformed the other six classifiers with an accuracy value of 0.99, 0.98, 0.97, and 0.98 for self-consistency test, 10-fold cross-validation, Jackknife test, and independent set test, respectively. The proposed method with the XGBoost classifier is a workable and robust solution for predicting hemolytic proteins efficiently and accurately. CONCLUSIONS: The proposed method of Hemolytic-Pred with XGBoost classifier is a reliable tool for the timely identification of hemolytic cells and diagnosis of various related severe disorders. The application of Hemolytic-Pred can yield profound benefits in the medical field. |
format | Online Article Text |
id | pubmed-10331097 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-103310972023-07-11 Hemolytic-Pred: A machine learning-based predictor for hemolytic proteins using position and composition-based features Perveen, Gulnaz Alturise, Fahad Alkhalifah, Tamim Daanial Khan, Yaser Digit Health Original Research OBJECTIVE: The objective of this study is to propose a novel in-silico method called Hemolytic-Pred for identifying hemolytic proteins based on their sequences, using statistical moment-based features, along with position-relative and frequency-relative information. METHODS: Primary sequences were transformed into feature vectors using statistical and position-relative moment-based features. Varying machine learning algorithms were employed for classification. Computational models were rigorously evaluated using four different validation. The Hemolytic-Pred webserver is available for further analysis at http://ec2-54-160-229-10.compute-1.amazonaws.com/. RESULTS: XGBoost outperformed the other six classifiers with an accuracy value of 0.99, 0.98, 0.97, and 0.98 for self-consistency test, 10-fold cross-validation, Jackknife test, and independent set test, respectively. The proposed method with the XGBoost classifier is a workable and robust solution for predicting hemolytic proteins efficiently and accurately. CONCLUSIONS: The proposed method of Hemolytic-Pred with XGBoost classifier is a reliable tool for the timely identification of hemolytic cells and diagnosis of various related severe disorders. The application of Hemolytic-Pred can yield profound benefits in the medical field. SAGE Publications 2023-07-05 /pmc/articles/PMC10331097/ /pubmed/37434723 http://dx.doi.org/10.1177/20552076231180739 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by-nc-nd/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 License (https://creativecommons.org/licenses/by-nc-nd/4.0/) which permits non-commercial use, reproduction and distribution of the work as published without adaptation or alteration, without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Original Research Perveen, Gulnaz Alturise, Fahad Alkhalifah, Tamim Daanial Khan, Yaser Hemolytic-Pred: A machine learning-based predictor for hemolytic proteins using position and composition-based features |
title | Hemolytic-Pred: A machine learning-based predictor for hemolytic
proteins using position and composition-based features |
title_full | Hemolytic-Pred: A machine learning-based predictor for hemolytic
proteins using position and composition-based features |
title_fullStr | Hemolytic-Pred: A machine learning-based predictor for hemolytic
proteins using position and composition-based features |
title_full_unstemmed | Hemolytic-Pred: A machine learning-based predictor for hemolytic
proteins using position and composition-based features |
title_short | Hemolytic-Pred: A machine learning-based predictor for hemolytic
proteins using position and composition-based features |
title_sort | hemolytic-pred: a machine learning-based predictor for hemolytic
proteins using position and composition-based features |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10331097/ https://www.ncbi.nlm.nih.gov/pubmed/37434723 http://dx.doi.org/10.1177/20552076231180739 |
work_keys_str_mv | AT perveengulnaz hemolyticpredamachinelearningbasedpredictorforhemolyticproteinsusingpositionandcompositionbasedfeatures AT alturisefahad hemolyticpredamachinelearningbasedpredictorforhemolyticproteinsusingpositionandcompositionbasedfeatures AT alkhalifahtamim hemolyticpredamachinelearningbasedpredictorforhemolyticproteinsusingpositionandcompositionbasedfeatures AT daanialkhanyaser hemolyticpredamachinelearningbasedpredictorforhemolyticproteinsusingpositionandcompositionbasedfeatures |