Cargando…

Hemolytic-Pred: A machine learning-based predictor for hemolytic proteins using position and composition-based features

OBJECTIVE: The objective of this study is to propose a novel in-silico method called Hemolytic-Pred for identifying hemolytic proteins based on their sequences, using statistical moment-based features, along with position-relative and frequency-relative information. METHODS: Primary sequences were t...

Descripción completa

Detalles Bibliográficos
Autores principales: Perveen, Gulnaz, Alturise, Fahad, Alkhalifah, Tamim, Daanial Khan, Yaser
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10331097/
https://www.ncbi.nlm.nih.gov/pubmed/37434723
http://dx.doi.org/10.1177/20552076231180739
_version_ 1785070190952710144
author Perveen, Gulnaz
Alturise, Fahad
Alkhalifah, Tamim
Daanial Khan, Yaser
author_facet Perveen, Gulnaz
Alturise, Fahad
Alkhalifah, Tamim
Daanial Khan, Yaser
author_sort Perveen, Gulnaz
collection PubMed
description OBJECTIVE: The objective of this study is to propose a novel in-silico method called Hemolytic-Pred for identifying hemolytic proteins based on their sequences, using statistical moment-based features, along with position-relative and frequency-relative information. METHODS: Primary sequences were transformed into feature vectors using statistical and position-relative moment-based features. Varying machine learning algorithms were employed for classification. Computational models were rigorously evaluated using four different validation. The Hemolytic-Pred webserver is available for further analysis at http://ec2-54-160-229-10.compute-1.amazonaws.com/. RESULTS: XGBoost outperformed the other six classifiers with an accuracy value of 0.99, 0.98, 0.97, and 0.98 for self-consistency test, 10-fold cross-validation, Jackknife test, and independent set test, respectively. The proposed method with the XGBoost classifier is a workable and robust solution for predicting hemolytic proteins efficiently and accurately. CONCLUSIONS: The proposed method of Hemolytic-Pred with XGBoost classifier is a reliable tool for the timely identification of hemolytic cells and diagnosis of various related severe disorders. The application of Hemolytic-Pred can yield profound benefits in the medical field.
format Online
Article
Text
id pubmed-10331097
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-103310972023-07-11 Hemolytic-Pred: A machine learning-based predictor for hemolytic proteins using position and composition-based features Perveen, Gulnaz Alturise, Fahad Alkhalifah, Tamim Daanial Khan, Yaser Digit Health Original Research OBJECTIVE: The objective of this study is to propose a novel in-silico method called Hemolytic-Pred for identifying hemolytic proteins based on their sequences, using statistical moment-based features, along with position-relative and frequency-relative information. METHODS: Primary sequences were transformed into feature vectors using statistical and position-relative moment-based features. Varying machine learning algorithms were employed for classification. Computational models were rigorously evaluated using four different validation. The Hemolytic-Pred webserver is available for further analysis at http://ec2-54-160-229-10.compute-1.amazonaws.com/. RESULTS: XGBoost outperformed the other six classifiers with an accuracy value of 0.99, 0.98, 0.97, and 0.98 for self-consistency test, 10-fold cross-validation, Jackknife test, and independent set test, respectively. The proposed method with the XGBoost classifier is a workable and robust solution for predicting hemolytic proteins efficiently and accurately. CONCLUSIONS: The proposed method of Hemolytic-Pred with XGBoost classifier is a reliable tool for the timely identification of hemolytic cells and diagnosis of various related severe disorders. The application of Hemolytic-Pred can yield profound benefits in the medical field. SAGE Publications 2023-07-05 /pmc/articles/PMC10331097/ /pubmed/37434723 http://dx.doi.org/10.1177/20552076231180739 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by-nc-nd/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 License (https://creativecommons.org/licenses/by-nc-nd/4.0/) which permits non-commercial use, reproduction and distribution of the work as published without adaptation or alteration, without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Perveen, Gulnaz
Alturise, Fahad
Alkhalifah, Tamim
Daanial Khan, Yaser
Hemolytic-Pred: A machine learning-based predictor for hemolytic proteins using position and composition-based features
title Hemolytic-Pred: A machine learning-based predictor for hemolytic proteins using position and composition-based features
title_full Hemolytic-Pred: A machine learning-based predictor for hemolytic proteins using position and composition-based features
title_fullStr Hemolytic-Pred: A machine learning-based predictor for hemolytic proteins using position and composition-based features
title_full_unstemmed Hemolytic-Pred: A machine learning-based predictor for hemolytic proteins using position and composition-based features
title_short Hemolytic-Pred: A machine learning-based predictor for hemolytic proteins using position and composition-based features
title_sort hemolytic-pred: a machine learning-based predictor for hemolytic proteins using position and composition-based features
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10331097/
https://www.ncbi.nlm.nih.gov/pubmed/37434723
http://dx.doi.org/10.1177/20552076231180739
work_keys_str_mv AT perveengulnaz hemolyticpredamachinelearningbasedpredictorforhemolyticproteinsusingpositionandcompositionbasedfeatures
AT alturisefahad hemolyticpredamachinelearningbasedpredictorforhemolyticproteinsusingpositionandcompositionbasedfeatures
AT alkhalifahtamim hemolyticpredamachinelearningbasedpredictorforhemolyticproteinsusingpositionandcompositionbasedfeatures
AT daanialkhanyaser hemolyticpredamachinelearningbasedpredictorforhemolyticproteinsusingpositionandcompositionbasedfeatures