Cargando…

A diagnostic model for COVID-19 based on proteomics analysis

BACKGROUND AND OBJECTIVE: Early diagnosis of Coronavirus Disease 2019 (COVID-19) can help save patients' lives before the disease turns severe. This can be achieved through an effective and correct treatment protocol. In this paper, a prediction model is proposed to detect infected cases and de...

Descripción completa

Detalles Bibliográficos
Autores principales: Alkady, Walaa, ElBahnasy, Khaled, Gad, Walaa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Ltd. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10232940/
https://www.ncbi.nlm.nih.gov/pubmed/37276752
http://dx.doi.org/10.1016/j.compbiomed.2023.107109
_version_ 1785052114209210368
author Alkady, Walaa
ElBahnasy, Khaled
Gad, Walaa
author_facet Alkady, Walaa
ElBahnasy, Khaled
Gad, Walaa
author_sort Alkady, Walaa
collection PubMed
description BACKGROUND AND OBJECTIVE: Early diagnosis of Coronavirus Disease 2019 (COVID-19) can help save patients' lives before the disease turns severe. This can be achieved through an effective and correct treatment protocol. In this paper, a prediction model is proposed to detect infected cases and determine the severity level of the disease. METHODS: The proposed model is based on utilizing proteins and metabolites as features for each patient, which are then analyzed using feature selection methods such as Principal Component Analysis (PCA), Information Gain (IG), and analysis of Variance (ANOVA) to select the most significant features. The model employs three classifiers, namely K-Nearest Neighbor (KNN), Support Vector Machine (SVM), and Random Forest (RF), to predict and classify the severity level of the COVID-19 infection. The proposed model is evaluated using four performance measures: accuracy, sensitivity, specificity, and precision. RESULTS: The experiment results show that the proposed model accuracy can reach 80% using RF classifier with PCA. The PCA selects 22 proteins and 10 metabolites. While ANOVA selects 9 proteins and 5 metabolites. The accuracy reaches 92% after applying RF classifier with the ANOVA. Finally, the accuracy reaches 93% using the RF classifier with only ten features. The selected features are 7 proteins and 3 metabolites. Moreover, it shows that the selected features have a relation to the immune system and respiratory systems. CONCLUSION: The proposed model uses three classifiers and shows promising results by selecting the important features and maximizing the prediction accuracy.
format Online
Article
Text
id pubmed-10232940
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-102329402023-06-01 A diagnostic model for COVID-19 based on proteomics analysis Alkady, Walaa ElBahnasy, Khaled Gad, Walaa Comput Biol Med Article BACKGROUND AND OBJECTIVE: Early diagnosis of Coronavirus Disease 2019 (COVID-19) can help save patients' lives before the disease turns severe. This can be achieved through an effective and correct treatment protocol. In this paper, a prediction model is proposed to detect infected cases and determine the severity level of the disease. METHODS: The proposed model is based on utilizing proteins and metabolites as features for each patient, which are then analyzed using feature selection methods such as Principal Component Analysis (PCA), Information Gain (IG), and analysis of Variance (ANOVA) to select the most significant features. The model employs three classifiers, namely K-Nearest Neighbor (KNN), Support Vector Machine (SVM), and Random Forest (RF), to predict and classify the severity level of the COVID-19 infection. The proposed model is evaluated using four performance measures: accuracy, sensitivity, specificity, and precision. RESULTS: The experiment results show that the proposed model accuracy can reach 80% using RF classifier with PCA. The PCA selects 22 proteins and 10 metabolites. While ANOVA selects 9 proteins and 5 metabolites. The accuracy reaches 92% after applying RF classifier with the ANOVA. Finally, the accuracy reaches 93% using the RF classifier with only ten features. The selected features are 7 proteins and 3 metabolites. Moreover, it shows that the selected features have a relation to the immune system and respiratory systems. CONCLUSION: The proposed model uses three classifiers and shows promising results by selecting the important features and maximizing the prediction accuracy. Elsevier Ltd. 2023-08 2023-06-01 /pmc/articles/PMC10232940/ /pubmed/37276752 http://dx.doi.org/10.1016/j.compbiomed.2023.107109 Text en © 2023 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Alkady, Walaa
ElBahnasy, Khaled
Gad, Walaa
A diagnostic model for COVID-19 based on proteomics analysis
title A diagnostic model for COVID-19 based on proteomics analysis
title_full A diagnostic model for COVID-19 based on proteomics analysis
title_fullStr A diagnostic model for COVID-19 based on proteomics analysis
title_full_unstemmed A diagnostic model for COVID-19 based on proteomics analysis
title_short A diagnostic model for COVID-19 based on proteomics analysis
title_sort diagnostic model for covid-19 based on proteomics analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10232940/
https://www.ncbi.nlm.nih.gov/pubmed/37276752
http://dx.doi.org/10.1016/j.compbiomed.2023.107109
work_keys_str_mv AT alkadywalaa adiagnosticmodelforcovid19basedonproteomicsanalysis
AT elbahnasykhaled adiagnosticmodelforcovid19basedonproteomicsanalysis
AT gadwalaa adiagnosticmodelforcovid19basedonproteomicsanalysis
AT alkadywalaa diagnosticmodelforcovid19basedonproteomicsanalysis
AT elbahnasykhaled diagnosticmodelforcovid19basedonproteomicsanalysis
AT gadwalaa diagnosticmodelforcovid19basedonproteomicsanalysis