Cargando…

Machine learning to analyse omic-data for COVID-19 diagnosis and prognosis

BACKGROUND: With the global spread of COVID-19, the world has seen many patients, including many severe cases. The rapid development of machine learning (ML) has made significant disease diagnosis and prediction achievements. Current studies have confirmed that omics data at the host level can refle...

Descripción completa

Detalles Bibliográficos
Autores principales:	Liu, Xuehan, Hasan, Md Rakibul, Ahmed, Khandaker Asif, Hossain, Md Zakir
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2023
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9817417/ https://www.ncbi.nlm.nih.gov/pubmed/36609221 http://dx.doi.org/10.1186/s12859-022-05127-6

_version_	1784864750624047104
author	Liu, Xuehan Hasan, Md Rakibul Ahmed, Khandaker Asif Hossain, Md Zakir
author_facet	Liu, Xuehan Hasan, Md Rakibul Ahmed, Khandaker Asif Hossain, Md Zakir
author_sort	Liu, Xuehan
collection	PubMed
description	BACKGROUND: With the global spread of COVID-19, the world has seen many patients, including many severe cases. The rapid development of machine learning (ML) has made significant disease diagnosis and prediction achievements. Current studies have confirmed that omics data at the host level can reflect the development process and prognosis of the disease. Since early diagnosis and effective treatment of severe COVID-19 patients remains challenging, this research aims to use omics data in different ML models for COVID-19 diagnosis and prognosis. We used several ML models on omics data of a large number of individuals to first predict whether patients are COVID-19 positive or negative, followed by the severity of the disease. RESULTS: On the COVID-19 diagnosis task, we got the best AUC of 0.99 with our multilayer perceptron model and the highest F1-score of 0.95 with our logistic regression (LR) model. For the severity prediction task, we achieved the highest accuracy of 0.76 with an LR model. Beyond classification and predictive modeling, our study founds ML models performed better on integrated multi-omics data, rather than single omics. By comparing top features from different omics dataset, we also found the robustness of our model, with a wider range of applicability in diverse dataset related to COVID-19. Additionally, we have found that omics-based models performed better than image or physiological feature-based models, proving the importance of the omics-based dataset for future model development. CONCLUSIONS: This study diagnoses COVID-19 positive cases and predicts accurate severity levels. It lowers the dependence on clinical data and professional judgment, by leveraging the utilization of state-of-the-art models. our model showed wider applicability across different omics dataset, which is highly transferable in other respiratory or similar diseases. Hospital and public health care mechanisms can optimize the distribution of medical resources and improve the robustness of the medical system.
format	Online Article Text
id	pubmed-9817417
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-98174172023-01-06 Machine learning to analyse omic-data for COVID-19 diagnosis and prognosis Liu, Xuehan Hasan, Md Rakibul Ahmed, Khandaker Asif Hossain, Md Zakir BMC Bioinformatics Research BACKGROUND: With the global spread of COVID-19, the world has seen many patients, including many severe cases. The rapid development of machine learning (ML) has made significant disease diagnosis and prediction achievements. Current studies have confirmed that omics data at the host level can reflect the development process and prognosis of the disease. Since early diagnosis and effective treatment of severe COVID-19 patients remains challenging, this research aims to use omics data in different ML models for COVID-19 diagnosis and prognosis. We used several ML models on omics data of a large number of individuals to first predict whether patients are COVID-19 positive or negative, followed by the severity of the disease. RESULTS: On the COVID-19 diagnosis task, we got the best AUC of 0.99 with our multilayer perceptron model and the highest F1-score of 0.95 with our logistic regression (LR) model. For the severity prediction task, we achieved the highest accuracy of 0.76 with an LR model. Beyond classification and predictive modeling, our study founds ML models performed better on integrated multi-omics data, rather than single omics. By comparing top features from different omics dataset, we also found the robustness of our model, with a wider range of applicability in diverse dataset related to COVID-19. Additionally, we have found that omics-based models performed better than image or physiological feature-based models, proving the importance of the omics-based dataset for future model development. CONCLUSIONS: This study diagnoses COVID-19 positive cases and predicts accurate severity levels. It lowers the dependence on clinical data and professional judgment, by leveraging the utilization of state-of-the-art models. our model showed wider applicability across different omics dataset, which is highly transferable in other respiratory or similar diseases. Hospital and public health care mechanisms can optimize the distribution of medical resources and improve the robustness of the medical system. BioMed Central 2023-01-06 /pmc/articles/PMC9817417/ /pubmed/36609221 http://dx.doi.org/10.1186/s12859-022-05127-6 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Liu, Xuehan Hasan, Md Rakibul Ahmed, Khandaker Asif Hossain, Md Zakir Machine learning to analyse omic-data for COVID-19 diagnosis and prognosis
title	Machine learning to analyse omic-data for COVID-19 diagnosis and prognosis
title_full	Machine learning to analyse omic-data for COVID-19 diagnosis and prognosis
title_fullStr	Machine learning to analyse omic-data for COVID-19 diagnosis and prognosis
title_full_unstemmed	Machine learning to analyse omic-data for COVID-19 diagnosis and prognosis
title_short	Machine learning to analyse omic-data for COVID-19 diagnosis and prognosis
title_sort	machine learning to analyse omic-data for covid-19 diagnosis and prognosis
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9817417/ https://www.ncbi.nlm.nih.gov/pubmed/36609221 http://dx.doi.org/10.1186/s12859-022-05127-6
work_keys_str_mv	AT liuxuehan machinelearningtoanalyseomicdataforcovid19diagnosisandprognosis AT hasanmdrakibul machinelearningtoanalyseomicdataforcovid19diagnosisandprognosis AT ahmedkhandakerasif machinelearningtoanalyseomicdataforcovid19diagnosisandprognosis AT hossainmdzakir machinelearningtoanalyseomicdataforcovid19diagnosisandprognosis

Machine learning to analyse omic-data for COVID-19 diagnosis and prognosis

Ejemplares similares