Cargando…

Machine learning classifiers for screening nonalcoholic fatty liver disease in general adults

Nonalcoholic fatty liver disease (NAFLD) is one of major causes of end-stage liver disease in the coming decades, but it shows few symptoms until it develops into cirrhosis. We aim to develop classification models with machine learning to screen NAFLD patients among general adults. This study includ...

Descripción completa

Detalles Bibliográficos
Autores principales: Qin, Shenghua, Hou, Xiaomin, Wen, Yuan, Wang, Chunqing, Tan, Xiaxian, Tian, Hao, Ao, Qingqing, Li, Jingze, Chu, Shuyuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9984396/
https://www.ncbi.nlm.nih.gov/pubmed/36869105
http://dx.doi.org/10.1038/s41598-023-30750-5
_version_ 1784900737467154432
author Qin, Shenghua
Hou, Xiaomin
Wen, Yuan
Wang, Chunqing
Tan, Xiaxian
Tian, Hao
Ao, Qingqing
Li, Jingze
Chu, Shuyuan
author_facet Qin, Shenghua
Hou, Xiaomin
Wen, Yuan
Wang, Chunqing
Tan, Xiaxian
Tian, Hao
Ao, Qingqing
Li, Jingze
Chu, Shuyuan
author_sort Qin, Shenghua
collection PubMed
description Nonalcoholic fatty liver disease (NAFLD) is one of major causes of end-stage liver disease in the coming decades, but it shows few symptoms until it develops into cirrhosis. We aim to develop classification models with machine learning to screen NAFLD patients among general adults. This study included 14,439 adults who took health examination. We developed classification models to classify subjects with or without NAFLD using decision tree, random forest (RF), extreme gradient boosting (XGBoost) and support vector machine (SVM). The classifier with SVM was showed the best performance with the highest accuracy (0.801), positive predictive value (PPV) (0.795), F1 score (0.795), Kappa score (0.508) and area under the precision-recall curve (AUPRC) (0.712), and the second top of area under receiver operating characteristic curve (AUROC) (0.850). The second-best classifier was RF model, which was showed the highest AUROC (0.852) and the second top of accuracy (0.789), PPV (0.782), F1 score (0.782), Kappa score (0.478) and AUPRC (0.708). In conclusion, the classifier with SVM is the best one to screen NAFLD in general population based on the results from physical examination and blood testing, followed by the classifier with RF. Those classifiers have a potential to screen NAFLD in general population for physician and primary care doctors, which could benefit to NAFLD patients from early diagnosis.
format Online
Article
Text
id pubmed-9984396
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-99843962023-03-05 Machine learning classifiers for screening nonalcoholic fatty liver disease in general adults Qin, Shenghua Hou, Xiaomin Wen, Yuan Wang, Chunqing Tan, Xiaxian Tian, Hao Ao, Qingqing Li, Jingze Chu, Shuyuan Sci Rep Article Nonalcoholic fatty liver disease (NAFLD) is one of major causes of end-stage liver disease in the coming decades, but it shows few symptoms until it develops into cirrhosis. We aim to develop classification models with machine learning to screen NAFLD patients among general adults. This study included 14,439 adults who took health examination. We developed classification models to classify subjects with or without NAFLD using decision tree, random forest (RF), extreme gradient boosting (XGBoost) and support vector machine (SVM). The classifier with SVM was showed the best performance with the highest accuracy (0.801), positive predictive value (PPV) (0.795), F1 score (0.795), Kappa score (0.508) and area under the precision-recall curve (AUPRC) (0.712), and the second top of area under receiver operating characteristic curve (AUROC) (0.850). The second-best classifier was RF model, which was showed the highest AUROC (0.852) and the second top of accuracy (0.789), PPV (0.782), F1 score (0.782), Kappa score (0.478) and AUPRC (0.708). In conclusion, the classifier with SVM is the best one to screen NAFLD in general population based on the results from physical examination and blood testing, followed by the classifier with RF. Those classifiers have a potential to screen NAFLD in general population for physician and primary care doctors, which could benefit to NAFLD patients from early diagnosis. Nature Publishing Group UK 2023-03-03 /pmc/articles/PMC9984396/ /pubmed/36869105 http://dx.doi.org/10.1038/s41598-023-30750-5 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Qin, Shenghua
Hou, Xiaomin
Wen, Yuan
Wang, Chunqing
Tan, Xiaxian
Tian, Hao
Ao, Qingqing
Li, Jingze
Chu, Shuyuan
Machine learning classifiers for screening nonalcoholic fatty liver disease in general adults
title Machine learning classifiers for screening nonalcoholic fatty liver disease in general adults
title_full Machine learning classifiers for screening nonalcoholic fatty liver disease in general adults
title_fullStr Machine learning classifiers for screening nonalcoholic fatty liver disease in general adults
title_full_unstemmed Machine learning classifiers for screening nonalcoholic fatty liver disease in general adults
title_short Machine learning classifiers for screening nonalcoholic fatty liver disease in general adults
title_sort machine learning classifiers for screening nonalcoholic fatty liver disease in general adults
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9984396/
https://www.ncbi.nlm.nih.gov/pubmed/36869105
http://dx.doi.org/10.1038/s41598-023-30750-5
work_keys_str_mv AT qinshenghua machinelearningclassifiersforscreeningnonalcoholicfattyliverdiseaseingeneraladults
AT houxiaomin machinelearningclassifiersforscreeningnonalcoholicfattyliverdiseaseingeneraladults
AT wenyuan machinelearningclassifiersforscreeningnonalcoholicfattyliverdiseaseingeneraladults
AT wangchunqing machinelearningclassifiersforscreeningnonalcoholicfattyliverdiseaseingeneraladults
AT tanxiaxian machinelearningclassifiersforscreeningnonalcoholicfattyliverdiseaseingeneraladults
AT tianhao machinelearningclassifiersforscreeningnonalcoholicfattyliverdiseaseingeneraladults
AT aoqingqing machinelearningclassifiersforscreeningnonalcoholicfattyliverdiseaseingeneraladults
AT lijingze machinelearningclassifiersforscreeningnonalcoholicfattyliverdiseaseingeneraladults
AT chushuyuan machinelearningclassifiersforscreeningnonalcoholicfattyliverdiseaseingeneraladults