Cargando…

Lupus nephritis or not? A simple and clinically friendly machine learning pipeline to help diagnosis of lupus nephritis

OBJECTIVE: Diagnosis of lupus nephritis (LN) is a complex process, which usually requires renal biopsy. We aim to establish a machine learning pipeline to help diagnosis of LN. METHODS: A cohort of 681 systemic lupus erythematosus (SLE) patients without LN and 786 SLE patients with LN was establishe...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Da-Cheng, Xu, Wang-Dong, Wang, Shen-Nan, Wang, Xiang, Leng, Wei, Fu, Lu, Liu, Xiao-Yan, Qin, Zhen, Huang, An-Fang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10257380/
https://www.ncbi.nlm.nih.gov/pubmed/37300586
http://dx.doi.org/10.1007/s00011-023-01755-7
_version_ 1785057290137632768
author Wang, Da-Cheng
Xu, Wang-Dong
Wang, Shen-Nan
Wang, Xiang
Leng, Wei
Fu, Lu
Liu, Xiao-Yan
Qin, Zhen
Huang, An-Fang
author_facet Wang, Da-Cheng
Xu, Wang-Dong
Wang, Shen-Nan
Wang, Xiang
Leng, Wei
Fu, Lu
Liu, Xiao-Yan
Qin, Zhen
Huang, An-Fang
author_sort Wang, Da-Cheng
collection PubMed
description OBJECTIVE: Diagnosis of lupus nephritis (LN) is a complex process, which usually requires renal biopsy. We aim to establish a machine learning pipeline to help diagnosis of LN. METHODS: A cohort of 681 systemic lupus erythematosus (SLE) patients without LN and 786 SLE patients with LN was established, and a total of 95 clinical, laboratory data and 17 meteorological indicators were collected. After tenfold cross-validation, the patients were divided into training set and test set. The features selected by collective feature selection method of mutual information (MI) and multisurf were used to construct the models of logistic regression, decision tree, random forest, naive Bayes, support vector machine (SVM), light gradient boosting (LGB), extreme gradient boosting (XGB), and artificial neural network (ANN), the models were compared and verified in post-analysis. RESULTS: Collective feature selection method screens out antistreptolysin (ASO), retinol binding protein (RBP), lupus anticoagulant 1 (LA1), LA2, proteinuria and other features, and the hyperparameter optimized XGB (ROC: AUC = 0.995; PRC: AUC = 1.000, APS = 1.000; balance accuracy: 0.990) has the best performance, followed by LGB (ROC: AUC = 0.992; PRC: AUC = 0.997, APS = 0.977; balance accuracy: 0.957). The worst performance is naive Bayes model (ROC: AUC = 0.799; PRC: AUC = 0.822, APS = 0.823; balance accuracy: 0.693). In the composite feature importance bar plots, ASO, RF, Up/Ucr, and other features play important roles in LN. CONCLUSION: We developed and validated a new and simple machine learning pathway for diagnosis of LN, especially the XGB model based on ASO, LA1, LA2, proteinuria, and other features screened out by collective feature selection. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00011-023-01755-7.
format Online
Article
Text
id pubmed-10257380
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-102573802023-06-12 Lupus nephritis or not? A simple and clinically friendly machine learning pipeline to help diagnosis of lupus nephritis Wang, Da-Cheng Xu, Wang-Dong Wang, Shen-Nan Wang, Xiang Leng, Wei Fu, Lu Liu, Xiao-Yan Qin, Zhen Huang, An-Fang Inflamm Res Original Research Paper OBJECTIVE: Diagnosis of lupus nephritis (LN) is a complex process, which usually requires renal biopsy. We aim to establish a machine learning pipeline to help diagnosis of LN. METHODS: A cohort of 681 systemic lupus erythematosus (SLE) patients without LN and 786 SLE patients with LN was established, and a total of 95 clinical, laboratory data and 17 meteorological indicators were collected. After tenfold cross-validation, the patients were divided into training set and test set. The features selected by collective feature selection method of mutual information (MI) and multisurf were used to construct the models of logistic regression, decision tree, random forest, naive Bayes, support vector machine (SVM), light gradient boosting (LGB), extreme gradient boosting (XGB), and artificial neural network (ANN), the models were compared and verified in post-analysis. RESULTS: Collective feature selection method screens out antistreptolysin (ASO), retinol binding protein (RBP), lupus anticoagulant 1 (LA1), LA2, proteinuria and other features, and the hyperparameter optimized XGB (ROC: AUC = 0.995; PRC: AUC = 1.000, APS = 1.000; balance accuracy: 0.990) has the best performance, followed by LGB (ROC: AUC = 0.992; PRC: AUC = 0.997, APS = 0.977; balance accuracy: 0.957). The worst performance is naive Bayes model (ROC: AUC = 0.799; PRC: AUC = 0.822, APS = 0.823; balance accuracy: 0.693). In the composite feature importance bar plots, ASO, RF, Up/Ucr, and other features play important roles in LN. CONCLUSION: We developed and validated a new and simple machine learning pathway for diagnosis of LN, especially the XGB model based on ASO, LA1, LA2, proteinuria, and other features screened out by collective feature selection. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00011-023-01755-7. Springer International Publishing 2023-06-10 /pmc/articles/PMC10257380/ /pubmed/37300586 http://dx.doi.org/10.1007/s00011-023-01755-7 Text en © The Author(s), under exclusive licence to Springer Nature Switzerland AG 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Research Paper
Wang, Da-Cheng
Xu, Wang-Dong
Wang, Shen-Nan
Wang, Xiang
Leng, Wei
Fu, Lu
Liu, Xiao-Yan
Qin, Zhen
Huang, An-Fang
Lupus nephritis or not? A simple and clinically friendly machine learning pipeline to help diagnosis of lupus nephritis
title Lupus nephritis or not? A simple and clinically friendly machine learning pipeline to help diagnosis of lupus nephritis
title_full Lupus nephritis or not? A simple and clinically friendly machine learning pipeline to help diagnosis of lupus nephritis
title_fullStr Lupus nephritis or not? A simple and clinically friendly machine learning pipeline to help diagnosis of lupus nephritis
title_full_unstemmed Lupus nephritis or not? A simple and clinically friendly machine learning pipeline to help diagnosis of lupus nephritis
title_short Lupus nephritis or not? A simple and clinically friendly machine learning pipeline to help diagnosis of lupus nephritis
title_sort lupus nephritis or not? a simple and clinically friendly machine learning pipeline to help diagnosis of lupus nephritis
topic Original Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10257380/
https://www.ncbi.nlm.nih.gov/pubmed/37300586
http://dx.doi.org/10.1007/s00011-023-01755-7
work_keys_str_mv AT wangdacheng lupusnephritisornotasimpleandclinicallyfriendlymachinelearningpipelinetohelpdiagnosisoflupusnephritis
AT xuwangdong lupusnephritisornotasimpleandclinicallyfriendlymachinelearningpipelinetohelpdiagnosisoflupusnephritis
AT wangshennan lupusnephritisornotasimpleandclinicallyfriendlymachinelearningpipelinetohelpdiagnosisoflupusnephritis
AT wangxiang lupusnephritisornotasimpleandclinicallyfriendlymachinelearningpipelinetohelpdiagnosisoflupusnephritis
AT lengwei lupusnephritisornotasimpleandclinicallyfriendlymachinelearningpipelinetohelpdiagnosisoflupusnephritis
AT fulu lupusnephritisornotasimpleandclinicallyfriendlymachinelearningpipelinetohelpdiagnosisoflupusnephritis
AT liuxiaoyan lupusnephritisornotasimpleandclinicallyfriendlymachinelearningpipelinetohelpdiagnosisoflupusnephritis
AT qinzhen lupusnephritisornotasimpleandclinicallyfriendlymachinelearningpipelinetohelpdiagnosisoflupusnephritis
AT huanganfang lupusnephritisornotasimpleandclinicallyfriendlymachinelearningpipelinetohelpdiagnosisoflupusnephritis