Cargando…

Comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory Crohn’s disease in Chinese population

BACKGROUND: Thalidomide is an effective treatment for refractory Crohn’s disease (CD). However, thalidomide-induced peripheral neuropathy (TiPN), which has a large individual variation, is a major cause of treatment failure. TiPN is rarely predictable and recognized, especially in CD. It is necessar...

Descripción completa

Detalles Bibliográficos
Autores principales: Mao, Jing, Chao, Kang, Jiang, Fu-Lin, Ye, Xiao-Ping, Yang, Ting, Li, Pan, Zhu, Xia, Hu, Pin-Jin, Zhou, Bai-Jun, Huang, Min, Gao, Xiang, Wang, Xue-Ding
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Baishideng Publishing Group Inc 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10324537/
https://www.ncbi.nlm.nih.gov/pubmed/37426324
http://dx.doi.org/10.3748/wjg.v29.i24.3855
_version_ 1785069170774245376
author Mao, Jing
Chao, Kang
Jiang, Fu-Lin
Ye, Xiao-Ping
Yang, Ting
Li, Pan
Zhu, Xia
Hu, Pin-Jin
Zhou, Bai-Jun
Huang, Min
Gao, Xiang
Wang, Xue-Ding
author_facet Mao, Jing
Chao, Kang
Jiang, Fu-Lin
Ye, Xiao-Ping
Yang, Ting
Li, Pan
Zhu, Xia
Hu, Pin-Jin
Zhou, Bai-Jun
Huang, Min
Gao, Xiang
Wang, Xue-Ding
author_sort Mao, Jing
collection PubMed
description BACKGROUND: Thalidomide is an effective treatment for refractory Crohn’s disease (CD). However, thalidomide-induced peripheral neuropathy (TiPN), which has a large individual variation, is a major cause of treatment failure. TiPN is rarely predictable and recognized, especially in CD. It is necessary to develop a risk model to predict TiPN occurrence. AIM: To develop and compare a predictive model of TiPN using machine learning based on comprehensive clinical and genetic variables. METHODS: A retrospective cohort of 164 CD patients from January 2016 to June 2022 was used to establish the model. The National Cancer Institute Common Toxicity Criteria Sensory Scale (version 4.0) was used to assess TiPN. With 18 clinical features and 150 genetic variables, five predictive models were established and evaluated by the confusion matrix receiver operating characteristic curve (AUROC), area under the precision-recall curve (AUPRC), specificity, sensitivity (recall rate), precision, accuracy, and F1 score. RESULTS: The top-ranking five risk variables associated with TiPN were interleukin-12 rs1353248 [P = 0.0004, odds ratio (OR): 8.983, 95% confidence interval (CI): 2.497-30.90], dose (mg/d, P = 0.002), brain-derived neurotrophic factor (BDNF) rs2030324 (P = 0.001, OR: 3.164, 95%CI: 1.561-6.434), BDNF rs6265 (P = 0.001, OR: 3.150, 95%CI: 1.546-6.073) and BDNF rs11030104 (P = 0.001, OR: 3.091, 95%CI: 1.525-5.960). In the training set, gradient boosting decision tree (GBDT), extremely random trees (ET), random forest, logistic regression and extreme gradient boosting (XGBoost) obtained AUROC values > 0.90 and AUPRC > 0.87. Among these models, XGBoost and GBDT obtained the first two highest AUROC (0.90 and 1), AUPRC (0.98 and 1), accuracy (0.96 and 0.98), precision (0.90 and 0.95), F1 score (0.95 and 0.98), specificity (0.94 and 0.97), and sensitivity (1). In the validation set, XGBoost algorithm exhibited the best predictive performance with the highest specificity (0.857), accuracy (0.818), AUPRC (0.86) and AUROC (0.89). ET and GBDT obtained the highest sensitivity (1) and F1 score (0.8). Overall, compared with other state-of-the-art classifiers such as ET, GBDT and RF, XGBoost algorithm not only showed a more stable performance, but also yielded higher ROC-AUC and PRC-AUC scores, demonstrating its high accuracy in prediction of TiPN occurrence. CONCLUSION: The powerful XGBoost algorithm accurately predicts TiPN using 18 clinical features and 14 genetic variables. With the ability to identify high-risk patients using single nucleotide polymorphisms, it offers a feasible option for improving thalidomide efficacy in CD patients.
format Online
Article
Text
id pubmed-10324537
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Baishideng Publishing Group Inc
record_format MEDLINE/PubMed
spelling pubmed-103245372023-07-07 Comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory Crohn’s disease in Chinese population Mao, Jing Chao, Kang Jiang, Fu-Lin Ye, Xiao-Ping Yang, Ting Li, Pan Zhu, Xia Hu, Pin-Jin Zhou, Bai-Jun Huang, Min Gao, Xiang Wang, Xue-Ding World J Gastroenterol Retrospective Study BACKGROUND: Thalidomide is an effective treatment for refractory Crohn’s disease (CD). However, thalidomide-induced peripheral neuropathy (TiPN), which has a large individual variation, is a major cause of treatment failure. TiPN is rarely predictable and recognized, especially in CD. It is necessary to develop a risk model to predict TiPN occurrence. AIM: To develop and compare a predictive model of TiPN using machine learning based on comprehensive clinical and genetic variables. METHODS: A retrospective cohort of 164 CD patients from January 2016 to June 2022 was used to establish the model. The National Cancer Institute Common Toxicity Criteria Sensory Scale (version 4.0) was used to assess TiPN. With 18 clinical features and 150 genetic variables, five predictive models were established and evaluated by the confusion matrix receiver operating characteristic curve (AUROC), area under the precision-recall curve (AUPRC), specificity, sensitivity (recall rate), precision, accuracy, and F1 score. RESULTS: The top-ranking five risk variables associated with TiPN were interleukin-12 rs1353248 [P = 0.0004, odds ratio (OR): 8.983, 95% confidence interval (CI): 2.497-30.90], dose (mg/d, P = 0.002), brain-derived neurotrophic factor (BDNF) rs2030324 (P = 0.001, OR: 3.164, 95%CI: 1.561-6.434), BDNF rs6265 (P = 0.001, OR: 3.150, 95%CI: 1.546-6.073) and BDNF rs11030104 (P = 0.001, OR: 3.091, 95%CI: 1.525-5.960). In the training set, gradient boosting decision tree (GBDT), extremely random trees (ET), random forest, logistic regression and extreme gradient boosting (XGBoost) obtained AUROC values > 0.90 and AUPRC > 0.87. Among these models, XGBoost and GBDT obtained the first two highest AUROC (0.90 and 1), AUPRC (0.98 and 1), accuracy (0.96 and 0.98), precision (0.90 and 0.95), F1 score (0.95 and 0.98), specificity (0.94 and 0.97), and sensitivity (1). In the validation set, XGBoost algorithm exhibited the best predictive performance with the highest specificity (0.857), accuracy (0.818), AUPRC (0.86) and AUROC (0.89). ET and GBDT obtained the highest sensitivity (1) and F1 score (0.8). Overall, compared with other state-of-the-art classifiers such as ET, GBDT and RF, XGBoost algorithm not only showed a more stable performance, but also yielded higher ROC-AUC and PRC-AUC scores, demonstrating its high accuracy in prediction of TiPN occurrence. CONCLUSION: The powerful XGBoost algorithm accurately predicts TiPN using 18 clinical features and 14 genetic variables. With the ability to identify high-risk patients using single nucleotide polymorphisms, it offers a feasible option for improving thalidomide efficacy in CD patients. Baishideng Publishing Group Inc 2023-06-28 2023-06-28 /pmc/articles/PMC10324537/ /pubmed/37426324 http://dx.doi.org/10.3748/wjg.v29.i24.3855 Text en ©The Author(s) 2023. Published by Baishideng Publishing Group Inc. All rights reserved. https://creativecommons.org/licenses/by-nc/4.0/This article is an open-access article that was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution NonCommercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial.
spellingShingle Retrospective Study
Mao, Jing
Chao, Kang
Jiang, Fu-Lin
Ye, Xiao-Ping
Yang, Ting
Li, Pan
Zhu, Xia
Hu, Pin-Jin
Zhou, Bai-Jun
Huang, Min
Gao, Xiang
Wang, Xue-Ding
Comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory Crohn’s disease in Chinese population
title Comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory Crohn’s disease in Chinese population
title_full Comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory Crohn’s disease in Chinese population
title_fullStr Comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory Crohn’s disease in Chinese population
title_full_unstemmed Comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory Crohn’s disease in Chinese population
title_short Comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory Crohn’s disease in Chinese population
title_sort comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory crohn’s disease in chinese population
topic Retrospective Study
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10324537/
https://www.ncbi.nlm.nih.gov/pubmed/37426324
http://dx.doi.org/10.3748/wjg.v29.i24.3855
work_keys_str_mv AT maojing comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation
AT chaokang comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation
AT jiangfulin comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation
AT yexiaoping comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation
AT yangting comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation
AT lipan comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation
AT zhuxia comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation
AT hupinjin comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation
AT zhoubaijun comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation
AT huangmin comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation
AT gaoxiang comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation
AT wangxueding comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation