Cargando…
Comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory Crohn’s disease in Chinese population
BACKGROUND: Thalidomide is an effective treatment for refractory Crohn’s disease (CD). However, thalidomide-induced peripheral neuropathy (TiPN), which has a large individual variation, is a major cause of treatment failure. TiPN is rarely predictable and recognized, especially in CD. It is necessar...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Baishideng Publishing Group Inc
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10324537/ https://www.ncbi.nlm.nih.gov/pubmed/37426324 http://dx.doi.org/10.3748/wjg.v29.i24.3855 |
_version_ | 1785069170774245376 |
---|---|
author | Mao, Jing Chao, Kang Jiang, Fu-Lin Ye, Xiao-Ping Yang, Ting Li, Pan Zhu, Xia Hu, Pin-Jin Zhou, Bai-Jun Huang, Min Gao, Xiang Wang, Xue-Ding |
author_facet | Mao, Jing Chao, Kang Jiang, Fu-Lin Ye, Xiao-Ping Yang, Ting Li, Pan Zhu, Xia Hu, Pin-Jin Zhou, Bai-Jun Huang, Min Gao, Xiang Wang, Xue-Ding |
author_sort | Mao, Jing |
collection | PubMed |
description | BACKGROUND: Thalidomide is an effective treatment for refractory Crohn’s disease (CD). However, thalidomide-induced peripheral neuropathy (TiPN), which has a large individual variation, is a major cause of treatment failure. TiPN is rarely predictable and recognized, especially in CD. It is necessary to develop a risk model to predict TiPN occurrence. AIM: To develop and compare a predictive model of TiPN using machine learning based on comprehensive clinical and genetic variables. METHODS: A retrospective cohort of 164 CD patients from January 2016 to June 2022 was used to establish the model. The National Cancer Institute Common Toxicity Criteria Sensory Scale (version 4.0) was used to assess TiPN. With 18 clinical features and 150 genetic variables, five predictive models were established and evaluated by the confusion matrix receiver operating characteristic curve (AUROC), area under the precision-recall curve (AUPRC), specificity, sensitivity (recall rate), precision, accuracy, and F1 score. RESULTS: The top-ranking five risk variables associated with TiPN were interleukin-12 rs1353248 [P = 0.0004, odds ratio (OR): 8.983, 95% confidence interval (CI): 2.497-30.90], dose (mg/d, P = 0.002), brain-derived neurotrophic factor (BDNF) rs2030324 (P = 0.001, OR: 3.164, 95%CI: 1.561-6.434), BDNF rs6265 (P = 0.001, OR: 3.150, 95%CI: 1.546-6.073) and BDNF rs11030104 (P = 0.001, OR: 3.091, 95%CI: 1.525-5.960). In the training set, gradient boosting decision tree (GBDT), extremely random trees (ET), random forest, logistic regression and extreme gradient boosting (XGBoost) obtained AUROC values > 0.90 and AUPRC > 0.87. Among these models, XGBoost and GBDT obtained the first two highest AUROC (0.90 and 1), AUPRC (0.98 and 1), accuracy (0.96 and 0.98), precision (0.90 and 0.95), F1 score (0.95 and 0.98), specificity (0.94 and 0.97), and sensitivity (1). In the validation set, XGBoost algorithm exhibited the best predictive performance with the highest specificity (0.857), accuracy (0.818), AUPRC (0.86) and AUROC (0.89). ET and GBDT obtained the highest sensitivity (1) and F1 score (0.8). Overall, compared with other state-of-the-art classifiers such as ET, GBDT and RF, XGBoost algorithm not only showed a more stable performance, but also yielded higher ROC-AUC and PRC-AUC scores, demonstrating its high accuracy in prediction of TiPN occurrence. CONCLUSION: The powerful XGBoost algorithm accurately predicts TiPN using 18 clinical features and 14 genetic variables. With the ability to identify high-risk patients using single nucleotide polymorphisms, it offers a feasible option for improving thalidomide efficacy in CD patients. |
format | Online Article Text |
id | pubmed-10324537 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Baishideng Publishing Group Inc |
record_format | MEDLINE/PubMed |
spelling | pubmed-103245372023-07-07 Comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory Crohn’s disease in Chinese population Mao, Jing Chao, Kang Jiang, Fu-Lin Ye, Xiao-Ping Yang, Ting Li, Pan Zhu, Xia Hu, Pin-Jin Zhou, Bai-Jun Huang, Min Gao, Xiang Wang, Xue-Ding World J Gastroenterol Retrospective Study BACKGROUND: Thalidomide is an effective treatment for refractory Crohn’s disease (CD). However, thalidomide-induced peripheral neuropathy (TiPN), which has a large individual variation, is a major cause of treatment failure. TiPN is rarely predictable and recognized, especially in CD. It is necessary to develop a risk model to predict TiPN occurrence. AIM: To develop and compare a predictive model of TiPN using machine learning based on comprehensive clinical and genetic variables. METHODS: A retrospective cohort of 164 CD patients from January 2016 to June 2022 was used to establish the model. The National Cancer Institute Common Toxicity Criteria Sensory Scale (version 4.0) was used to assess TiPN. With 18 clinical features and 150 genetic variables, five predictive models were established and evaluated by the confusion matrix receiver operating characteristic curve (AUROC), area under the precision-recall curve (AUPRC), specificity, sensitivity (recall rate), precision, accuracy, and F1 score. RESULTS: The top-ranking five risk variables associated with TiPN were interleukin-12 rs1353248 [P = 0.0004, odds ratio (OR): 8.983, 95% confidence interval (CI): 2.497-30.90], dose (mg/d, P = 0.002), brain-derived neurotrophic factor (BDNF) rs2030324 (P = 0.001, OR: 3.164, 95%CI: 1.561-6.434), BDNF rs6265 (P = 0.001, OR: 3.150, 95%CI: 1.546-6.073) and BDNF rs11030104 (P = 0.001, OR: 3.091, 95%CI: 1.525-5.960). In the training set, gradient boosting decision tree (GBDT), extremely random trees (ET), random forest, logistic regression and extreme gradient boosting (XGBoost) obtained AUROC values > 0.90 and AUPRC > 0.87. Among these models, XGBoost and GBDT obtained the first two highest AUROC (0.90 and 1), AUPRC (0.98 and 1), accuracy (0.96 and 0.98), precision (0.90 and 0.95), F1 score (0.95 and 0.98), specificity (0.94 and 0.97), and sensitivity (1). In the validation set, XGBoost algorithm exhibited the best predictive performance with the highest specificity (0.857), accuracy (0.818), AUPRC (0.86) and AUROC (0.89). ET and GBDT obtained the highest sensitivity (1) and F1 score (0.8). Overall, compared with other state-of-the-art classifiers such as ET, GBDT and RF, XGBoost algorithm not only showed a more stable performance, but also yielded higher ROC-AUC and PRC-AUC scores, demonstrating its high accuracy in prediction of TiPN occurrence. CONCLUSION: The powerful XGBoost algorithm accurately predicts TiPN using 18 clinical features and 14 genetic variables. With the ability to identify high-risk patients using single nucleotide polymorphisms, it offers a feasible option for improving thalidomide efficacy in CD patients. Baishideng Publishing Group Inc 2023-06-28 2023-06-28 /pmc/articles/PMC10324537/ /pubmed/37426324 http://dx.doi.org/10.3748/wjg.v29.i24.3855 Text en ©The Author(s) 2023. Published by Baishideng Publishing Group Inc. All rights reserved. https://creativecommons.org/licenses/by-nc/4.0/This article is an open-access article that was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution NonCommercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. |
spellingShingle | Retrospective Study Mao, Jing Chao, Kang Jiang, Fu-Lin Ye, Xiao-Ping Yang, Ting Li, Pan Zhu, Xia Hu, Pin-Jin Zhou, Bai-Jun Huang, Min Gao, Xiang Wang, Xue-Ding Comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory Crohn’s disease in Chinese population |
title | Comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory Crohn’s disease in Chinese population |
title_full | Comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory Crohn’s disease in Chinese population |
title_fullStr | Comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory Crohn’s disease in Chinese population |
title_full_unstemmed | Comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory Crohn’s disease in Chinese population |
title_short | Comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory Crohn’s disease in Chinese population |
title_sort | comparison and development of machine learning for thalidomide-induced peripheral neuropathy prediction of refractory crohn’s disease in chinese population |
topic | Retrospective Study |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10324537/ https://www.ncbi.nlm.nih.gov/pubmed/37426324 http://dx.doi.org/10.3748/wjg.v29.i24.3855 |
work_keys_str_mv | AT maojing comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation AT chaokang comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation AT jiangfulin comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation AT yexiaoping comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation AT yangting comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation AT lipan comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation AT zhuxia comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation AT hupinjin comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation AT zhoubaijun comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation AT huangmin comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation AT gaoxiang comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation AT wangxueding comparisonanddevelopmentofmachinelearningforthalidomideinducedperipheralneuropathypredictionofrefractorycrohnsdiseaseinchinesepopulation |