Cargando…

Rupture Risk Assessment for Cerebral Aneurysm Using Interpretable Machine Learning on Multidimensional Data

Background: Assessment of cerebral aneurysm rupture risk is an important task, but it remains challenging. Recent works applying machine learning to rupture risk evaluation presented positive results. Yet they were based on limited aspects of data, and lack of interpretability may limit their use in...

Descripción completa

Detalles Bibliográficos
Autores principales: Ou, Chubin, Liu, Jiahui, Qian, Yi, Chong, Winston, Zhang, Xin, Liu, Wenchao, Su, Hengxian, Zhang, Nan, Zhang, Jianbo, Duan, Chuan-Zhi, He, Xuying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7785850/
https://www.ncbi.nlm.nih.gov/pubmed/33424738
http://dx.doi.org/10.3389/fneur.2020.570181
_version_ 1783632510568103936
author Ou, Chubin
Liu, Jiahui
Qian, Yi
Chong, Winston
Zhang, Xin
Liu, Wenchao
Su, Hengxian
Zhang, Nan
Zhang, Jianbo
Duan, Chuan-Zhi
He, Xuying
author_facet Ou, Chubin
Liu, Jiahui
Qian, Yi
Chong, Winston
Zhang, Xin
Liu, Wenchao
Su, Hengxian
Zhang, Nan
Zhang, Jianbo
Duan, Chuan-Zhi
He, Xuying
author_sort Ou, Chubin
collection PubMed
description Background: Assessment of cerebral aneurysm rupture risk is an important task, but it remains challenging. Recent works applying machine learning to rupture risk evaluation presented positive results. Yet they were based on limited aspects of data, and lack of interpretability may limit their use in clinical setting. We aimed to develop interpretable machine learning models on multidimensional data for aneurysm rupture risk assessment. Methods: Three hundred seventy-four aneurysms were included in the study. Demographic, medical history, lifestyle behaviors, lipid profile, and morphologies were collected for each patient. Prediction models were derived using machine learning methods (support vector machine, artificial neural network, and XGBoost) and conventional logistic regression. The derived models were compared with the PHASES score method. The Shapley Additive Explanations (SHAP) analysis was applied to improve the interpretability of the best machine learning model and reveal the reasoning behind the predictions made by the model. Results: The best machine learning model (XGBoost) achieved an area under the receiver operating characteristic curve of 0.882 [95% confidence interval (CI) = 0.838–0.927], significantly better than the logistic regression model (0.779; 95% CI = 0.729–0.829; P = 0.002) and the PHASES score method (0.758; 95% CI = 0.713–0.800; P = 0.001). Location, size ratio, and triglyceride level were the three most important features in predicting rupture. Two typical cases were analyzed to demonstrate the interpretability of the model. Conclusions: This study demonstrated the potential of using machine learning for aneurysm rupture risk assessment. Machine learning models performed better than conventional statistical model and the PHASES score method. The SHAP analysis can improve the interpretability of machine learning models and facilitate their use in a clinical setting.
format Online
Article
Text
id pubmed-7785850
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-77858502021-01-07 Rupture Risk Assessment for Cerebral Aneurysm Using Interpretable Machine Learning on Multidimensional Data Ou, Chubin Liu, Jiahui Qian, Yi Chong, Winston Zhang, Xin Liu, Wenchao Su, Hengxian Zhang, Nan Zhang, Jianbo Duan, Chuan-Zhi He, Xuying Front Neurol Neurology Background: Assessment of cerebral aneurysm rupture risk is an important task, but it remains challenging. Recent works applying machine learning to rupture risk evaluation presented positive results. Yet they were based on limited aspects of data, and lack of interpretability may limit their use in clinical setting. We aimed to develop interpretable machine learning models on multidimensional data for aneurysm rupture risk assessment. Methods: Three hundred seventy-four aneurysms were included in the study. Demographic, medical history, lifestyle behaviors, lipid profile, and morphologies were collected for each patient. Prediction models were derived using machine learning methods (support vector machine, artificial neural network, and XGBoost) and conventional logistic regression. The derived models were compared with the PHASES score method. The Shapley Additive Explanations (SHAP) analysis was applied to improve the interpretability of the best machine learning model and reveal the reasoning behind the predictions made by the model. Results: The best machine learning model (XGBoost) achieved an area under the receiver operating characteristic curve of 0.882 [95% confidence interval (CI) = 0.838–0.927], significantly better than the logistic regression model (0.779; 95% CI = 0.729–0.829; P = 0.002) and the PHASES score method (0.758; 95% CI = 0.713–0.800; P = 0.001). Location, size ratio, and triglyceride level were the three most important features in predicting rupture. Two typical cases were analyzed to demonstrate the interpretability of the model. Conclusions: This study demonstrated the potential of using machine learning for aneurysm rupture risk assessment. Machine learning models performed better than conventional statistical model and the PHASES score method. The SHAP analysis can improve the interpretability of machine learning models and facilitate their use in a clinical setting. Frontiers Media S.A. 2020-12-23 /pmc/articles/PMC7785850/ /pubmed/33424738 http://dx.doi.org/10.3389/fneur.2020.570181 Text en Copyright © 2020 Ou, Liu, Qian, Chong, Zhang, Liu, Su, Zhang, Zhang, Duan and He. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neurology
Ou, Chubin
Liu, Jiahui
Qian, Yi
Chong, Winston
Zhang, Xin
Liu, Wenchao
Su, Hengxian
Zhang, Nan
Zhang, Jianbo
Duan, Chuan-Zhi
He, Xuying
Rupture Risk Assessment for Cerebral Aneurysm Using Interpretable Machine Learning on Multidimensional Data
title Rupture Risk Assessment for Cerebral Aneurysm Using Interpretable Machine Learning on Multidimensional Data
title_full Rupture Risk Assessment for Cerebral Aneurysm Using Interpretable Machine Learning on Multidimensional Data
title_fullStr Rupture Risk Assessment for Cerebral Aneurysm Using Interpretable Machine Learning on Multidimensional Data
title_full_unstemmed Rupture Risk Assessment for Cerebral Aneurysm Using Interpretable Machine Learning on Multidimensional Data
title_short Rupture Risk Assessment for Cerebral Aneurysm Using Interpretable Machine Learning on Multidimensional Data
title_sort rupture risk assessment for cerebral aneurysm using interpretable machine learning on multidimensional data
topic Neurology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7785850/
https://www.ncbi.nlm.nih.gov/pubmed/33424738
http://dx.doi.org/10.3389/fneur.2020.570181
work_keys_str_mv AT ouchubin ruptureriskassessmentforcerebralaneurysmusinginterpretablemachinelearningonmultidimensionaldata
AT liujiahui ruptureriskassessmentforcerebralaneurysmusinginterpretablemachinelearningonmultidimensionaldata
AT qianyi ruptureriskassessmentforcerebralaneurysmusinginterpretablemachinelearningonmultidimensionaldata
AT chongwinston ruptureriskassessmentforcerebralaneurysmusinginterpretablemachinelearningonmultidimensionaldata
AT zhangxin ruptureriskassessmentforcerebralaneurysmusinginterpretablemachinelearningonmultidimensionaldata
AT liuwenchao ruptureriskassessmentforcerebralaneurysmusinginterpretablemachinelearningonmultidimensionaldata
AT suhengxian ruptureriskassessmentforcerebralaneurysmusinginterpretablemachinelearningonmultidimensionaldata
AT zhangnan ruptureriskassessmentforcerebralaneurysmusinginterpretablemachinelearningonmultidimensionaldata
AT zhangjianbo ruptureriskassessmentforcerebralaneurysmusinginterpretablemachinelearningonmultidimensionaldata
AT duanchuanzhi ruptureriskassessmentforcerebralaneurysmusinginterpretablemachinelearningonmultidimensionaldata
AT hexuying ruptureriskassessmentforcerebralaneurysmusinginterpretablemachinelearningonmultidimensionaldata