Cargando…
A study of aortic dissection screening method based on multiple machine learning models
BACKGROUND: The main purpose of the study was to develop an early screening method for aortic dissection (AD) based on machine learning. Due to the rarity of AD and the complexity of symptoms, many doctors have no clinical experience with it. Many patients are not suspected of having AD, which lead...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
AME Publishing Company
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7138971/ https://www.ncbi.nlm.nih.gov/pubmed/32274126 http://dx.doi.org/10.21037/jtd.2019.12.119 |
_version_ | 1783518661142642688 |
---|---|
author | Liu, Lijue Zhang, Caiwang Zhang, Guogang Gao, Yan Luo, Jingmin Zhang, Wei Li, Yi Mu, Yang |
author_facet | Liu, Lijue Zhang, Caiwang Zhang, Guogang Gao, Yan Luo, Jingmin Zhang, Wei Li, Yi Mu, Yang |
author_sort | Liu, Lijue |
collection | PubMed |
description | BACKGROUND: The main purpose of the study was to develop an early screening method for aortic dissection (AD) based on machine learning. Due to the rarity of AD and the complexity of symptoms, many doctors have no clinical experience with it. Many patients are not suspected of having AD, which lead to a high rate of misdiagnosis. Here, we report the preliminary study and feasibility of rapid and accurate screening method of AD with machine learning methods. METHODS: The dataset analyzed was composed by examination data provided by the Xiangya Hospital Central South University of China which include a total of 60,000 samples, including aortic patients and non-aortic ones. Each sample has 76 features which is consist of routine examinations and other easily accessible information. Since the proportion of people who are affected is usually imbalanced compared to non-diseased people, multiple machine learning models were used, include AdaBoost, SmoteBagging, EasyEnsemble and CalibratedAdaMEC. They used different methods such as ensemble learning, undersampling, oversampling, and cost-sensitivity to solve data imbalance problems. RESULTS: AdaBoost performed poorly with an average recall of 16.1% and a specificity of 99.8%. SmoteBagging achieved a statistically significant better performance for this problem with an average recall of 78.1% and a specificity of 79.2%. EasyEnsemble reached the values of 77.8% and 79.3% for recall and specificity respectively. CalibratedAdaMEC’s recall and specificity are 75.8% and 76%. CONCLUSIONS: It was found that the screening performance of the models evaluated in this paper had a misdiagnosis rate lower than 25% except AdaBoost. The data used in these methods are only routine inspection data. This means that machine learning methods can help us build a fast, cheap, worthwhile and effective early screening approach for AD. |
format | Online Article Text |
id | pubmed-7138971 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | AME Publishing Company |
record_format | MEDLINE/PubMed |
spelling | pubmed-71389712020-04-09 A study of aortic dissection screening method based on multiple machine learning models Liu, Lijue Zhang, Caiwang Zhang, Guogang Gao, Yan Luo, Jingmin Zhang, Wei Li, Yi Mu, Yang J Thorac Dis Original Article BACKGROUND: The main purpose of the study was to develop an early screening method for aortic dissection (AD) based on machine learning. Due to the rarity of AD and the complexity of symptoms, many doctors have no clinical experience with it. Many patients are not suspected of having AD, which lead to a high rate of misdiagnosis. Here, we report the preliminary study and feasibility of rapid and accurate screening method of AD with machine learning methods. METHODS: The dataset analyzed was composed by examination data provided by the Xiangya Hospital Central South University of China which include a total of 60,000 samples, including aortic patients and non-aortic ones. Each sample has 76 features which is consist of routine examinations and other easily accessible information. Since the proportion of people who are affected is usually imbalanced compared to non-diseased people, multiple machine learning models were used, include AdaBoost, SmoteBagging, EasyEnsemble and CalibratedAdaMEC. They used different methods such as ensemble learning, undersampling, oversampling, and cost-sensitivity to solve data imbalance problems. RESULTS: AdaBoost performed poorly with an average recall of 16.1% and a specificity of 99.8%. SmoteBagging achieved a statistically significant better performance for this problem with an average recall of 78.1% and a specificity of 79.2%. EasyEnsemble reached the values of 77.8% and 79.3% for recall and specificity respectively. CalibratedAdaMEC’s recall and specificity are 75.8% and 76%. CONCLUSIONS: It was found that the screening performance of the models evaluated in this paper had a misdiagnosis rate lower than 25% except AdaBoost. The data used in these methods are only routine inspection data. This means that machine learning methods can help us build a fast, cheap, worthwhile and effective early screening approach for AD. AME Publishing Company 2020-03 /pmc/articles/PMC7138971/ /pubmed/32274126 http://dx.doi.org/10.21037/jtd.2019.12.119 Text en 2020 Journal of Thoracic Disease. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Original Article Liu, Lijue Zhang, Caiwang Zhang, Guogang Gao, Yan Luo, Jingmin Zhang, Wei Li, Yi Mu, Yang A study of aortic dissection screening method based on multiple machine learning models |
title | A study of aortic dissection screening method based on multiple machine learning models |
title_full | A study of aortic dissection screening method based on multiple machine learning models |
title_fullStr | A study of aortic dissection screening method based on multiple machine learning models |
title_full_unstemmed | A study of aortic dissection screening method based on multiple machine learning models |
title_short | A study of aortic dissection screening method based on multiple machine learning models |
title_sort | study of aortic dissection screening method based on multiple machine learning models |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7138971/ https://www.ncbi.nlm.nih.gov/pubmed/32274126 http://dx.doi.org/10.21037/jtd.2019.12.119 |
work_keys_str_mv | AT liulijue astudyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels AT zhangcaiwang astudyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels AT zhangguogang astudyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels AT gaoyan astudyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels AT luojingmin astudyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels AT zhangwei astudyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels AT liyi astudyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels AT muyang astudyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels AT liulijue studyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels AT zhangcaiwang studyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels AT zhangguogang studyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels AT gaoyan studyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels AT luojingmin studyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels AT zhangwei studyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels AT liyi studyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels AT muyang studyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels |