Cargando…

A study of aortic dissection screening method based on multiple machine learning models

BACKGROUND: The main purpose of the study was to develop an early screening method for aortic dissection (AD) based on machine learning. Due to the rarity of AD and the complexity of symptoms, many doctors have no clinical experience with it. Many patients are not suspected of having AD, which lead...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Lijue, Zhang, Caiwang, Zhang, Guogang, Gao, Yan, Luo, Jingmin, Zhang, Wei, Li, Yi, Mu, Yang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: AME Publishing Company 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7138971/
https://www.ncbi.nlm.nih.gov/pubmed/32274126
http://dx.doi.org/10.21037/jtd.2019.12.119
_version_ 1783518661142642688
author Liu, Lijue
Zhang, Caiwang
Zhang, Guogang
Gao, Yan
Luo, Jingmin
Zhang, Wei
Li, Yi
Mu, Yang
author_facet Liu, Lijue
Zhang, Caiwang
Zhang, Guogang
Gao, Yan
Luo, Jingmin
Zhang, Wei
Li, Yi
Mu, Yang
author_sort Liu, Lijue
collection PubMed
description BACKGROUND: The main purpose of the study was to develop an early screening method for aortic dissection (AD) based on machine learning. Due to the rarity of AD and the complexity of symptoms, many doctors have no clinical experience with it. Many patients are not suspected of having AD, which lead to a high rate of misdiagnosis. Here, we report the preliminary study and feasibility of rapid and accurate screening method of AD with machine learning methods. METHODS: The dataset analyzed was composed by examination data provided by the Xiangya Hospital Central South University of China which include a total of 60,000 samples, including aortic patients and non-aortic ones. Each sample has 76 features which is consist of routine examinations and other easily accessible information. Since the proportion of people who are affected is usually imbalanced compared to non-diseased people, multiple machine learning models were used, include AdaBoost, SmoteBagging, EasyEnsemble and CalibratedAdaMEC. They used different methods such as ensemble learning, undersampling, oversampling, and cost-sensitivity to solve data imbalance problems. RESULTS: AdaBoost performed poorly with an average recall of 16.1% and a specificity of 99.8%. SmoteBagging achieved a statistically significant better performance for this problem with an average recall of 78.1% and a specificity of 79.2%. EasyEnsemble reached the values of 77.8% and 79.3% for recall and specificity respectively. CalibratedAdaMEC’s recall and specificity are 75.8% and 76%. CONCLUSIONS: It was found that the screening performance of the models evaluated in this paper had a misdiagnosis rate lower than 25% except AdaBoost. The data used in these methods are only routine inspection data. This means that machine learning methods can help us build a fast, cheap, worthwhile and effective early screening approach for AD.
format Online
Article
Text
id pubmed-7138971
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher AME Publishing Company
record_format MEDLINE/PubMed
spelling pubmed-71389712020-04-09 A study of aortic dissection screening method based on multiple machine learning models Liu, Lijue Zhang, Caiwang Zhang, Guogang Gao, Yan Luo, Jingmin Zhang, Wei Li, Yi Mu, Yang J Thorac Dis Original Article BACKGROUND: The main purpose of the study was to develop an early screening method for aortic dissection (AD) based on machine learning. Due to the rarity of AD and the complexity of symptoms, many doctors have no clinical experience with it. Many patients are not suspected of having AD, which lead to a high rate of misdiagnosis. Here, we report the preliminary study and feasibility of rapid and accurate screening method of AD with machine learning methods. METHODS: The dataset analyzed was composed by examination data provided by the Xiangya Hospital Central South University of China which include a total of 60,000 samples, including aortic patients and non-aortic ones. Each sample has 76 features which is consist of routine examinations and other easily accessible information. Since the proportion of people who are affected is usually imbalanced compared to non-diseased people, multiple machine learning models were used, include AdaBoost, SmoteBagging, EasyEnsemble and CalibratedAdaMEC. They used different methods such as ensemble learning, undersampling, oversampling, and cost-sensitivity to solve data imbalance problems. RESULTS: AdaBoost performed poorly with an average recall of 16.1% and a specificity of 99.8%. SmoteBagging achieved a statistically significant better performance for this problem with an average recall of 78.1% and a specificity of 79.2%. EasyEnsemble reached the values of 77.8% and 79.3% for recall and specificity respectively. CalibratedAdaMEC’s recall and specificity are 75.8% and 76%. CONCLUSIONS: It was found that the screening performance of the models evaluated in this paper had a misdiagnosis rate lower than 25% except AdaBoost. The data used in these methods are only routine inspection data. This means that machine learning methods can help us build a fast, cheap, worthwhile and effective early screening approach for AD. AME Publishing Company 2020-03 /pmc/articles/PMC7138971/ /pubmed/32274126 http://dx.doi.org/10.21037/jtd.2019.12.119 Text en 2020 Journal of Thoracic Disease. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Original Article
Liu, Lijue
Zhang, Caiwang
Zhang, Guogang
Gao, Yan
Luo, Jingmin
Zhang, Wei
Li, Yi
Mu, Yang
A study of aortic dissection screening method based on multiple machine learning models
title A study of aortic dissection screening method based on multiple machine learning models
title_full A study of aortic dissection screening method based on multiple machine learning models
title_fullStr A study of aortic dissection screening method based on multiple machine learning models
title_full_unstemmed A study of aortic dissection screening method based on multiple machine learning models
title_short A study of aortic dissection screening method based on multiple machine learning models
title_sort study of aortic dissection screening method based on multiple machine learning models
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7138971/
https://www.ncbi.nlm.nih.gov/pubmed/32274126
http://dx.doi.org/10.21037/jtd.2019.12.119
work_keys_str_mv AT liulijue astudyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels
AT zhangcaiwang astudyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels
AT zhangguogang astudyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels
AT gaoyan astudyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels
AT luojingmin astudyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels
AT zhangwei astudyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels
AT liyi astudyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels
AT muyang astudyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels
AT liulijue studyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels
AT zhangcaiwang studyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels
AT zhangguogang studyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels
AT gaoyan studyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels
AT luojingmin studyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels
AT zhangwei studyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels
AT liyi studyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels
AT muyang studyofaorticdissectionscreeningmethodbasedonmultiplemachinelearningmodels