Cargando…

A Machine Learning Aided Systematic Review and Meta-Analysis of the Relative Risk of Atrial Fibrillation in Patients With Diabetes Mellitus

Background: Meta-analysis is a widely used tool in which weighted information from multiple similar studies is aggregated to increase statistical power. However, the exponential growth of publications in key areas of medical science has rendered manual identification of relevant studies increasingly...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiong, Zhaohan, Liu, Tong, Tse, Gary, Gong, Mengqi, Gladding, Patrick A., Smaill, Bruce H., Stiles, Martin K., Gillis, Anne M., Zhao, Jichao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6037848/
https://www.ncbi.nlm.nih.gov/pubmed/30018571
http://dx.doi.org/10.3389/fphys.2018.00835
_version_ 1783338396902490112
author Xiong, Zhaohan
Liu, Tong
Tse, Gary
Gong, Mengqi
Gladding, Patrick A.
Smaill, Bruce H.
Stiles, Martin K.
Gillis, Anne M.
Zhao, Jichao
author_facet Xiong, Zhaohan
Liu, Tong
Tse, Gary
Gong, Mengqi
Gladding, Patrick A.
Smaill, Bruce H.
Stiles, Martin K.
Gillis, Anne M.
Zhao, Jichao
author_sort Xiong, Zhaohan
collection PubMed
description Background: Meta-analysis is a widely used tool in which weighted information from multiple similar studies is aggregated to increase statistical power. However, the exponential growth of publications in key areas of medical science has rendered manual identification of relevant studies increasingly time-consuming. The aim of this work was to develop a machine learning technique capable of robust automatic study selection for meta-analysis. We have validated this approach with an up-to-date meta-analysis to investigate the association between diabetes mellitus (DM) and new-onset atrial fibrillation (AF). Methods: The PubMed online database was searched from 1960 to September 2017 where 4,177 publications that mentioned both DM and AF were identified. Relevant studies were selected as follows. First, publications were clustered based on common text features using an unsupervised K-means algorithm. Clusters that best matched the selected set of potentially relevant studies (a “training” set of 139 articles) were then identified by using maximum entropy classification. The 139 articles selected automatically on this basis were screened manually to identify potentially relevant studies. To determine the validity of the automated process, a parallel set of studies was also assembled by manually screening all initially searched publications. Finally, detailed manual selection was performed on the full texts of the studies in both sets using standard criteria. Quality assessment, meta-regression random-effects models, sensitivity analysis and publication bias assessment were then conducted. Results: Machine learning-assisted screening identified the same 29 studies for meta-analysis as those identified by using manual screening alone. Machine learning enabled more robust and efficient study selection, reducing the number of studies needed for manual screening from 4,177 to 556 articles. A pooled analysis using the most conservative estimates indicated that patients with DM had ~49% greater risk of developing AF compared with individuals without DM. After adjusting for three additional risk factors i.e., hypertension, obesity and heart disease, the relative risk was 23%. Using multivariate adjusted models, the risk for developing AF in patients with DM was similar for all DM subtypes. Women with DM were 24% more likely to develop AF than men with DM. The risk for new-onset AF in patients with DM has also increased over the years. Conclusions: We have developed a novel machine learning method to identify publications suitable for inclusion in meta-analysis.This approach has the capacity to provide for a more efficient and more objective study selection process for future such studies. We have used it to demonstrate that DM is a strong, independent risk factor for AF, particularly for women.
format Online
Article
Text
id pubmed-6037848
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-60378482018-07-17 A Machine Learning Aided Systematic Review and Meta-Analysis of the Relative Risk of Atrial Fibrillation in Patients With Diabetes Mellitus Xiong, Zhaohan Liu, Tong Tse, Gary Gong, Mengqi Gladding, Patrick A. Smaill, Bruce H. Stiles, Martin K. Gillis, Anne M. Zhao, Jichao Front Physiol Physiology Background: Meta-analysis is a widely used tool in which weighted information from multiple similar studies is aggregated to increase statistical power. However, the exponential growth of publications in key areas of medical science has rendered manual identification of relevant studies increasingly time-consuming. The aim of this work was to develop a machine learning technique capable of robust automatic study selection for meta-analysis. We have validated this approach with an up-to-date meta-analysis to investigate the association between diabetes mellitus (DM) and new-onset atrial fibrillation (AF). Methods: The PubMed online database was searched from 1960 to September 2017 where 4,177 publications that mentioned both DM and AF were identified. Relevant studies were selected as follows. First, publications were clustered based on common text features using an unsupervised K-means algorithm. Clusters that best matched the selected set of potentially relevant studies (a “training” set of 139 articles) were then identified by using maximum entropy classification. The 139 articles selected automatically on this basis were screened manually to identify potentially relevant studies. To determine the validity of the automated process, a parallel set of studies was also assembled by manually screening all initially searched publications. Finally, detailed manual selection was performed on the full texts of the studies in both sets using standard criteria. Quality assessment, meta-regression random-effects models, sensitivity analysis and publication bias assessment were then conducted. Results: Machine learning-assisted screening identified the same 29 studies for meta-analysis as those identified by using manual screening alone. Machine learning enabled more robust and efficient study selection, reducing the number of studies needed for manual screening from 4,177 to 556 articles. A pooled analysis using the most conservative estimates indicated that patients with DM had ~49% greater risk of developing AF compared with individuals without DM. After adjusting for three additional risk factors i.e., hypertension, obesity and heart disease, the relative risk was 23%. Using multivariate adjusted models, the risk for developing AF in patients with DM was similar for all DM subtypes. Women with DM were 24% more likely to develop AF than men with DM. The risk for new-onset AF in patients with DM has also increased over the years. Conclusions: We have developed a novel machine learning method to identify publications suitable for inclusion in meta-analysis.This approach has the capacity to provide for a more efficient and more objective study selection process for future such studies. We have used it to demonstrate that DM is a strong, independent risk factor for AF, particularly for women. Frontiers Media S.A. 2018-07-03 /pmc/articles/PMC6037848/ /pubmed/30018571 http://dx.doi.org/10.3389/fphys.2018.00835 Text en Copyright © 2018 Xiong, Liu, Tse, Gong, Gladding, Smaill, Stiles, Gillis and Zhao. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Physiology
Xiong, Zhaohan
Liu, Tong
Tse, Gary
Gong, Mengqi
Gladding, Patrick A.
Smaill, Bruce H.
Stiles, Martin K.
Gillis, Anne M.
Zhao, Jichao
A Machine Learning Aided Systematic Review and Meta-Analysis of the Relative Risk of Atrial Fibrillation in Patients With Diabetes Mellitus
title A Machine Learning Aided Systematic Review and Meta-Analysis of the Relative Risk of Atrial Fibrillation in Patients With Diabetes Mellitus
title_full A Machine Learning Aided Systematic Review and Meta-Analysis of the Relative Risk of Atrial Fibrillation in Patients With Diabetes Mellitus
title_fullStr A Machine Learning Aided Systematic Review and Meta-Analysis of the Relative Risk of Atrial Fibrillation in Patients With Diabetes Mellitus
title_full_unstemmed A Machine Learning Aided Systematic Review and Meta-Analysis of the Relative Risk of Atrial Fibrillation in Patients With Diabetes Mellitus
title_short A Machine Learning Aided Systematic Review and Meta-Analysis of the Relative Risk of Atrial Fibrillation in Patients With Diabetes Mellitus
title_sort machine learning aided systematic review and meta-analysis of the relative risk of atrial fibrillation in patients with diabetes mellitus
topic Physiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6037848/
https://www.ncbi.nlm.nih.gov/pubmed/30018571
http://dx.doi.org/10.3389/fphys.2018.00835
work_keys_str_mv AT xiongzhaohan amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus
AT liutong amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus
AT tsegary amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus
AT gongmengqi amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus
AT gladdingpatricka amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus
AT smaillbruceh amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus
AT stilesmartink amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus
AT gillisannem amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus
AT zhaojichao amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus
AT xiongzhaohan machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus
AT liutong machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus
AT tsegary machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus
AT gongmengqi machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus
AT gladdingpatricka machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus
AT smaillbruceh machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus
AT stilesmartink machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus
AT gillisannem machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus
AT zhaojichao machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus