Cargando…
A Machine Learning Aided Systematic Review and Meta-Analysis of the Relative Risk of Atrial Fibrillation in Patients With Diabetes Mellitus
Background: Meta-analysis is a widely used tool in which weighted information from multiple similar studies is aggregated to increase statistical power. However, the exponential growth of publications in key areas of medical science has rendered manual identification of relevant studies increasingly...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6037848/ https://www.ncbi.nlm.nih.gov/pubmed/30018571 http://dx.doi.org/10.3389/fphys.2018.00835 |
_version_ | 1783338396902490112 |
---|---|
author | Xiong, Zhaohan Liu, Tong Tse, Gary Gong, Mengqi Gladding, Patrick A. Smaill, Bruce H. Stiles, Martin K. Gillis, Anne M. Zhao, Jichao |
author_facet | Xiong, Zhaohan Liu, Tong Tse, Gary Gong, Mengqi Gladding, Patrick A. Smaill, Bruce H. Stiles, Martin K. Gillis, Anne M. Zhao, Jichao |
author_sort | Xiong, Zhaohan |
collection | PubMed |
description | Background: Meta-analysis is a widely used tool in which weighted information from multiple similar studies is aggregated to increase statistical power. However, the exponential growth of publications in key areas of medical science has rendered manual identification of relevant studies increasingly time-consuming. The aim of this work was to develop a machine learning technique capable of robust automatic study selection for meta-analysis. We have validated this approach with an up-to-date meta-analysis to investigate the association between diabetes mellitus (DM) and new-onset atrial fibrillation (AF). Methods: The PubMed online database was searched from 1960 to September 2017 where 4,177 publications that mentioned both DM and AF were identified. Relevant studies were selected as follows. First, publications were clustered based on common text features using an unsupervised K-means algorithm. Clusters that best matched the selected set of potentially relevant studies (a “training” set of 139 articles) were then identified by using maximum entropy classification. The 139 articles selected automatically on this basis were screened manually to identify potentially relevant studies. To determine the validity of the automated process, a parallel set of studies was also assembled by manually screening all initially searched publications. Finally, detailed manual selection was performed on the full texts of the studies in both sets using standard criteria. Quality assessment, meta-regression random-effects models, sensitivity analysis and publication bias assessment were then conducted. Results: Machine learning-assisted screening identified the same 29 studies for meta-analysis as those identified by using manual screening alone. Machine learning enabled more robust and efficient study selection, reducing the number of studies needed for manual screening from 4,177 to 556 articles. A pooled analysis using the most conservative estimates indicated that patients with DM had ~49% greater risk of developing AF compared with individuals without DM. After adjusting for three additional risk factors i.e., hypertension, obesity and heart disease, the relative risk was 23%. Using multivariate adjusted models, the risk for developing AF in patients with DM was similar for all DM subtypes. Women with DM were 24% more likely to develop AF than men with DM. The risk for new-onset AF in patients with DM has also increased over the years. Conclusions: We have developed a novel machine learning method to identify publications suitable for inclusion in meta-analysis.This approach has the capacity to provide for a more efficient and more objective study selection process for future such studies. We have used it to demonstrate that DM is a strong, independent risk factor for AF, particularly for women. |
format | Online Article Text |
id | pubmed-6037848 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-60378482018-07-17 A Machine Learning Aided Systematic Review and Meta-Analysis of the Relative Risk of Atrial Fibrillation in Patients With Diabetes Mellitus Xiong, Zhaohan Liu, Tong Tse, Gary Gong, Mengqi Gladding, Patrick A. Smaill, Bruce H. Stiles, Martin K. Gillis, Anne M. Zhao, Jichao Front Physiol Physiology Background: Meta-analysis is a widely used tool in which weighted information from multiple similar studies is aggregated to increase statistical power. However, the exponential growth of publications in key areas of medical science has rendered manual identification of relevant studies increasingly time-consuming. The aim of this work was to develop a machine learning technique capable of robust automatic study selection for meta-analysis. We have validated this approach with an up-to-date meta-analysis to investigate the association between diabetes mellitus (DM) and new-onset atrial fibrillation (AF). Methods: The PubMed online database was searched from 1960 to September 2017 where 4,177 publications that mentioned both DM and AF were identified. Relevant studies were selected as follows. First, publications were clustered based on common text features using an unsupervised K-means algorithm. Clusters that best matched the selected set of potentially relevant studies (a “training” set of 139 articles) were then identified by using maximum entropy classification. The 139 articles selected automatically on this basis were screened manually to identify potentially relevant studies. To determine the validity of the automated process, a parallel set of studies was also assembled by manually screening all initially searched publications. Finally, detailed manual selection was performed on the full texts of the studies in both sets using standard criteria. Quality assessment, meta-regression random-effects models, sensitivity analysis and publication bias assessment were then conducted. Results: Machine learning-assisted screening identified the same 29 studies for meta-analysis as those identified by using manual screening alone. Machine learning enabled more robust and efficient study selection, reducing the number of studies needed for manual screening from 4,177 to 556 articles. A pooled analysis using the most conservative estimates indicated that patients with DM had ~49% greater risk of developing AF compared with individuals without DM. After adjusting for three additional risk factors i.e., hypertension, obesity and heart disease, the relative risk was 23%. Using multivariate adjusted models, the risk for developing AF in patients with DM was similar for all DM subtypes. Women with DM were 24% more likely to develop AF than men with DM. The risk for new-onset AF in patients with DM has also increased over the years. Conclusions: We have developed a novel machine learning method to identify publications suitable for inclusion in meta-analysis.This approach has the capacity to provide for a more efficient and more objective study selection process for future such studies. We have used it to demonstrate that DM is a strong, independent risk factor for AF, particularly for women. Frontiers Media S.A. 2018-07-03 /pmc/articles/PMC6037848/ /pubmed/30018571 http://dx.doi.org/10.3389/fphys.2018.00835 Text en Copyright © 2018 Xiong, Liu, Tse, Gong, Gladding, Smaill, Stiles, Gillis and Zhao. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Physiology Xiong, Zhaohan Liu, Tong Tse, Gary Gong, Mengqi Gladding, Patrick A. Smaill, Bruce H. Stiles, Martin K. Gillis, Anne M. Zhao, Jichao A Machine Learning Aided Systematic Review and Meta-Analysis of the Relative Risk of Atrial Fibrillation in Patients With Diabetes Mellitus |
title | A Machine Learning Aided Systematic Review and Meta-Analysis of the Relative Risk of Atrial Fibrillation in Patients With Diabetes Mellitus |
title_full | A Machine Learning Aided Systematic Review and Meta-Analysis of the Relative Risk of Atrial Fibrillation in Patients With Diabetes Mellitus |
title_fullStr | A Machine Learning Aided Systematic Review and Meta-Analysis of the Relative Risk of Atrial Fibrillation in Patients With Diabetes Mellitus |
title_full_unstemmed | A Machine Learning Aided Systematic Review and Meta-Analysis of the Relative Risk of Atrial Fibrillation in Patients With Diabetes Mellitus |
title_short | A Machine Learning Aided Systematic Review and Meta-Analysis of the Relative Risk of Atrial Fibrillation in Patients With Diabetes Mellitus |
title_sort | machine learning aided systematic review and meta-analysis of the relative risk of atrial fibrillation in patients with diabetes mellitus |
topic | Physiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6037848/ https://www.ncbi.nlm.nih.gov/pubmed/30018571 http://dx.doi.org/10.3389/fphys.2018.00835 |
work_keys_str_mv | AT xiongzhaohan amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus AT liutong amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus AT tsegary amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus AT gongmengqi amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus AT gladdingpatricka amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus AT smaillbruceh amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus AT stilesmartink amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus AT gillisannem amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus AT zhaojichao amachinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus AT xiongzhaohan machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus AT liutong machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus AT tsegary machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus AT gongmengqi machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus AT gladdingpatricka machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus AT smaillbruceh machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus AT stilesmartink machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus AT gillisannem machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus AT zhaojichao machinelearningaidedsystematicreviewandmetaanalysisoftherelativeriskofatrialfibrillationinpatientswithdiabetesmellitus |