Cargando…

A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data

Alpha-1 antitrypsin deficiency associated liver disease (AATD-LD) is a rare genetic disorder and not well-recognized. Predicting the clinical outcomes of AATD-LD and defining patients more likely to progress to advanced liver disease are crucial for better understanding AATD-LD progression and promo...

Descripción completa

Detalles Bibliográficos
Autores principales: Meng, Linxi, Treem, Will, Heap, Graham A., Chen, Jingjing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9554039/
https://www.ncbi.nlm.nih.gov/pubmed/36220873
http://dx.doi.org/10.1038/s41598-022-21389-9
_version_ 1784806606665416704
author Meng, Linxi
Treem, Will
Heap, Graham A.
Chen, Jingjing
author_facet Meng, Linxi
Treem, Will
Heap, Graham A.
Chen, Jingjing
author_sort Meng, Linxi
collection PubMed
description Alpha-1 antitrypsin deficiency associated liver disease (AATD-LD) is a rare genetic disorder and not well-recognized. Predicting the clinical outcomes of AATD-LD and defining patients more likely to progress to advanced liver disease are crucial for better understanding AATD-LD progression and promoting timely medical intervention. We aimed to develop a tailored machine learning (ML) model to predict the disease progression of AATD-LD. This analysis was conducted through a stacking ensemble learning model by combining five different ML algorithms with 58 predictor variables using nested five-fold cross-validation with repetitions based on the UK Biobank data. Performance of the model was assessed through prediction accuracy, area under the receiver operating characteristic (AUROC), and area under the precision-recall curve (AUPRC). The importance of predictor contributions was evaluated through a feature importance permutation method. The proposed stacking ensemble ML model showed clinically meaningful accuracy and appeared superior to any single ML algorithms in the ensemble, e.g., the AUROC for AATD-LD was 68.1%, 75.9%, 91.2%, and 67.7% for all-cause mortality, liver-related death, liver transplant, and all-cause mortality or liver transplant, respectively. This work supports the use of ML to address the unanswered clinical questions with clinically meaningful accuracy using real-world data.
format Online
Article
Text
id pubmed-9554039
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-95540392022-10-13 A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data Meng, Linxi Treem, Will Heap, Graham A. Chen, Jingjing Sci Rep Article Alpha-1 antitrypsin deficiency associated liver disease (AATD-LD) is a rare genetic disorder and not well-recognized. Predicting the clinical outcomes of AATD-LD and defining patients more likely to progress to advanced liver disease are crucial for better understanding AATD-LD progression and promoting timely medical intervention. We aimed to develop a tailored machine learning (ML) model to predict the disease progression of AATD-LD. This analysis was conducted through a stacking ensemble learning model by combining five different ML algorithms with 58 predictor variables using nested five-fold cross-validation with repetitions based on the UK Biobank data. Performance of the model was assessed through prediction accuracy, area under the receiver operating characteristic (AUROC), and area under the precision-recall curve (AUPRC). The importance of predictor contributions was evaluated through a feature importance permutation method. The proposed stacking ensemble ML model showed clinically meaningful accuracy and appeared superior to any single ML algorithms in the ensemble, e.g., the AUROC for AATD-LD was 68.1%, 75.9%, 91.2%, and 67.7% for all-cause mortality, liver-related death, liver transplant, and all-cause mortality or liver transplant, respectively. This work supports the use of ML to address the unanswered clinical questions with clinically meaningful accuracy using real-world data. Nature Publishing Group UK 2022-10-11 /pmc/articles/PMC9554039/ /pubmed/36220873 http://dx.doi.org/10.1038/s41598-022-21389-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Meng, Linxi
Treem, Will
Heap, Graham A.
Chen, Jingjing
A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data
title A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data
title_full A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data
title_fullStr A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data
title_full_unstemmed A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data
title_short A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data
title_sort stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on uk biobank data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9554039/
https://www.ncbi.nlm.nih.gov/pubmed/36220873
http://dx.doi.org/10.1038/s41598-022-21389-9
work_keys_str_mv AT menglinxi astackingensemblemachinelearningmodeltopredictalpha1antitrypsindeficiencyassociatedliverdiseaseclinicaloutcomesbasedonukbiobankdata
AT treemwill astackingensemblemachinelearningmodeltopredictalpha1antitrypsindeficiencyassociatedliverdiseaseclinicaloutcomesbasedonukbiobankdata
AT heapgrahama astackingensemblemachinelearningmodeltopredictalpha1antitrypsindeficiencyassociatedliverdiseaseclinicaloutcomesbasedonukbiobankdata
AT chenjingjing astackingensemblemachinelearningmodeltopredictalpha1antitrypsindeficiencyassociatedliverdiseaseclinicaloutcomesbasedonukbiobankdata
AT menglinxi stackingensemblemachinelearningmodeltopredictalpha1antitrypsindeficiencyassociatedliverdiseaseclinicaloutcomesbasedonukbiobankdata
AT treemwill stackingensemblemachinelearningmodeltopredictalpha1antitrypsindeficiencyassociatedliverdiseaseclinicaloutcomesbasedonukbiobankdata
AT heapgrahama stackingensemblemachinelearningmodeltopredictalpha1antitrypsindeficiencyassociatedliverdiseaseclinicaloutcomesbasedonukbiobankdata
AT chenjingjing stackingensemblemachinelearningmodeltopredictalpha1antitrypsindeficiencyassociatedliverdiseaseclinicaloutcomesbasedonukbiobankdata