Cargando…
A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data
Alpha-1 antitrypsin deficiency associated liver disease (AATD-LD) is a rare genetic disorder and not well-recognized. Predicting the clinical outcomes of AATD-LD and defining patients more likely to progress to advanced liver disease are crucial for better understanding AATD-LD progression and promo...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9554039/ https://www.ncbi.nlm.nih.gov/pubmed/36220873 http://dx.doi.org/10.1038/s41598-022-21389-9 |
_version_ | 1784806606665416704 |
---|---|
author | Meng, Linxi Treem, Will Heap, Graham A. Chen, Jingjing |
author_facet | Meng, Linxi Treem, Will Heap, Graham A. Chen, Jingjing |
author_sort | Meng, Linxi |
collection | PubMed |
description | Alpha-1 antitrypsin deficiency associated liver disease (AATD-LD) is a rare genetic disorder and not well-recognized. Predicting the clinical outcomes of AATD-LD and defining patients more likely to progress to advanced liver disease are crucial for better understanding AATD-LD progression and promoting timely medical intervention. We aimed to develop a tailored machine learning (ML) model to predict the disease progression of AATD-LD. This analysis was conducted through a stacking ensemble learning model by combining five different ML algorithms with 58 predictor variables using nested five-fold cross-validation with repetitions based on the UK Biobank data. Performance of the model was assessed through prediction accuracy, area under the receiver operating characteristic (AUROC), and area under the precision-recall curve (AUPRC). The importance of predictor contributions was evaluated through a feature importance permutation method. The proposed stacking ensemble ML model showed clinically meaningful accuracy and appeared superior to any single ML algorithms in the ensemble, e.g., the AUROC for AATD-LD was 68.1%, 75.9%, 91.2%, and 67.7% for all-cause mortality, liver-related death, liver transplant, and all-cause mortality or liver transplant, respectively. This work supports the use of ML to address the unanswered clinical questions with clinically meaningful accuracy using real-world data. |
format | Online Article Text |
id | pubmed-9554039 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-95540392022-10-13 A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data Meng, Linxi Treem, Will Heap, Graham A. Chen, Jingjing Sci Rep Article Alpha-1 antitrypsin deficiency associated liver disease (AATD-LD) is a rare genetic disorder and not well-recognized. Predicting the clinical outcomes of AATD-LD and defining patients more likely to progress to advanced liver disease are crucial for better understanding AATD-LD progression and promoting timely medical intervention. We aimed to develop a tailored machine learning (ML) model to predict the disease progression of AATD-LD. This analysis was conducted through a stacking ensemble learning model by combining five different ML algorithms with 58 predictor variables using nested five-fold cross-validation with repetitions based on the UK Biobank data. Performance of the model was assessed through prediction accuracy, area under the receiver operating characteristic (AUROC), and area under the precision-recall curve (AUPRC). The importance of predictor contributions was evaluated through a feature importance permutation method. The proposed stacking ensemble ML model showed clinically meaningful accuracy and appeared superior to any single ML algorithms in the ensemble, e.g., the AUROC for AATD-LD was 68.1%, 75.9%, 91.2%, and 67.7% for all-cause mortality, liver-related death, liver transplant, and all-cause mortality or liver transplant, respectively. This work supports the use of ML to address the unanswered clinical questions with clinically meaningful accuracy using real-world data. Nature Publishing Group UK 2022-10-11 /pmc/articles/PMC9554039/ /pubmed/36220873 http://dx.doi.org/10.1038/s41598-022-21389-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Meng, Linxi Treem, Will Heap, Graham A. Chen, Jingjing A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data |
title | A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data |
title_full | A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data |
title_fullStr | A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data |
title_full_unstemmed | A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data |
title_short | A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data |
title_sort | stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on uk biobank data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9554039/ https://www.ncbi.nlm.nih.gov/pubmed/36220873 http://dx.doi.org/10.1038/s41598-022-21389-9 |
work_keys_str_mv | AT menglinxi astackingensemblemachinelearningmodeltopredictalpha1antitrypsindeficiencyassociatedliverdiseaseclinicaloutcomesbasedonukbiobankdata AT treemwill astackingensemblemachinelearningmodeltopredictalpha1antitrypsindeficiencyassociatedliverdiseaseclinicaloutcomesbasedonukbiobankdata AT heapgrahama astackingensemblemachinelearningmodeltopredictalpha1antitrypsindeficiencyassociatedliverdiseaseclinicaloutcomesbasedonukbiobankdata AT chenjingjing astackingensemblemachinelearningmodeltopredictalpha1antitrypsindeficiencyassociatedliverdiseaseclinicaloutcomesbasedonukbiobankdata AT menglinxi stackingensemblemachinelearningmodeltopredictalpha1antitrypsindeficiencyassociatedliverdiseaseclinicaloutcomesbasedonukbiobankdata AT treemwill stackingensemblemachinelearningmodeltopredictalpha1antitrypsindeficiencyassociatedliverdiseaseclinicaloutcomesbasedonukbiobankdata AT heapgrahama stackingensemblemachinelearningmodeltopredictalpha1antitrypsindeficiencyassociatedliverdiseaseclinicaloutcomesbasedonukbiobankdata AT chenjingjing stackingensemblemachinelearningmodeltopredictalpha1antitrypsindeficiencyassociatedliverdiseaseclinicaloutcomesbasedonukbiobankdata |