Cargando…

Performance and clinical utility of supervised machine-learning approaches in detecting familial hypercholesterolaemia in primary care

Familial hypercholesterolaemia (FH) is a common inherited disorder, causing lifelong elevated low-density lipoprotein cholesterol (LDL-C). Most individuals with FH remain undiagnosed, precluding opportunities to prevent premature heart disease and death. Some machine-learning approaches improve dete...

Descripción completa

Detalles Bibliográficos
Autores principales: Akyea, Ralph K., Qureshi, Nadeem, Kai, Joe, Weng, Stephen F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7603302/
https://www.ncbi.nlm.nih.gov/pubmed/33145438
http://dx.doi.org/10.1038/s41746-020-00349-5
_version_ 1783603889459691520
author Akyea, Ralph K.
Qureshi, Nadeem
Kai, Joe
Weng, Stephen F.
author_facet Akyea, Ralph K.
Qureshi, Nadeem
Kai, Joe
Weng, Stephen F.
author_sort Akyea, Ralph K.
collection PubMed
description Familial hypercholesterolaemia (FH) is a common inherited disorder, causing lifelong elevated low-density lipoprotein cholesterol (LDL-C). Most individuals with FH remain undiagnosed, precluding opportunities to prevent premature heart disease and death. Some machine-learning approaches improve detection of FH in electronic health records, though clinical impact is under-explored. We assessed performance of an array of machine-learning approaches for enhancing detection of FH, and their clinical utility, within a large primary care population. A retrospective cohort study was done using routine primary care clinical records of 4,027,775 individuals from the United Kingdom with total cholesterol measured from 1 January 1999 to 25 June 2019. Predictive accuracy of five common machine-learning algorithms (logistic regression, random forest, gradient boosting machines, neural networks and ensemble learning) were assessed for detecting FH. Predictive accuracy was assessed by area under the receiver operating curves (AUC) and expected vs observed calibration slope; with clinical utility assessed by expected case-review workload and likelihood ratios. There were 7928 incident diagnoses of FH. In addition to known clinical features of FH (raised total cholesterol or LDL-C and family history of premature coronary heart disease), machine-learning (ML) algorithms identified features such as raised triglycerides which reduced the likelihood of FH. Apart from logistic regression (AUC, 0.81), all four other ML approaches had similarly high predictive accuracy (AUC > 0.89). Calibration slope ranged from 0.997 for gradient boosting machines to 1.857 for logistic regression. Among those screened, high probability cases requiring clinical review varied from 0.73% using ensemble learning to 10.16% using deep learning, but with positive predictive values of 15.5% and 2.8% respectively. Ensemble learning exhibited a dominant positive likelihood ratio (45.5) compared to all other ML models (7.0–14.4). Machine-learning models show similar high accuracy in detecting FH, offering opportunities to increase diagnosis. However, the clinical case-finding workload required for yield of cases will differ substantially between models.
format Online
Article
Text
id pubmed-7603302
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-76033022020-11-02 Performance and clinical utility of supervised machine-learning approaches in detecting familial hypercholesterolaemia in primary care Akyea, Ralph K. Qureshi, Nadeem Kai, Joe Weng, Stephen F. NPJ Digit Med Article Familial hypercholesterolaemia (FH) is a common inherited disorder, causing lifelong elevated low-density lipoprotein cholesterol (LDL-C). Most individuals with FH remain undiagnosed, precluding opportunities to prevent premature heart disease and death. Some machine-learning approaches improve detection of FH in electronic health records, though clinical impact is under-explored. We assessed performance of an array of machine-learning approaches for enhancing detection of FH, and their clinical utility, within a large primary care population. A retrospective cohort study was done using routine primary care clinical records of 4,027,775 individuals from the United Kingdom with total cholesterol measured from 1 January 1999 to 25 June 2019. Predictive accuracy of five common machine-learning algorithms (logistic regression, random forest, gradient boosting machines, neural networks and ensemble learning) were assessed for detecting FH. Predictive accuracy was assessed by area under the receiver operating curves (AUC) and expected vs observed calibration slope; with clinical utility assessed by expected case-review workload and likelihood ratios. There were 7928 incident diagnoses of FH. In addition to known clinical features of FH (raised total cholesterol or LDL-C and family history of premature coronary heart disease), machine-learning (ML) algorithms identified features such as raised triglycerides which reduced the likelihood of FH. Apart from logistic regression (AUC, 0.81), all four other ML approaches had similarly high predictive accuracy (AUC > 0.89). Calibration slope ranged from 0.997 for gradient boosting machines to 1.857 for logistic regression. Among those screened, high probability cases requiring clinical review varied from 0.73% using ensemble learning to 10.16% using deep learning, but with positive predictive values of 15.5% and 2.8% respectively. Ensemble learning exhibited a dominant positive likelihood ratio (45.5) compared to all other ML models (7.0–14.4). Machine-learning models show similar high accuracy in detecting FH, offering opportunities to increase diagnosis. However, the clinical case-finding workload required for yield of cases will differ substantially between models. Nature Publishing Group UK 2020-10-30 /pmc/articles/PMC7603302/ /pubmed/33145438 http://dx.doi.org/10.1038/s41746-020-00349-5 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Akyea, Ralph K.
Qureshi, Nadeem
Kai, Joe
Weng, Stephen F.
Performance and clinical utility of supervised machine-learning approaches in detecting familial hypercholesterolaemia in primary care
title Performance and clinical utility of supervised machine-learning approaches in detecting familial hypercholesterolaemia in primary care
title_full Performance and clinical utility of supervised machine-learning approaches in detecting familial hypercholesterolaemia in primary care
title_fullStr Performance and clinical utility of supervised machine-learning approaches in detecting familial hypercholesterolaemia in primary care
title_full_unstemmed Performance and clinical utility of supervised machine-learning approaches in detecting familial hypercholesterolaemia in primary care
title_short Performance and clinical utility of supervised machine-learning approaches in detecting familial hypercholesterolaemia in primary care
title_sort performance and clinical utility of supervised machine-learning approaches in detecting familial hypercholesterolaemia in primary care
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7603302/
https://www.ncbi.nlm.nih.gov/pubmed/33145438
http://dx.doi.org/10.1038/s41746-020-00349-5
work_keys_str_mv AT akyearalphk performanceandclinicalutilityofsupervisedmachinelearningapproachesindetectingfamilialhypercholesterolaemiainprimarycare
AT qureshinadeem performanceandclinicalutilityofsupervisedmachinelearningapproachesindetectingfamilialhypercholesterolaemiainprimarycare
AT kaijoe performanceandclinicalutilityofsupervisedmachinelearningapproachesindetectingfamilialhypercholesterolaemiainprimarycare
AT wengstephenf performanceandclinicalutilityofsupervisedmachinelearningapproachesindetectingfamilialhypercholesterolaemiainprimarycare