Cargando…

Finding undiagnosed patients with hepatitis C infection: an application of artificial intelligence to patient claims data

Hepatitis C virus (HCV) remains a significant public health challenge with approximately half of the infected population untreated and undiagnosed. In this retrospective study, predictive models were developed to identify undiagnosed HCV patients using longitudinal medical claims linked to prescript...

Descripción completa

Detalles Bibliográficos
Autores principales: Doyle, Orla M., Leavitt, Nadejda, Rigg, John A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324575/
https://www.ncbi.nlm.nih.gov/pubmed/32601354
http://dx.doi.org/10.1038/s41598-020-67013-6
_version_ 1783551965281648640
author Doyle, Orla M.
Leavitt, Nadejda
Rigg, John A.
author_facet Doyle, Orla M.
Leavitt, Nadejda
Rigg, John A.
author_sort Doyle, Orla M.
collection PubMed
description Hepatitis C virus (HCV) remains a significant public health challenge with approximately half of the infected population untreated and undiagnosed. In this retrospective study, predictive models were developed to identify undiagnosed HCV patients using longitudinal medical claims linked to prescription data from approximately ten million patients in the United States (US) between 2010 and 2016. Features capturing information on demographics, risk factors, symptoms, treatments and procedures relevant to HCV were extracted from patients’ medical history. Predictive algorithms were developed based on logistic regression, random forests, gradient boosted trees and a stacked ensemble. Descriptive analysis indicated that patients exhibited known symptoms of HCV on average 2–3 years prior to their diagnosis. The precision was at least 95% for all algorithms at low levels of recall (10%). For recall levels >50%, the stacked ensemble performed best with a precision of 97% compared with 87% for the gradient boosted trees and just 31% for the logistic regression. For context, the Center for Disease Control recommends screening in an at-risk sub-population with an estimated HCV prevalence of 2.23%. The artificial intelligence (AI) algorithm presented here has a precision which is substantially higher than the screening rates associated with recommended clinical guidelines, suggesting that AI algorithms have the potential to provide a step change in the effectiveness of HCV screening.
format Online
Article
Text
id pubmed-7324575
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-73245752020-07-01 Finding undiagnosed patients with hepatitis C infection: an application of artificial intelligence to patient claims data Doyle, Orla M. Leavitt, Nadejda Rigg, John A. Sci Rep Article Hepatitis C virus (HCV) remains a significant public health challenge with approximately half of the infected population untreated and undiagnosed. In this retrospective study, predictive models were developed to identify undiagnosed HCV patients using longitudinal medical claims linked to prescription data from approximately ten million patients in the United States (US) between 2010 and 2016. Features capturing information on demographics, risk factors, symptoms, treatments and procedures relevant to HCV were extracted from patients’ medical history. Predictive algorithms were developed based on logistic regression, random forests, gradient boosted trees and a stacked ensemble. Descriptive analysis indicated that patients exhibited known symptoms of HCV on average 2–3 years prior to their diagnosis. The precision was at least 95% for all algorithms at low levels of recall (10%). For recall levels >50%, the stacked ensemble performed best with a precision of 97% compared with 87% for the gradient boosted trees and just 31% for the logistic regression. For context, the Center for Disease Control recommends screening in an at-risk sub-population with an estimated HCV prevalence of 2.23%. The artificial intelligence (AI) algorithm presented here has a precision which is substantially higher than the screening rates associated with recommended clinical guidelines, suggesting that AI algorithms have the potential to provide a step change in the effectiveness of HCV screening. Nature Publishing Group UK 2020-06-29 /pmc/articles/PMC7324575/ /pubmed/32601354 http://dx.doi.org/10.1038/s41598-020-67013-6 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Doyle, Orla M.
Leavitt, Nadejda
Rigg, John A.
Finding undiagnosed patients with hepatitis C infection: an application of artificial intelligence to patient claims data
title Finding undiagnosed patients with hepatitis C infection: an application of artificial intelligence to patient claims data
title_full Finding undiagnosed patients with hepatitis C infection: an application of artificial intelligence to patient claims data
title_fullStr Finding undiagnosed patients with hepatitis C infection: an application of artificial intelligence to patient claims data
title_full_unstemmed Finding undiagnosed patients with hepatitis C infection: an application of artificial intelligence to patient claims data
title_short Finding undiagnosed patients with hepatitis C infection: an application of artificial intelligence to patient claims data
title_sort finding undiagnosed patients with hepatitis c infection: an application of artificial intelligence to patient claims data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324575/
https://www.ncbi.nlm.nih.gov/pubmed/32601354
http://dx.doi.org/10.1038/s41598-020-67013-6
work_keys_str_mv AT doyleorlam findingundiagnosedpatientswithhepatitiscinfectionanapplicationofartificialintelligencetopatientclaimsdata
AT leavittnadejda findingundiagnosedpatientswithhepatitiscinfectionanapplicationofartificialintelligencetopatientclaimsdata
AT riggjohna findingundiagnosedpatientswithhepatitiscinfectionanapplicationofartificialintelligencetopatientclaimsdata