Cargando…
Finding undiagnosed patients with hepatitis C infection: an application of artificial intelligence to patient claims data
Hepatitis C virus (HCV) remains a significant public health challenge with approximately half of the infected population untreated and undiagnosed. In this retrospective study, predictive models were developed to identify undiagnosed HCV patients using longitudinal medical claims linked to prescript...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324575/ https://www.ncbi.nlm.nih.gov/pubmed/32601354 http://dx.doi.org/10.1038/s41598-020-67013-6 |
_version_ | 1783551965281648640 |
---|---|
author | Doyle, Orla M. Leavitt, Nadejda Rigg, John A. |
author_facet | Doyle, Orla M. Leavitt, Nadejda Rigg, John A. |
author_sort | Doyle, Orla M. |
collection | PubMed |
description | Hepatitis C virus (HCV) remains a significant public health challenge with approximately half of the infected population untreated and undiagnosed. In this retrospective study, predictive models were developed to identify undiagnosed HCV patients using longitudinal medical claims linked to prescription data from approximately ten million patients in the United States (US) between 2010 and 2016. Features capturing information on demographics, risk factors, symptoms, treatments and procedures relevant to HCV were extracted from patients’ medical history. Predictive algorithms were developed based on logistic regression, random forests, gradient boosted trees and a stacked ensemble. Descriptive analysis indicated that patients exhibited known symptoms of HCV on average 2–3 years prior to their diagnosis. The precision was at least 95% for all algorithms at low levels of recall (10%). For recall levels >50%, the stacked ensemble performed best with a precision of 97% compared with 87% for the gradient boosted trees and just 31% for the logistic regression. For context, the Center for Disease Control recommends screening in an at-risk sub-population with an estimated HCV prevalence of 2.23%. The artificial intelligence (AI) algorithm presented here has a precision which is substantially higher than the screening rates associated with recommended clinical guidelines, suggesting that AI algorithms have the potential to provide a step change in the effectiveness of HCV screening. |
format | Online Article Text |
id | pubmed-7324575 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-73245752020-07-01 Finding undiagnosed patients with hepatitis C infection: an application of artificial intelligence to patient claims data Doyle, Orla M. Leavitt, Nadejda Rigg, John A. Sci Rep Article Hepatitis C virus (HCV) remains a significant public health challenge with approximately half of the infected population untreated and undiagnosed. In this retrospective study, predictive models were developed to identify undiagnosed HCV patients using longitudinal medical claims linked to prescription data from approximately ten million patients in the United States (US) between 2010 and 2016. Features capturing information on demographics, risk factors, symptoms, treatments and procedures relevant to HCV were extracted from patients’ medical history. Predictive algorithms were developed based on logistic regression, random forests, gradient boosted trees and a stacked ensemble. Descriptive analysis indicated that patients exhibited known symptoms of HCV on average 2–3 years prior to their diagnosis. The precision was at least 95% for all algorithms at low levels of recall (10%). For recall levels >50%, the stacked ensemble performed best with a precision of 97% compared with 87% for the gradient boosted trees and just 31% for the logistic regression. For context, the Center for Disease Control recommends screening in an at-risk sub-population with an estimated HCV prevalence of 2.23%. The artificial intelligence (AI) algorithm presented here has a precision which is substantially higher than the screening rates associated with recommended clinical guidelines, suggesting that AI algorithms have the potential to provide a step change in the effectiveness of HCV screening. Nature Publishing Group UK 2020-06-29 /pmc/articles/PMC7324575/ /pubmed/32601354 http://dx.doi.org/10.1038/s41598-020-67013-6 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Doyle, Orla M. Leavitt, Nadejda Rigg, John A. Finding undiagnosed patients with hepatitis C infection: an application of artificial intelligence to patient claims data |
title | Finding undiagnosed patients with hepatitis C infection: an application of artificial intelligence to patient claims data |
title_full | Finding undiagnosed patients with hepatitis C infection: an application of artificial intelligence to patient claims data |
title_fullStr | Finding undiagnosed patients with hepatitis C infection: an application of artificial intelligence to patient claims data |
title_full_unstemmed | Finding undiagnosed patients with hepatitis C infection: an application of artificial intelligence to patient claims data |
title_short | Finding undiagnosed patients with hepatitis C infection: an application of artificial intelligence to patient claims data |
title_sort | finding undiagnosed patients with hepatitis c infection: an application of artificial intelligence to patient claims data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324575/ https://www.ncbi.nlm.nih.gov/pubmed/32601354 http://dx.doi.org/10.1038/s41598-020-67013-6 |
work_keys_str_mv | AT doyleorlam findingundiagnosedpatientswithhepatitiscinfectionanapplicationofartificialintelligencetopatientclaimsdata AT leavittnadejda findingundiagnosedpatientswithhepatitiscinfectionanapplicationofartificialintelligencetopatientclaimsdata AT riggjohna findingundiagnosedpatientswithhepatitiscinfectionanapplicationofartificialintelligencetopatientclaimsdata |