Cargando…

Identifying incident dementia by applying machine learning to a very large administrative claims dataset

Alzheimer's disease and related dementias (ADRD) are highly prevalent conditions, and prior efforts to develop predictive models have relied on demographic and clinical risk factors using traditional logistical regression methods. We hypothesized that machine-learning algorithms using administr...

Descripción completa

Detalles Bibliográficos
Autores principales:	Nori, Vijay S., Hane, Christopher A., Martin, David C., Kravetz, Alexander D., Sanghavi, Darshak M.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2019
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6611655/ https://www.ncbi.nlm.nih.gov/pubmed/31276468 http://dx.doi.org/10.1371/journal.pone.0203246

_version_	1783432738903162880
author	Nori, Vijay S. Hane, Christopher A. Martin, David C. Kravetz, Alexander D. Sanghavi, Darshak M.
author_facet	Nori, Vijay S. Hane, Christopher A. Martin, David C. Kravetz, Alexander D. Sanghavi, Darshak M.
author_sort	Nori, Vijay S.
collection	PubMed
description	Alzheimer's disease and related dementias (ADRD) are highly prevalent conditions, and prior efforts to develop predictive models have relied on demographic and clinical risk factors using traditional logistical regression methods. We hypothesized that machine-learning algorithms using administrative claims data may represent a novel approach to predicting ADRD. Using a national de-identified dataset of more than 125 million patients including over 10,000 clinical, pharmaceutical, and demographic variables, we developed a cohort to train a machine learning model to predict ADRD 4–5 years in advance. The Lasso algorithm selected a 50-variable model with an area under the curve (AUC) of 0.693. Top diagnosis codes in the model were memory loss (780.93), Parkinson’s disease (332.0), mild cognitive impairment (331.83) and bipolar disorder (296.80), and top pharmacy codes were psychoactive drugs. Machine learning algorithms can rapidly develop predictive models for ADRD with massive datasets, without requiring hypothesis-driven feature engineering.
format	Online Article Text
id	pubmed-6611655
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-66116552019-07-12 Identifying incident dementia by applying machine learning to a very large administrative claims dataset Nori, Vijay S. Hane, Christopher A. Martin, David C. Kravetz, Alexander D. Sanghavi, Darshak M. PLoS One Research Article Alzheimer's disease and related dementias (ADRD) are highly prevalent conditions, and prior efforts to develop predictive models have relied on demographic and clinical risk factors using traditional logistical regression methods. We hypothesized that machine-learning algorithms using administrative claims data may represent a novel approach to predicting ADRD. Using a national de-identified dataset of more than 125 million patients including over 10,000 clinical, pharmaceutical, and demographic variables, we developed a cohort to train a machine learning model to predict ADRD 4–5 years in advance. The Lasso algorithm selected a 50-variable model with an area under the curve (AUC) of 0.693. Top diagnosis codes in the model were memory loss (780.93), Parkinson’s disease (332.0), mild cognitive impairment (331.83) and bipolar disorder (296.80), and top pharmacy codes were psychoactive drugs. Machine learning algorithms can rapidly develop predictive models for ADRD with massive datasets, without requiring hypothesis-driven feature engineering. Public Library of Science 2019-07-05 /pmc/articles/PMC6611655/ /pubmed/31276468 http://dx.doi.org/10.1371/journal.pone.0203246 Text en © 2019 Nori et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Nori, Vijay S. Hane, Christopher A. Martin, David C. Kravetz, Alexander D. Sanghavi, Darshak M. Identifying incident dementia by applying machine learning to a very large administrative claims dataset
title	Identifying incident dementia by applying machine learning to a very large administrative claims dataset
title_full	Identifying incident dementia by applying machine learning to a very large administrative claims dataset
title_fullStr	Identifying incident dementia by applying machine learning to a very large administrative claims dataset
title_full_unstemmed	Identifying incident dementia by applying machine learning to a very large administrative claims dataset
title_short	Identifying incident dementia by applying machine learning to a very large administrative claims dataset
title_sort	identifying incident dementia by applying machine learning to a very large administrative claims dataset
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6611655/ https://www.ncbi.nlm.nih.gov/pubmed/31276468 http://dx.doi.org/10.1371/journal.pone.0203246
work_keys_str_mv	AT norivijays identifyingincidentdementiabyapplyingmachinelearningtoaverylargeadministrativeclaimsdataset AT hanechristophera identifyingincidentdementiabyapplyingmachinelearningtoaverylargeadministrativeclaimsdataset AT martindavidc identifyingincidentdementiabyapplyingmachinelearningtoaverylargeadministrativeclaimsdataset AT kravetzalexanderd identifyingincidentdementiabyapplyingmachinelearningtoaverylargeadministrativeclaimsdataset AT sanghavidarshakm identifyingincidentdementiabyapplyingmachinelearningtoaverylargeadministrativeclaimsdataset

Identifying incident dementia by applying machine learning to a very large administrative claims dataset

Ejemplares similares