Cargando…

In simulated data and health records, latent class analysis was the optimum multimorbidity clustering algorithm

BACKGROUND AND OBJECTIVES: To investigate the reproducibility and validity of latent class analysis (LCA) and hierarchical cluster analysis (HCA), multiple correspondence analysis followed by k-means (MCA-kmeans) and k-means (kmeans) for multimorbidity clustering. METHODS: We first investigated clus...

Descripción completa

Detalles Bibliográficos
Autores principales:	Nichols, Linda, Taverner, Tom, Crowe, Francesca, Richardson, Sylvia, Yau, Christopher, Kiddle, Steven, Kirk, Paul, Barrett, Jessica, Nirantharakumar, Krishnarajah, Griffin, Simon, Edwards, Duncan, Marshall, Tom
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7613854/ https://www.ncbi.nlm.nih.gov/pubmed/36228971 http://dx.doi.org/10.1016/j.jclinepi.2022.10.011

_version_	1783605529980960768
author	Nichols, Linda Taverner, Tom Crowe, Francesca Richardson, Sylvia Yau, Christopher Kiddle, Steven Kirk, Paul Barrett, Jessica Nirantharakumar, Krishnarajah Griffin, Simon Edwards, Duncan Marshall, Tom
author_facet	Nichols, Linda Taverner, Tom Crowe, Francesca Richardson, Sylvia Yau, Christopher Kiddle, Steven Kirk, Paul Barrett, Jessica Nirantharakumar, Krishnarajah Griffin, Simon Edwards, Duncan Marshall, Tom
author_sort	Nichols, Linda
collection	PubMed
description	BACKGROUND AND OBJECTIVES: To investigate the reproducibility and validity of latent class analysis (LCA) and hierarchical cluster analysis (HCA), multiple correspondence analysis followed by k-means (MCA-kmeans) and k-means (kmeans) for multimorbidity clustering. METHODS: We first investigated clustering algorithms in simulated datasets with 26 diseases of varying prevalence in predetermined clusters, comparing the derived clusters to known clusters using the adjusted Rand Index (aRI). We then them investigated in the medical records of male patients, aged 65 to 84 years from 50 UK general practices, with 49 long-term health conditions. We compared within cluster morbidity profiles using the Pearson correlation coefficient and assessed cluster stability was in 400 bootstrap samples. RESULTS: In the simulated datasets, the closest agreement (largest aRI) to known clusters was with LCA and then MCA-kmeans algorithms. In the medical records dataset, all four algorithms identified one cluster of 20–25% of the dataset with about 82% of the same patients across all four algorithms. LCA and MCA-kmeans both found a second cluster of 7% of the dataset. Other clusters were found by only one algorithm. LCA and MCA-kmeans clustering gave the most similar partitioning (aRI 0.54). CONCLUSION: LCA achieved higher aRI than other clustering algorithms.
format	Online Article Text
id	pubmed-7613854
institution	National Center for Biotechnology Information
language	English
publishDate	2022
record_format	MEDLINE/PubMed
spelling	pubmed-76138542022-11-27 In simulated data and health records, latent class analysis was the optimum multimorbidity clustering algorithm Nichols, Linda Taverner, Tom Crowe, Francesca Richardson, Sylvia Yau, Christopher Kiddle, Steven Kirk, Paul Barrett, Jessica Nirantharakumar, Krishnarajah Griffin, Simon Edwards, Duncan Marshall, Tom J Clin Epidemiol Article BACKGROUND AND OBJECTIVES: To investigate the reproducibility and validity of latent class analysis (LCA) and hierarchical cluster analysis (HCA), multiple correspondence analysis followed by k-means (MCA-kmeans) and k-means (kmeans) for multimorbidity clustering. METHODS: We first investigated clustering algorithms in simulated datasets with 26 diseases of varying prevalence in predetermined clusters, comparing the derived clusters to known clusters using the adjusted Rand Index (aRI). We then them investigated in the medical records of male patients, aged 65 to 84 years from 50 UK general practices, with 49 long-term health conditions. We compared within cluster morbidity profiles using the Pearson correlation coefficient and assessed cluster stability was in 400 bootstrap samples. RESULTS: In the simulated datasets, the closest agreement (largest aRI) to known clusters was with LCA and then MCA-kmeans algorithms. In the medical records dataset, all four algorithms identified one cluster of 20–25% of the dataset with about 82% of the same patients across all four algorithms. LCA and MCA-kmeans both found a second cluster of 7% of the dataset. Other clusters were found by only one algorithm. LCA and MCA-kmeans clustering gave the most similar partitioning (aRI 0.54). CONCLUSION: LCA achieved higher aRI than other clustering algorithms. 2022-10-11 2022-10-11 /pmc/articles/PMC7613854/ /pubmed/36228971 http://dx.doi.org/10.1016/j.jclinepi.2022.10.011 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/) International license.
spellingShingle	Article Nichols, Linda Taverner, Tom Crowe, Francesca Richardson, Sylvia Yau, Christopher Kiddle, Steven Kirk, Paul Barrett, Jessica Nirantharakumar, Krishnarajah Griffin, Simon Edwards, Duncan Marshall, Tom In simulated data and health records, latent class analysis was the optimum multimorbidity clustering algorithm
title	In simulated data and health records, latent class analysis was the optimum multimorbidity clustering algorithm
title_full	In simulated data and health records, latent class analysis was the optimum multimorbidity clustering algorithm
title_fullStr	In simulated data and health records, latent class analysis was the optimum multimorbidity clustering algorithm
title_full_unstemmed	In simulated data and health records, latent class analysis was the optimum multimorbidity clustering algorithm
title_short	In simulated data and health records, latent class analysis was the optimum multimorbidity clustering algorithm
title_sort	in simulated data and health records, latent class analysis was the optimum multimorbidity clustering algorithm
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7613854/ https://www.ncbi.nlm.nih.gov/pubmed/36228971 http://dx.doi.org/10.1016/j.jclinepi.2022.10.011
work_keys_str_mv	AT nicholslinda insimulateddataandhealthrecordslatentclassanalysiswastheoptimummultimorbidityclusteringalgorithm AT tavernertom insimulateddataandhealthrecordslatentclassanalysiswastheoptimummultimorbidityclusteringalgorithm AT crowefrancesca insimulateddataandhealthrecordslatentclassanalysiswastheoptimummultimorbidityclusteringalgorithm AT richardsonsylvia insimulateddataandhealthrecordslatentclassanalysiswastheoptimummultimorbidityclusteringalgorithm AT yauchristopher insimulateddataandhealthrecordslatentclassanalysiswastheoptimummultimorbidityclusteringalgorithm AT kiddlesteven insimulateddataandhealthrecordslatentclassanalysiswastheoptimummultimorbidityclusteringalgorithm AT kirkpaul insimulateddataandhealthrecordslatentclassanalysiswastheoptimummultimorbidityclusteringalgorithm AT barrettjessica insimulateddataandhealthrecordslatentclassanalysiswastheoptimummultimorbidityclusteringalgorithm AT nirantharakumarkrishnarajah insimulateddataandhealthrecordslatentclassanalysiswastheoptimummultimorbidityclusteringalgorithm AT griffinsimon insimulateddataandhealthrecordslatentclassanalysiswastheoptimummultimorbidityclusteringalgorithm AT edwardsduncan insimulateddataandhealthrecordslatentclassanalysiswastheoptimummultimorbidityclusteringalgorithm AT marshalltom insimulateddataandhealthrecordslatentclassanalysiswastheoptimummultimorbidityclusteringalgorithm

In simulated data and health records, latent class analysis was the optimum multimorbidity clustering algorithm

Ejemplares similares