Cargando…

K-means clustering of overweight and obese population using quantile-transformed metabolic data

OBJECTIVE: Use of K-means clustering for big data technology to cluster an overweight and obese population metabolically. METHODS: K-means clustering with the help of quantile transformation of attribute values was applied to overcome the impact of the considerable variation in the values of obesity...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Li, Song, Qifa, Yang, Xi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Dove 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6711566/
https://www.ncbi.nlm.nih.gov/pubmed/31692562
http://dx.doi.org/10.2147/DMSO.S206640
_version_ 1783446540511084544
author Li, Li
Song, Qifa
Yang, Xi
author_facet Li, Li
Song, Qifa
Yang, Xi
author_sort Li, Li
collection PubMed
description OBJECTIVE: Use of K-means clustering for big data technology to cluster an overweight and obese population metabolically. METHODS: K-means clustering with the help of quantile transformation of attribute values was applied to overcome the impact of the considerable variation in the values of obesity attributes involving outliers and skewed distribution. RESULTS: Overall, 447 subjects were categorized into six clusters; metabolically normal, mild, and severe categories. There were clearly separated metabolically normal Cluster 1 and severe Cluster 2, as well as intermediate Cluster 3, 4, and 5 that had profiles of fewer attributes with abnormal values. Cluster 3 was characteristic of sole hypertension. Cluster 3 and 4 exhibited contrasting HDL-C and LDL-C levels despite similarly elevated total cholesterol. Cluster 6 with slightly elevated triglyceride was closest to the normal group. Four- and 10-quantile-transformations yielded consistent clustering results. Compared with the original data, the quantile-transformed data produced more regular and spherical clusters and evenly distributed clusters in terms of object numbers. CONCLUSIONS: This big data analysis strategy makes use of quantile-transformation of data to overcome the issue of outliers and the irregular distribution and applies to the analysis of other non-communicable diseases.
format Online
Article
Text
id pubmed-6711566
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Dove
record_format MEDLINE/PubMed
spelling pubmed-67115662019-11-05 K-means clustering of overweight and obese population using quantile-transformed metabolic data Li, Li Song, Qifa Yang, Xi Diabetes Metab Syndr Obes Original Research OBJECTIVE: Use of K-means clustering for big data technology to cluster an overweight and obese population metabolically. METHODS: K-means clustering with the help of quantile transformation of attribute values was applied to overcome the impact of the considerable variation in the values of obesity attributes involving outliers and skewed distribution. RESULTS: Overall, 447 subjects were categorized into six clusters; metabolically normal, mild, and severe categories. There were clearly separated metabolically normal Cluster 1 and severe Cluster 2, as well as intermediate Cluster 3, 4, and 5 that had profiles of fewer attributes with abnormal values. Cluster 3 was characteristic of sole hypertension. Cluster 3 and 4 exhibited contrasting HDL-C and LDL-C levels despite similarly elevated total cholesterol. Cluster 6 with slightly elevated triglyceride was closest to the normal group. Four- and 10-quantile-transformations yielded consistent clustering results. Compared with the original data, the quantile-transformed data produced more regular and spherical clusters and evenly distributed clusters in terms of object numbers. CONCLUSIONS: This big data analysis strategy makes use of quantile-transformation of data to overcome the issue of outliers and the irregular distribution and applies to the analysis of other non-communicable diseases. Dove 2019-08-23 /pmc/articles/PMC6711566/ /pubmed/31692562 http://dx.doi.org/10.2147/DMSO.S206640 Text en © 2019 Li et al. http://creativecommons.org/licenses/by-nc/3.0/ This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms (https://www.dovepress.com/terms.php).
spellingShingle Original Research
Li, Li
Song, Qifa
Yang, Xi
K-means clustering of overweight and obese population using quantile-transformed metabolic data
title K-means clustering of overweight and obese population using quantile-transformed metabolic data
title_full K-means clustering of overweight and obese population using quantile-transformed metabolic data
title_fullStr K-means clustering of overweight and obese population using quantile-transformed metabolic data
title_full_unstemmed K-means clustering of overweight and obese population using quantile-transformed metabolic data
title_short K-means clustering of overweight and obese population using quantile-transformed metabolic data
title_sort k-means clustering of overweight and obese population using quantile-transformed metabolic data
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6711566/
https://www.ncbi.nlm.nih.gov/pubmed/31692562
http://dx.doi.org/10.2147/DMSO.S206640
work_keys_str_mv AT lili kmeansclusteringofoverweightandobesepopulationusingquantiletransformedmetabolicdata
AT songqifa kmeansclusteringofoverweightandobesepopulationusingquantiletransformedmetabolicdata
AT yangxi kmeansclusteringofoverweightandobesepopulationusingquantiletransformedmetabolicdata