Cargando…

kmlShape: An Efficient Method to Cluster Longitudinal Data (Time-Series) According to Their Shapes

BACKGROUND: Longitudinal data are data in which each variable is measured repeatedly over time. One possibility for the analysis of such data is to cluster them. The majority of clustering methods group together individual that have close trajectories at given time points. These methods group trajec...

Descripción completa

Detalles Bibliográficos
Autores principales:	Genolini, Christophe, Ecochard, René, Benghezal, Mamoun, Driss, Tarak, Andrieu, Sandrine, Subtil, Fabien
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2016
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4892497/ https://www.ncbi.nlm.nih.gov/pubmed/27258355 http://dx.doi.org/10.1371/journal.pone.0150738

_version_	1782435399024508928
author	Genolini, Christophe Ecochard, René Benghezal, Mamoun Driss, Tarak Andrieu, Sandrine Subtil, Fabien
author_facet	Genolini, Christophe Ecochard, René Benghezal, Mamoun Driss, Tarak Andrieu, Sandrine Subtil, Fabien
author_sort	Genolini, Christophe
collection	PubMed
description	BACKGROUND: Longitudinal data are data in which each variable is measured repeatedly over time. One possibility for the analysis of such data is to cluster them. The majority of clustering methods group together individual that have close trajectories at given time points. These methods group trajectories that are locally close but not necessarily those that have similar shapes. However, in several circumstances, the progress of a phenomenon may be more important than the moment at which it occurs. One would thus like to achieve a partitioning where each group gathers individuals whose trajectories have similar shapes whatever the time lag between them. METHOD: In this article, we present a longitudinal data partitioning algorithm based on the shapes of the trajectories rather than on classical distances. Because this algorithm is time consuming, we propose as well two data simplification procedures that make it applicable to high dimensional datasets. RESULTS: In an application to Alzheimer disease, this algorithm revealed a “rapid decline” patient group that was not found by the classical methods. In another application to the feminine menstrual cycle, the algorithm showed, contrarily to the current literature, that the luteinizing hormone presents two peaks in an important proportion of women (22%).
format	Online Article Text
id	pubmed-4892497
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-48924972016-06-16 kmlShape: An Efficient Method to Cluster Longitudinal Data (Time-Series) According to Their Shapes Genolini, Christophe Ecochard, René Benghezal, Mamoun Driss, Tarak Andrieu, Sandrine Subtil, Fabien PLoS One Research Article BACKGROUND: Longitudinal data are data in which each variable is measured repeatedly over time. One possibility for the analysis of such data is to cluster them. The majority of clustering methods group together individual that have close trajectories at given time points. These methods group trajectories that are locally close but not necessarily those that have similar shapes. However, in several circumstances, the progress of a phenomenon may be more important than the moment at which it occurs. One would thus like to achieve a partitioning where each group gathers individuals whose trajectories have similar shapes whatever the time lag between them. METHOD: In this article, we present a longitudinal data partitioning algorithm based on the shapes of the trajectories rather than on classical distances. Because this algorithm is time consuming, we propose as well two data simplification procedures that make it applicable to high dimensional datasets. RESULTS: In an application to Alzheimer disease, this algorithm revealed a “rapid decline” patient group that was not found by the classical methods. In another application to the feminine menstrual cycle, the algorithm showed, contrarily to the current literature, that the luteinizing hormone presents two peaks in an important proportion of women (22%). Public Library of Science 2016-06-03 /pmc/articles/PMC4892497/ /pubmed/27258355 http://dx.doi.org/10.1371/journal.pone.0150738 Text en © 2016 Genolini et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Genolini, Christophe Ecochard, René Benghezal, Mamoun Driss, Tarak Andrieu, Sandrine Subtil, Fabien kmlShape: An Efficient Method to Cluster Longitudinal Data (Time-Series) According to Their Shapes
title	kmlShape: An Efficient Method to Cluster Longitudinal Data (Time-Series) According to Their Shapes
title_full	kmlShape: An Efficient Method to Cluster Longitudinal Data (Time-Series) According to Their Shapes
title_fullStr	kmlShape: An Efficient Method to Cluster Longitudinal Data (Time-Series) According to Their Shapes
title_full_unstemmed	kmlShape: An Efficient Method to Cluster Longitudinal Data (Time-Series) According to Their Shapes
title_short	kmlShape: An Efficient Method to Cluster Longitudinal Data (Time-Series) According to Their Shapes
title_sort	kmlshape: an efficient method to cluster longitudinal data (time-series) according to their shapes
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4892497/ https://www.ncbi.nlm.nih.gov/pubmed/27258355 http://dx.doi.org/10.1371/journal.pone.0150738
work_keys_str_mv	AT genolinichristophe kmlshapeanefficientmethodtoclusterlongitudinaldatatimeseriesaccordingtotheirshapes AT ecochardrene kmlshapeanefficientmethodtoclusterlongitudinaldatatimeseriesaccordingtotheirshapes AT benghezalmamoun kmlshapeanefficientmethodtoclusterlongitudinaldatatimeseriesaccordingtotheirshapes AT drisstarak kmlshapeanefficientmethodtoclusterlongitudinaldatatimeseriesaccordingtotheirshapes AT andrieusandrine kmlshapeanefficientmethodtoclusterlongitudinaldatatimeseriesaccordingtotheirshapes AT subtilfabien kmlshapeanefficientmethodtoclusterlongitudinaldatatimeseriesaccordingtotheirshapes

kmlShape: An Efficient Method to Cluster Longitudinal Data (Time-Series) According to Their Shapes

Ejemplares similares