Cargando…

Visualising and modelling changes in categorical variables in longitudinal studies

BACKGROUND: Graphical techniques can provide visually compelling insights into complex data patterns. In this paper we present a type of lasagne plot showing changes in categorical variables for participants measured at regular intervals over time and propose statistical models to estimate distribut...

Descripción completa

Detalles Bibliográficos
Autores principales: Jones, Mark, Hockey, Richard, Mishra, Gita D, Dobson, Annette
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3938907/
https://www.ncbi.nlm.nih.gov/pubmed/24576041
http://dx.doi.org/10.1186/1471-2288-14-32
_version_ 1782305679046868992
author Jones, Mark
Hockey, Richard
Mishra, Gita D
Dobson, Annette
author_facet Jones, Mark
Hockey, Richard
Mishra, Gita D
Dobson, Annette
author_sort Jones, Mark
collection PubMed
description BACKGROUND: Graphical techniques can provide visually compelling insights into complex data patterns. In this paper we present a type of lasagne plot showing changes in categorical variables for participants measured at regular intervals over time and propose statistical models to estimate distributions of marginal and transitional probabilities. METHODS: The plot uses stacked bars to show the distribution of categorical variables at each time interval, with different colours to depict different categories and changes in colours showing trajectories of participants over time. The models are based on nominal logistic regression which is appropriate for both ordinal and nominal categorical variables. To illustrate the plots and models we analyse data on smoking status, body mass index (BMI) and physical activity level from a longitudinal study on women’s health. To estimate marginal distributions we fit survey wave as an explanatory variable whereas for transitional distributions we fit status of participants (e.g. smoking status) at previous surveys. RESULTS: For the illustrative data the marginal models showed BMI increasing, physical activity decreasing and smoking decreasing linearly over time at the population level. The plots and transition models showed smoking status to be highly predictable for individuals whereas BMI was only moderately predictable and physical activity was virtually unpredictable. Most of the predictive power was obtained from participant status at the previous survey. Predicted probabilities from the models mostly agreed with observed probabilities indicating adequate goodness-of-fit. CONCLUSIONS: The proposed form of lasagne plot provides a simple visual aid to show transitions in categorical variables over time in longitudinal studies. The suggested models complement the plot and allow formal testing and estimation of marginal and transitional distributions. These simple tools can provide valuable insights into categorical data on individuals measured at regular intervals over time.
format Online
Article
Text
id pubmed-3938907
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39389072014-03-10 Visualising and modelling changes in categorical variables in longitudinal studies Jones, Mark Hockey, Richard Mishra, Gita D Dobson, Annette BMC Med Res Methodol Research Article BACKGROUND: Graphical techniques can provide visually compelling insights into complex data patterns. In this paper we present a type of lasagne plot showing changes in categorical variables for participants measured at regular intervals over time and propose statistical models to estimate distributions of marginal and transitional probabilities. METHODS: The plot uses stacked bars to show the distribution of categorical variables at each time interval, with different colours to depict different categories and changes in colours showing trajectories of participants over time. The models are based on nominal logistic regression which is appropriate for both ordinal and nominal categorical variables. To illustrate the plots and models we analyse data on smoking status, body mass index (BMI) and physical activity level from a longitudinal study on women’s health. To estimate marginal distributions we fit survey wave as an explanatory variable whereas for transitional distributions we fit status of participants (e.g. smoking status) at previous surveys. RESULTS: For the illustrative data the marginal models showed BMI increasing, physical activity decreasing and smoking decreasing linearly over time at the population level. The plots and transition models showed smoking status to be highly predictable for individuals whereas BMI was only moderately predictable and physical activity was virtually unpredictable. Most of the predictive power was obtained from participant status at the previous survey. Predicted probabilities from the models mostly agreed with observed probabilities indicating adequate goodness-of-fit. CONCLUSIONS: The proposed form of lasagne plot provides a simple visual aid to show transitions in categorical variables over time in longitudinal studies. The suggested models complement the plot and allow formal testing and estimation of marginal and transitional distributions. These simple tools can provide valuable insights into categorical data on individuals measured at regular intervals over time. BioMed Central 2014-02-27 /pmc/articles/PMC3938907/ /pubmed/24576041 http://dx.doi.org/10.1186/1471-2288-14-32 Text en Copyright © 2014 Jones et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Jones, Mark
Hockey, Richard
Mishra, Gita D
Dobson, Annette
Visualising and modelling changes in categorical variables in longitudinal studies
title Visualising and modelling changes in categorical variables in longitudinal studies
title_full Visualising and modelling changes in categorical variables in longitudinal studies
title_fullStr Visualising and modelling changes in categorical variables in longitudinal studies
title_full_unstemmed Visualising and modelling changes in categorical variables in longitudinal studies
title_short Visualising and modelling changes in categorical variables in longitudinal studies
title_sort visualising and modelling changes in categorical variables in longitudinal studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3938907/
https://www.ncbi.nlm.nih.gov/pubmed/24576041
http://dx.doi.org/10.1186/1471-2288-14-32
work_keys_str_mv AT jonesmark visualisingandmodellingchangesincategoricalvariablesinlongitudinalstudies
AT hockeyrichard visualisingandmodellingchangesincategoricalvariablesinlongitudinalstudies
AT mishragitad visualisingandmodellingchangesincategoricalvariablesinlongitudinalstudies
AT dobsonannette visualisingandmodellingchangesincategoricalvariablesinlongitudinalstudies