Cargando…

A probit- log- skew-normal mixture model for repeated measures data with excess zeros, with application to a cohort study of paediatric respiratory symptoms

BACKGROUND: A zero-inflated continuous outcome is characterized by occurrence of "excess" zeros that more than a single distribution can explain, with the positive observations forming a skewed distribution. Mixture models are employed for regression analysis of zero-inflated data. Moreove...

Descripción completa

Detalles Bibliográficos
Autores principales: Mahmud, Sadia, Lou, WY Wendy, Johnston, Neil W
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2902491/
https://www.ncbi.nlm.nih.gov/pubmed/20540810
http://dx.doi.org/10.1186/1471-2288-10-55
_version_ 1782183771601108992
author Mahmud, Sadia
Lou, WY Wendy
Johnston, Neil W
author_facet Mahmud, Sadia
Lou, WY Wendy
Johnston, Neil W
author_sort Mahmud, Sadia
collection PubMed
description BACKGROUND: A zero-inflated continuous outcome is characterized by occurrence of "excess" zeros that more than a single distribution can explain, with the positive observations forming a skewed distribution. Mixture models are employed for regression analysis of zero-inflated data. Moreover, for repeated measures zero-inflated data the clustering structure should also be modeled for an adequate analysis. METHODS: Diary of Asthma and Viral Infections Study (DAVIS) was a one year (2004) cohort study conducted at McMaster University to monitor viral infection and respiratory symptoms in children aged 5-11 years with and without asthma. Respiratory symptoms were recorded daily using either an Internet or paper-based diary. Changes in symptoms were assessed by study staff and led to collection of nasal fluid specimens for virological testing. The study objectives included investigating the response of respiratory symptoms to respiratory viral infection in children with and without asthma over a one year period. Due to sparse data daily respiratory symptom scores were aggregated into weekly average scores. More than 70% of the weekly average scores were zero, with the positive scores forming a skewed distribution. We propose a random effects probit/log-skew-normal mixture model to analyze the DAVIS data. The model parameters were estimated using a maximum marginal likelihood approach. A simulation study was conducted to assess the performance of the proposed mixture model if the underlying distribution of the positive response is different from log-skew normal. RESULTS: Viral infection status was highly significant in both probit and log-skew normal model components respectively. The probability of being symptom free was much lower for the week a child was viral positive relative to the week she/he was viral negative. The severity of the symptoms was also greater for the week a child was viral positive. The probability of being symptom free was smaller for asthmatics relative to non-asthmatics throughout the year, whereas there was no difference in the severity of the symptoms between the two groups. CONCLUSIONS: A positive association was observed between viral infection status and both the probability of experiencing any respiratory symptoms, and their severity during the year. For DAVIS data the random effects probit -log skew normal model fits significantly better than the random effects probit -log normal model, endorsing our parametric choice for the model. The simulation study indicates that our proposed model seems to be robust to misspecification of the distribution of the positive skewed response.
format Text
id pubmed-2902491
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29024912010-07-13 A probit- log- skew-normal mixture model for repeated measures data with excess zeros, with application to a cohort study of paediatric respiratory symptoms Mahmud, Sadia Lou, WY Wendy Johnston, Neil W BMC Med Res Methodol Research Article BACKGROUND: A zero-inflated continuous outcome is characterized by occurrence of "excess" zeros that more than a single distribution can explain, with the positive observations forming a skewed distribution. Mixture models are employed for regression analysis of zero-inflated data. Moreover, for repeated measures zero-inflated data the clustering structure should also be modeled for an adequate analysis. METHODS: Diary of Asthma and Viral Infections Study (DAVIS) was a one year (2004) cohort study conducted at McMaster University to monitor viral infection and respiratory symptoms in children aged 5-11 years with and without asthma. Respiratory symptoms were recorded daily using either an Internet or paper-based diary. Changes in symptoms were assessed by study staff and led to collection of nasal fluid specimens for virological testing. The study objectives included investigating the response of respiratory symptoms to respiratory viral infection in children with and without asthma over a one year period. Due to sparse data daily respiratory symptom scores were aggregated into weekly average scores. More than 70% of the weekly average scores were zero, with the positive scores forming a skewed distribution. We propose a random effects probit/log-skew-normal mixture model to analyze the DAVIS data. The model parameters were estimated using a maximum marginal likelihood approach. A simulation study was conducted to assess the performance of the proposed mixture model if the underlying distribution of the positive response is different from log-skew normal. RESULTS: Viral infection status was highly significant in both probit and log-skew normal model components respectively. The probability of being symptom free was much lower for the week a child was viral positive relative to the week she/he was viral negative. The severity of the symptoms was also greater for the week a child was viral positive. The probability of being symptom free was smaller for asthmatics relative to non-asthmatics throughout the year, whereas there was no difference in the severity of the symptoms between the two groups. CONCLUSIONS: A positive association was observed between viral infection status and both the probability of experiencing any respiratory symptoms, and their severity during the year. For DAVIS data the random effects probit -log skew normal model fits significantly better than the random effects probit -log normal model, endorsing our parametric choice for the model. The simulation study indicates that our proposed model seems to be robust to misspecification of the distribution of the positive skewed response. BioMed Central 2010-06-14 /pmc/articles/PMC2902491/ /pubmed/20540810 http://dx.doi.org/10.1186/1471-2288-10-55 Text en Copyright ©2010 Mahmud et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Mahmud, Sadia
Lou, WY Wendy
Johnston, Neil W
A probit- log- skew-normal mixture model for repeated measures data with excess zeros, with application to a cohort study of paediatric respiratory symptoms
title A probit- log- skew-normal mixture model for repeated measures data with excess zeros, with application to a cohort study of paediatric respiratory symptoms
title_full A probit- log- skew-normal mixture model for repeated measures data with excess zeros, with application to a cohort study of paediatric respiratory symptoms
title_fullStr A probit- log- skew-normal mixture model for repeated measures data with excess zeros, with application to a cohort study of paediatric respiratory symptoms
title_full_unstemmed A probit- log- skew-normal mixture model for repeated measures data with excess zeros, with application to a cohort study of paediatric respiratory symptoms
title_short A probit- log- skew-normal mixture model for repeated measures data with excess zeros, with application to a cohort study of paediatric respiratory symptoms
title_sort probit- log- skew-normal mixture model for repeated measures data with excess zeros, with application to a cohort study of paediatric respiratory symptoms
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2902491/
https://www.ncbi.nlm.nih.gov/pubmed/20540810
http://dx.doi.org/10.1186/1471-2288-10-55
work_keys_str_mv AT mahmudsadia aprobitlogskewnormalmixturemodelforrepeatedmeasuresdatawithexcesszeroswithapplicationtoacohortstudyofpaediatricrespiratorysymptoms
AT louwywendy aprobitlogskewnormalmixturemodelforrepeatedmeasuresdatawithexcesszeroswithapplicationtoacohortstudyofpaediatricrespiratorysymptoms
AT johnstonneilw aprobitlogskewnormalmixturemodelforrepeatedmeasuresdatawithexcesszeroswithapplicationtoacohortstudyofpaediatricrespiratorysymptoms
AT mahmudsadia probitlogskewnormalmixturemodelforrepeatedmeasuresdatawithexcesszeroswithapplicationtoacohortstudyofpaediatricrespiratorysymptoms
AT louwywendy probitlogskewnormalmixturemodelforrepeatedmeasuresdatawithexcesszeroswithapplicationtoacohortstudyofpaediatricrespiratorysymptoms
AT johnstonneilw probitlogskewnormalmixturemodelforrepeatedmeasuresdatawithexcesszeroswithapplicationtoacohortstudyofpaediatricrespiratorysymptoms