Cargando…

Inference in skew generalized t-link models for clustered binary outcome via a parameter-expanded EM algorithm

Binary Generalized Linear Mixed Model (GLMM) is the most common method used by researchers to analyze clustered binary data in biological and social sciences. The traditional approach to GLMMs causes substantial bias in estimates due to steady shape of logistic and normal distribution assumptions th...

Descripción completa

Detalles Bibliográficos
Autores principales: Tovissodé, Chénangnon Frédéric, Diop, Aliou, Glèlè Kakaï, Romain
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8028747/
https://www.ncbi.nlm.nih.gov/pubmed/33822818
http://dx.doi.org/10.1371/journal.pone.0249604
_version_ 1783675997904699392
author Tovissodé, Chénangnon Frédéric
Diop, Aliou
Glèlè Kakaï, Romain
author_facet Tovissodé, Chénangnon Frédéric
Diop, Aliou
Glèlè Kakaï, Romain
author_sort Tovissodé, Chénangnon Frédéric
collection PubMed
description Binary Generalized Linear Mixed Model (GLMM) is the most common method used by researchers to analyze clustered binary data in biological and social sciences. The traditional approach to GLMMs causes substantial bias in estimates due to steady shape of logistic and normal distribution assumptions thereby resulting into wrong and misleading decisions. This study brings forward an approach governed by skew generalized t distributions that belong to a class of potentially skewed and heavy tailed distributions. Interestingly, both the traditional logistic and probit mixed models, as well as other available methods can be utilized within the skew generalized t-link model (SGTLM) frame. We have taken advantage of the Expectation-Maximization algorithm accelerated via parameter-expansion for model fitting. We evaluated the performance of this approach to GLMMs through a simulation experiment by varying sample size and data distribution. Our findings indicated that the proposed methodology outperforms competing approaches in estimating population parameters and predicting random effects, when the traditional link and normality assumptions are violated. In addition, empirical standard errors and information criteria proved useful for detecting spurious skewness and avoiding complex models for probit data. An application with respiratory infection data points out to the superiority of the SGTLM which turns to be the most adequate model. In future, studies should focus on integrating the demonstrated flexibility in other generalized linear mixed models to enhance robust modeling.
format Online
Article
Text
id pubmed-8028747
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-80287472021-04-14 Inference in skew generalized t-link models for clustered binary outcome via a parameter-expanded EM algorithm Tovissodé, Chénangnon Frédéric Diop, Aliou Glèlè Kakaï, Romain PLoS One Research Article Binary Generalized Linear Mixed Model (GLMM) is the most common method used by researchers to analyze clustered binary data in biological and social sciences. The traditional approach to GLMMs causes substantial bias in estimates due to steady shape of logistic and normal distribution assumptions thereby resulting into wrong and misleading decisions. This study brings forward an approach governed by skew generalized t distributions that belong to a class of potentially skewed and heavy tailed distributions. Interestingly, both the traditional logistic and probit mixed models, as well as other available methods can be utilized within the skew generalized t-link model (SGTLM) frame. We have taken advantage of the Expectation-Maximization algorithm accelerated via parameter-expansion for model fitting. We evaluated the performance of this approach to GLMMs through a simulation experiment by varying sample size and data distribution. Our findings indicated that the proposed methodology outperforms competing approaches in estimating population parameters and predicting random effects, when the traditional link and normality assumptions are violated. In addition, empirical standard errors and information criteria proved useful for detecting spurious skewness and avoiding complex models for probit data. An application with respiratory infection data points out to the superiority of the SGTLM which turns to be the most adequate model. In future, studies should focus on integrating the demonstrated flexibility in other generalized linear mixed models to enhance robust modeling. Public Library of Science 2021-04-06 /pmc/articles/PMC8028747/ /pubmed/33822818 http://dx.doi.org/10.1371/journal.pone.0249604 Text en © 2021 Tovissodé et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Tovissodé, Chénangnon Frédéric
Diop, Aliou
Glèlè Kakaï, Romain
Inference in skew generalized t-link models for clustered binary outcome via a parameter-expanded EM algorithm
title Inference in skew generalized t-link models for clustered binary outcome via a parameter-expanded EM algorithm
title_full Inference in skew generalized t-link models for clustered binary outcome via a parameter-expanded EM algorithm
title_fullStr Inference in skew generalized t-link models for clustered binary outcome via a parameter-expanded EM algorithm
title_full_unstemmed Inference in skew generalized t-link models for clustered binary outcome via a parameter-expanded EM algorithm
title_short Inference in skew generalized t-link models for clustered binary outcome via a parameter-expanded EM algorithm
title_sort inference in skew generalized t-link models for clustered binary outcome via a parameter-expanded em algorithm
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8028747/
https://www.ncbi.nlm.nih.gov/pubmed/33822818
http://dx.doi.org/10.1371/journal.pone.0249604
work_keys_str_mv AT tovissodechenangnonfrederic inferenceinskewgeneralizedtlinkmodelsforclusteredbinaryoutcomeviaaparameterexpandedemalgorithm
AT diopaliou inferenceinskewgeneralizedtlinkmodelsforclusteredbinaryoutcomeviaaparameterexpandedemalgorithm
AT glelekakairomain inferenceinskewgeneralizedtlinkmodelsforclusteredbinaryoutcomeviaaparameterexpandedemalgorithm