Cargando…

Class enumeration false positive in skew-t family of continuous growth mixture models

Growth Mixture Modeling (GMM) has gained great popularity in the last decades as a methodology for longitudinal data analysis. The usual assumption of normally distributed repeated measures has been shown as problematic in real-life data applications. Namely, performing normal GMM on data that is ev...

Descripción completa

Detalles Bibliográficos
Autores principales: Guerra-Peña, Kiero, García-Batista, Zoilo Emilio, Depaoli, Sarah, Garrido, Luis Eduardo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7164627/
https://www.ncbi.nlm.nih.gov/pubmed/32302350
http://dx.doi.org/10.1371/journal.pone.0231525
_version_ 1783523325340811264
author Guerra-Peña, Kiero
García-Batista, Zoilo Emilio
Depaoli, Sarah
Garrido, Luis Eduardo
author_facet Guerra-Peña, Kiero
García-Batista, Zoilo Emilio
Depaoli, Sarah
Garrido, Luis Eduardo
author_sort Guerra-Peña, Kiero
collection PubMed
description Growth Mixture Modeling (GMM) has gained great popularity in the last decades as a methodology for longitudinal data analysis. The usual assumption of normally distributed repeated measures has been shown as problematic in real-life data applications. Namely, performing normal GMM on data that is even slightly skewed can lead to an over selection of the number of latent classes. In order to ameliorate this unwanted result, GMM based on the skew t family of continuous distributions has been proposed. This family of distributions includes the normal, skew normal, t, and skew t. This simulation study aims to determine the efficiency of selecting the “true” number of latent groups in GMM based on the skew t family of continuous distributions, using fit indices and likelihood ratio tests. Results show that the skew t GMM was the only model considered that showed fit indices and LRT false positive rates under the 0.05 cutoff value across sample sizes and for normal, and skewed and kurtic data. Simulation results are corroborated by a real educational data application example. These findings favor the development of practical guides of the benefits and risks of using the GMM based on this family of distributions.
format Online
Article
Text
id pubmed-7164627
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-71646272020-04-22 Class enumeration false positive in skew-t family of continuous growth mixture models Guerra-Peña, Kiero García-Batista, Zoilo Emilio Depaoli, Sarah Garrido, Luis Eduardo PLoS One Research Article Growth Mixture Modeling (GMM) has gained great popularity in the last decades as a methodology for longitudinal data analysis. The usual assumption of normally distributed repeated measures has been shown as problematic in real-life data applications. Namely, performing normal GMM on data that is even slightly skewed can lead to an over selection of the number of latent classes. In order to ameliorate this unwanted result, GMM based on the skew t family of continuous distributions has been proposed. This family of distributions includes the normal, skew normal, t, and skew t. This simulation study aims to determine the efficiency of selecting the “true” number of latent groups in GMM based on the skew t family of continuous distributions, using fit indices and likelihood ratio tests. Results show that the skew t GMM was the only model considered that showed fit indices and LRT false positive rates under the 0.05 cutoff value across sample sizes and for normal, and skewed and kurtic data. Simulation results are corroborated by a real educational data application example. These findings favor the development of practical guides of the benefits and risks of using the GMM based on this family of distributions. Public Library of Science 2020-04-17 /pmc/articles/PMC7164627/ /pubmed/32302350 http://dx.doi.org/10.1371/journal.pone.0231525 Text en © 2020 Guerra-Peña et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Guerra-Peña, Kiero
García-Batista, Zoilo Emilio
Depaoli, Sarah
Garrido, Luis Eduardo
Class enumeration false positive in skew-t family of continuous growth mixture models
title Class enumeration false positive in skew-t family of continuous growth mixture models
title_full Class enumeration false positive in skew-t family of continuous growth mixture models
title_fullStr Class enumeration false positive in skew-t family of continuous growth mixture models
title_full_unstemmed Class enumeration false positive in skew-t family of continuous growth mixture models
title_short Class enumeration false positive in skew-t family of continuous growth mixture models
title_sort class enumeration false positive in skew-t family of continuous growth mixture models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7164627/
https://www.ncbi.nlm.nih.gov/pubmed/32302350
http://dx.doi.org/10.1371/journal.pone.0231525
work_keys_str_mv AT guerrapenakiero classenumerationfalsepositiveinskewtfamilyofcontinuousgrowthmixturemodels
AT garciabatistazoiloemilio classenumerationfalsepositiveinskewtfamilyofcontinuousgrowthmixturemodels
AT depaolisarah classenumerationfalsepositiveinskewtfamilyofcontinuousgrowthmixturemodels
AT garridoluiseduardo classenumerationfalsepositiveinskewtfamilyofcontinuousgrowthmixturemodels