Cargando…

Evaluating Fit Indices for Multivariate t-Based Structural Equation Modeling with Data Contamination

In conventional structural equation modeling (SEM), with the presence of even a tiny amount of data contamination due to outliers or influential observations, normal-theory maximum likelihood (ML-Normal) is not efficient and can be severely biased. The multivariate-t-based SEM, which recently got im...

Descripción completa

Detalles Bibliográficos
Autores principales: Lai, Mark H. C., Zhang, Jiaqi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5532449/
https://www.ncbi.nlm.nih.gov/pubmed/28804470
http://dx.doi.org/10.3389/fpsyg.2017.01286
_version_ 1783253463928406016
author Lai, Mark H. C.
Zhang, Jiaqi
author_facet Lai, Mark H. C.
Zhang, Jiaqi
author_sort Lai, Mark H. C.
collection PubMed
description In conventional structural equation modeling (SEM), with the presence of even a tiny amount of data contamination due to outliers or influential observations, normal-theory maximum likelihood (ML-Normal) is not efficient and can be severely biased. The multivariate-t-based SEM, which recently got implemented in Mplus as an approach for mixture modeling, represents a robust estimation alternative to downweigh the impact of outliers and influential observations. To our knowledge, the use of maximum likelihood estimation with a multivariate-t model (ML-t) to handle outliers has not been shown in SEM literature. In this paper we demonstrate the use of ML-t using the classic Holzinger and Swineford (1939) data set with a few observations modified as outliers or influential observations. A simulation study is then conducted to examine the performance of fit indices and information criteria under ML-Normal and ML-t in the presence of outliers. Results showed that whereas all fit indices got worse for ML-Normal with increasing amount of outliers and influential observations, their values were relatively stable with ML-t, and the use of information criteria was effective in selecting ML-normal without data contamination and selecting ML-t with data contamination, especially when the sample size was at least 200.
format Online
Article
Text
id pubmed-5532449
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-55324492017-08-11 Evaluating Fit Indices for Multivariate t-Based Structural Equation Modeling with Data Contamination Lai, Mark H. C. Zhang, Jiaqi Front Psychol Psychology In conventional structural equation modeling (SEM), with the presence of even a tiny amount of data contamination due to outliers or influential observations, normal-theory maximum likelihood (ML-Normal) is not efficient and can be severely biased. The multivariate-t-based SEM, which recently got implemented in Mplus as an approach for mixture modeling, represents a robust estimation alternative to downweigh the impact of outliers and influential observations. To our knowledge, the use of maximum likelihood estimation with a multivariate-t model (ML-t) to handle outliers has not been shown in SEM literature. In this paper we demonstrate the use of ML-t using the classic Holzinger and Swineford (1939) data set with a few observations modified as outliers or influential observations. A simulation study is then conducted to examine the performance of fit indices and information criteria under ML-Normal and ML-t in the presence of outliers. Results showed that whereas all fit indices got worse for ML-Normal with increasing amount of outliers and influential observations, their values were relatively stable with ML-t, and the use of information criteria was effective in selecting ML-normal without data contamination and selecting ML-t with data contamination, especially when the sample size was at least 200. Frontiers Media S.A. 2017-07-28 /pmc/articles/PMC5532449/ /pubmed/28804470 http://dx.doi.org/10.3389/fpsyg.2017.01286 Text en Copyright © 2017 Lai and Zhang. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychology
Lai, Mark H. C.
Zhang, Jiaqi
Evaluating Fit Indices for Multivariate t-Based Structural Equation Modeling with Data Contamination
title Evaluating Fit Indices for Multivariate t-Based Structural Equation Modeling with Data Contamination
title_full Evaluating Fit Indices for Multivariate t-Based Structural Equation Modeling with Data Contamination
title_fullStr Evaluating Fit Indices for Multivariate t-Based Structural Equation Modeling with Data Contamination
title_full_unstemmed Evaluating Fit Indices for Multivariate t-Based Structural Equation Modeling with Data Contamination
title_short Evaluating Fit Indices for Multivariate t-Based Structural Equation Modeling with Data Contamination
title_sort evaluating fit indices for multivariate t-based structural equation modeling with data contamination
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5532449/
https://www.ncbi.nlm.nih.gov/pubmed/28804470
http://dx.doi.org/10.3389/fpsyg.2017.01286
work_keys_str_mv AT laimarkhc evaluatingfitindicesformultivariatetbasedstructuralequationmodelingwithdatacontamination
AT zhangjiaqi evaluatingfitindicesformultivariatetbasedstructuralequationmodelingwithdatacontamination