Cargando…

The effect of missing levels of nesting in multilevel analysis

Multilevel analysis is an appropriate and powerful tool for analyzing hierarchical structure data widely applied from public health to genomic data. In practice, however, we may lose the information on multiple nesting levels in the multilevel analysis since data may fail to capture all levels of hi...

Descripción completa

Detalles Bibliográficos
Autores principales: Park, Seho, Chung, Yujin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korea Genome Organization 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9576476/
https://www.ncbi.nlm.nih.gov/pubmed/36239111
http://dx.doi.org/10.5808/gi.22052
_version_ 1784811536796090368
author Park, Seho
Chung, Yujin
author_facet Park, Seho
Chung, Yujin
author_sort Park, Seho
collection PubMed
description Multilevel analysis is an appropriate and powerful tool for analyzing hierarchical structure data widely applied from public health to genomic data. In practice, however, we may lose the information on multiple nesting levels in the multilevel analysis since data may fail to capture all levels of hierarchy, or the top or intermediate levels of hierarchy are ignored in the analysis. In this study, we consider a multilevel linear mixed effect model (LMM) with single imputation that can involve all data hierarchy levels in the presence of missing top or intermediate-level clusters. We evaluate and compare the performance of a multilevel LMM with single imputation with other models ignoring the data hierarchy or missing intermediate-level clusters. To this end, we applied a multilevel LMM with single imputation and other models to hierarchically structured cohort data with some intermediate levels missing and to simulated data with various cluster sizes and missing rates of intermediate-level clusters. A thorough simulation study demonstrated that an LMM with single imputation estimates fixed coefficients and variance components of a multilevel model more accurately than other models ignoring data hierarchy or missing clusters in terms of mean squared error and coverage probability. In particular, when models ignoring data hierarchy or missing clusters were applied, the variance components of random effects were overestimated. We observed similar results from the analysis of hierarchically structured cohort data.
format Online
Article
Text
id pubmed-9576476
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Korea Genome Organization
record_format MEDLINE/PubMed
spelling pubmed-95764762022-10-19 The effect of missing levels of nesting in multilevel analysis Park, Seho Chung, Yujin Genomics Inform Original Article Multilevel analysis is an appropriate and powerful tool for analyzing hierarchical structure data widely applied from public health to genomic data. In practice, however, we may lose the information on multiple nesting levels in the multilevel analysis since data may fail to capture all levels of hierarchy, or the top or intermediate levels of hierarchy are ignored in the analysis. In this study, we consider a multilevel linear mixed effect model (LMM) with single imputation that can involve all data hierarchy levels in the presence of missing top or intermediate-level clusters. We evaluate and compare the performance of a multilevel LMM with single imputation with other models ignoring the data hierarchy or missing intermediate-level clusters. To this end, we applied a multilevel LMM with single imputation and other models to hierarchically structured cohort data with some intermediate levels missing and to simulated data with various cluster sizes and missing rates of intermediate-level clusters. A thorough simulation study demonstrated that an LMM with single imputation estimates fixed coefficients and variance components of a multilevel model more accurately than other models ignoring data hierarchy or missing clusters in terms of mean squared error and coverage probability. In particular, when models ignoring data hierarchy or missing clusters were applied, the variance components of random effects were overestimated. We observed similar results from the analysis of hierarchically structured cohort data. Korea Genome Organization 2022-09-30 /pmc/articles/PMC9576476/ /pubmed/36239111 http://dx.doi.org/10.5808/gi.22052 Text en (c) 2022, Korea Genome Organization https://creativecommons.org/licenses/by/4.0/(CC) This is an open-access article distributed under the terms of the Creative Commons Attribution license(https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Park, Seho
Chung, Yujin
The effect of missing levels of nesting in multilevel analysis
title The effect of missing levels of nesting in multilevel analysis
title_full The effect of missing levels of nesting in multilevel analysis
title_fullStr The effect of missing levels of nesting in multilevel analysis
title_full_unstemmed The effect of missing levels of nesting in multilevel analysis
title_short The effect of missing levels of nesting in multilevel analysis
title_sort effect of missing levels of nesting in multilevel analysis
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9576476/
https://www.ncbi.nlm.nih.gov/pubmed/36239111
http://dx.doi.org/10.5808/gi.22052
work_keys_str_mv AT parkseho theeffectofmissinglevelsofnestinginmultilevelanalysis
AT chungyujin theeffectofmissinglevelsofnestinginmultilevelanalysis
AT parkseho effectofmissinglevelsofnestinginmultilevelanalysis
AT chungyujin effectofmissinglevelsofnestinginmultilevelanalysis