Cargando…

Multiple Imputation by Fully Conditional Specification for Dealing with Missing Data in a Large Epidemiologic Study

Missing data commonly occur in large epidemiologic studies. Ignoring incompleteness or handling the data inappropriately may bias study results, reduce power and efficiency, and alter important risk/benefit relationships. Standard ways of dealing with missing values, such as complete case analysis (...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Yang, De, Anindya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4945131/
https://www.ncbi.nlm.nih.gov/pubmed/27429686
http://dx.doi.org/10.6000/1929-6029.2015.04.03.7
_version_ 1782442872870535168
author Liu, Yang
De, Anindya
author_facet Liu, Yang
De, Anindya
author_sort Liu, Yang
collection PubMed
description Missing data commonly occur in large epidemiologic studies. Ignoring incompleteness or handling the data inappropriately may bias study results, reduce power and efficiency, and alter important risk/benefit relationships. Standard ways of dealing with missing values, such as complete case analysis (CCA), are generally inappropriate due to the loss of precision and risk of bias. Multiple imputation by fully conditional specification (FCS MI) is a powerful and statistically valid method for creating imputations in large data sets which include both categorical and continuous variables. It specifies the multivariate imputation model on a variable-by-variable basis and offers a principled yet flexible method of addressing missing data, which is particularly useful for large data sets with complex data structures. However, FCS MI is still rarely used in epidemiology, and few practical resources exist to guide researchers in the implementation of this technique. We demonstrate the application of FCS MI in support of a large epidemiologic study evaluating national blood utilization patterns in a sub-Saharan African country. A number of practical tips and guidelines for implementing FCS MI based on this experience are described.
format Online
Article
Text
id pubmed-4945131
institution National Center for Biotechnology Information
language English
publishDate 2015
record_format MEDLINE/PubMed
spelling pubmed-49451312016-07-14 Multiple Imputation by Fully Conditional Specification for Dealing with Missing Data in a Large Epidemiologic Study Liu, Yang De, Anindya Int J Stat Med Res Article Missing data commonly occur in large epidemiologic studies. Ignoring incompleteness or handling the data inappropriately may bias study results, reduce power and efficiency, and alter important risk/benefit relationships. Standard ways of dealing with missing values, such as complete case analysis (CCA), are generally inappropriate due to the loss of precision and risk of bias. Multiple imputation by fully conditional specification (FCS MI) is a powerful and statistically valid method for creating imputations in large data sets which include both categorical and continuous variables. It specifies the multivariate imputation model on a variable-by-variable basis and offers a principled yet flexible method of addressing missing data, which is particularly useful for large data sets with complex data structures. However, FCS MI is still rarely used in epidemiology, and few practical resources exist to guide researchers in the implementation of this technique. We demonstrate the application of FCS MI in support of a large epidemiologic study evaluating national blood utilization patterns in a sub-Saharan African country. A number of practical tips and guidelines for implementing FCS MI based on this experience are described. 2015-08-19 2015 /pmc/articles/PMC4945131/ /pubmed/27429686 http://dx.doi.org/10.6000/1929-6029.2015.04.03.7 Text en http://creativecommons.org/licenses/by-nc/3.0/ This is an open access article licensed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) which permits unrestricted, non-commercial use, distribution and reproduction in any medium, provided the work is properly cited.
spellingShingle Article
Liu, Yang
De, Anindya
Multiple Imputation by Fully Conditional Specification for Dealing with Missing Data in a Large Epidemiologic Study
title Multiple Imputation by Fully Conditional Specification for Dealing with Missing Data in a Large Epidemiologic Study
title_full Multiple Imputation by Fully Conditional Specification for Dealing with Missing Data in a Large Epidemiologic Study
title_fullStr Multiple Imputation by Fully Conditional Specification for Dealing with Missing Data in a Large Epidemiologic Study
title_full_unstemmed Multiple Imputation by Fully Conditional Specification for Dealing with Missing Data in a Large Epidemiologic Study
title_short Multiple Imputation by Fully Conditional Specification for Dealing with Missing Data in a Large Epidemiologic Study
title_sort multiple imputation by fully conditional specification for dealing with missing data in a large epidemiologic study
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4945131/
https://www.ncbi.nlm.nih.gov/pubmed/27429686
http://dx.doi.org/10.6000/1929-6029.2015.04.03.7
work_keys_str_mv AT liuyang multipleimputationbyfullyconditionalspecificationfordealingwithmissingdatainalargeepidemiologicstudy
AT deanindya multipleimputationbyfullyconditionalspecificationfordealingwithmissingdatainalargeepidemiologicstudy