Cargando…
Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data
Multiple imputation (MI) has been widely used for handling missing data in biomedical research. In the presence of high-dimensional data, regularized regression has been used as a natural strategy for building imputation models, but limited research has been conducted for handling general missing da...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4751511/ https://www.ncbi.nlm.nih.gov/pubmed/26868061 http://dx.doi.org/10.1038/srep21689 |
_version_ | 1782415597630390272 |
---|---|
author | Deng, Yi Chang, Changgee Ido, Moges Seyoum Long, Qi |
author_facet | Deng, Yi Chang, Changgee Ido, Moges Seyoum Long, Qi |
author_sort | Deng, Yi |
collection | PubMed |
description | Multiple imputation (MI) has been widely used for handling missing data in biomedical research. In the presence of high-dimensional data, regularized regression has been used as a natural strategy for building imputation models, but limited research has been conducted for handling general missing data patterns where multiple variables have missing values. Using the idea of multiple imputation by chained equations (MICE), we investigate two approaches of using regularized regression to impute missing values of high-dimensional data that can handle general missing data patterns. We compare our MICE methods with several existing imputation methods in simulation studies. Our simulation results demonstrate the superiority of the proposed MICE approach based on an indirect use of regularized regression in terms of bias. We further illustrate the proposed methods using two data examples. |
format | Online Article Text |
id | pubmed-4751511 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-47515112016-02-22 Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data Deng, Yi Chang, Changgee Ido, Moges Seyoum Long, Qi Sci Rep Article Multiple imputation (MI) has been widely used for handling missing data in biomedical research. In the presence of high-dimensional data, regularized regression has been used as a natural strategy for building imputation models, but limited research has been conducted for handling general missing data patterns where multiple variables have missing values. Using the idea of multiple imputation by chained equations (MICE), we investigate two approaches of using regularized regression to impute missing values of high-dimensional data that can handle general missing data patterns. We compare our MICE methods with several existing imputation methods in simulation studies. Our simulation results demonstrate the superiority of the proposed MICE approach based on an indirect use of regularized regression in terms of bias. We further illustrate the proposed methods using two data examples. Nature Publishing Group 2016-02-12 /pmc/articles/PMC4751511/ /pubmed/26868061 http://dx.doi.org/10.1038/srep21689 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Deng, Yi Chang, Changgee Ido, Moges Seyoum Long, Qi Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data |
title | Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data |
title_full | Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data |
title_fullStr | Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data |
title_full_unstemmed | Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data |
title_short | Multiple Imputation for General Missing Data Patterns in the Presence of High-dimensional Data |
title_sort | multiple imputation for general missing data patterns in the presence of high-dimensional data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4751511/ https://www.ncbi.nlm.nih.gov/pubmed/26868061 http://dx.doi.org/10.1038/srep21689 |
work_keys_str_mv | AT dengyi multipleimputationforgeneralmissingdatapatternsinthepresenceofhighdimensionaldata AT changchanggee multipleimputationforgeneralmissingdatapatternsinthepresenceofhighdimensionaldata AT idomogesseyoum multipleimputationforgeneralmissingdatapatternsinthepresenceofhighdimensionaldata AT longqi multipleimputationforgeneralmissingdatapatternsinthepresenceofhighdimensionaldata |