Longitudinal multiple imputation approaches for body mass index or other variables with very low individual-level variability: the mibmi command in Stata

BACKGROUND: In modern health care systems, the computerization of all aspects of clinical care has led to the development of large data repositories. For example, in the UK, large primary care databases hold millions of electronic medical records, with detailed information on diagnoses, treatments,...

Bibliographic Details
Main authors: Kontopantelis, Evangelos; Parisi, Rosa; Springate, David A.; Reeves, David
Format: Online Article Text
Language: English
Published: BioMed Central 2017
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5234260/
https://www.ncbi.nlm.nih.gov/pubmed/28086961
http://dx.doi.org/10.1186/s13104-016-2365-z
_version_ 1782494971564130304
author Kontopantelis, Evangelos
Parisi, Rosa
Springate, David A.
Reeves, David
collection PubMed
description BACKGROUND: In modern health care systems, the computerization of all aspects of clinical care has led to the development of large data repositories. For example, in the UK, large primary care databases hold millions of electronic medical records, with detailed information on diagnoses, treatments, outcomes and consultations. Careful analyses of these observational datasets of routinely collected data can complement evidence from clinical trials or even answer research questions that cannot be addressed in an experimental setting. However, ‘missingness’ is a common problem for routinely collected data, especially for biological parameters over time. Absence of complete data for the whole of an individual’s study period is a potential bias risk and standard complete-case approaches may lead to biased estimates. However, the structure of the data values makes standard cross-sectional multiple-imputation approaches unsuitable. In this paper we propose and evaluate mibmi, a new command for cleaning and imputing longitudinal body mass index data. RESULTS: The regression-based data cleaning aspects of the algorithm can be useful when researchers analyze messy longitudinal data. Although the multiple imputation algorithm is computationally expensive, it performed similarly to, or even better than, existing alternatives when interpolating observations. CONCLUSION: The mibmi algorithm can be a useful tool for analyzing longitudinal body mass index data, or other longitudinal data with very low individual-level variability.
format Online
Article
Text
id pubmed-5234260
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5234260 2017-01-17 Longitudinal multiple imputation approaches for body mass index or other variables with very low individual-level variability: the mibmi command in Stata Kontopantelis, Evangelos Parisi, Rosa Springate, David A. Reeves, David BMC Res Notes Technical Note BACKGROUND: In modern health care systems, the computerization of all aspects of clinical care has led to the development of large data repositories. For example, in the UK, large primary care databases hold millions of electronic medical records, with detailed information on diagnoses, treatments, outcomes and consultations. Careful analyses of these observational datasets of routinely collected data can complement evidence from clinical trials or even answer research questions that cannot be addressed in an experimental setting. However, ‘missingness’ is a common problem for routinely collected data, especially for biological parameters over time. Absence of complete data for the whole of an individual’s study period is a potential bias risk and standard complete-case approaches may lead to biased estimates. However, the structure of the data values makes standard cross-sectional multiple-imputation approaches unsuitable. In this paper we propose and evaluate mibmi, a new command for cleaning and imputing longitudinal body mass index data. RESULTS: The regression-based data cleaning aspects of the algorithm can be useful when researchers analyze messy longitudinal data. Although the multiple imputation algorithm is computationally expensive, it performed similarly to, or even better than, existing alternatives when interpolating observations. CONCLUSION: The mibmi algorithm can be a useful tool for analyzing longitudinal body mass index data, or other longitudinal data with very low individual-level variability.
BioMed Central 2017-01-13 /pmc/articles/PMC5234260/ /pubmed/28086961 http://dx.doi.org/10.1186/s13104-016-2365-z Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title Longitudinal multiple imputation approaches for body mass index or other variables with very low individual-level variability: the mibmi command in Stata
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5234260/
https://www.ncbi.nlm.nih.gov/pubmed/28086961
http://dx.doi.org/10.1186/s13104-016-2365-z