Cargando…

Methods for Handling Missing Data in the Behavioral Neurosciences: Don’t Throw the Baby Rat out with the Bath Water

Missing data are a major problem in the behavioral neurosciences, particularly when data collection is costly. Often researchers exclude cases with missing data, which can result in biased estimates and reduced power. Trying to avoid the deletion of a case because of a missing data point can be cond...

Descripción completa

Detalles Bibliográficos
Autores principales: Rubin, Leah H., Witkiewitz, Katie, Andre, Justin St., Reilly, Steve
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Faculty for Undergraduate Neuroscience 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592650/
https://www.ncbi.nlm.nih.gov/pubmed/23493038
_version_ 1782262151005601792
author Rubin, Leah H.
Witkiewitz, Katie
Andre, Justin St.
Reilly, Steve
author_facet Rubin, Leah H.
Witkiewitz, Katie
Andre, Justin St.
Reilly, Steve
author_sort Rubin, Leah H.
collection PubMed
description Missing data are a major problem in the behavioral neurosciences, particularly when data collection is costly. Often researchers exclude cases with missing data, which can result in biased estimates and reduced power. Trying to avoid the deletion of a case because of a missing data point can be conducted, but implementing a naïve missing data method can result in distorted estimates and incorrect conclusions. New approaches for handling missing data have been developed but these techniques are not typically included in undergraduate research methods texts. The topic of missing data techniques would be useful for teaching research methods and for helping students with their research projects. This paper aimed to illustrate that estimating missing data is often more efficacious than complete case analysis, otherwise known as listwise deletion. Longitudinal data was obtained from an experiment examining the effects of an anorectic drug on food consumption in a small sample (n=17) of rats. The complete dataset was degraded by removing a percentage of datapoints (1–5%, 10%). Four missing data techniques: listwise deletion, mean substitution, regression, and expectation-maximization (EM) were applied to all six datasets to ensure that each approach was applied to the same missing data points. P-values, effect sizes, and Bayes factors were computed. Results demonstrated listwise deletion was the least effective method. EM and regression imputation were the preferred methods when more than 5% of the data were missing. Based on these findings it is recommended that researchers avoid using listwise deletion and consider alternative missing data techniques.
format Online
Article
Text
id pubmed-3592650
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Faculty for Undergraduate Neuroscience
record_format MEDLINE/PubMed
spelling pubmed-35926502013-03-14 Methods for Handling Missing Data in the Behavioral Neurosciences: Don’t Throw the Baby Rat out with the Bath Water Rubin, Leah H. Witkiewitz, Katie Andre, Justin St. Reilly, Steve J Undergrad Neurosci Educ Articles Missing data are a major problem in the behavioral neurosciences, particularly when data collection is costly. Often researchers exclude cases with missing data, which can result in biased estimates and reduced power. Trying to avoid the deletion of a case because of a missing data point can be conducted, but implementing a naïve missing data method can result in distorted estimates and incorrect conclusions. New approaches for handling missing data have been developed but these techniques are not typically included in undergraduate research methods texts. The topic of missing data techniques would be useful for teaching research methods and for helping students with their research projects. This paper aimed to illustrate that estimating missing data is often more efficacious than complete case analysis, otherwise known as listwise deletion. Longitudinal data was obtained from an experiment examining the effects of an anorectic drug on food consumption in a small sample (n=17) of rats. The complete dataset was degraded by removing a percentage of datapoints (1–5%, 10%). Four missing data techniques: listwise deletion, mean substitution, regression, and expectation-maximization (EM) were applied to all six datasets to ensure that each approach was applied to the same missing data points. P-values, effect sizes, and Bayes factors were computed. Results demonstrated listwise deletion was the least effective method. EM and regression imputation were the preferred methods when more than 5% of the data were missing. Based on these findings it is recommended that researchers avoid using listwise deletion and consider alternative missing data techniques. Faculty for Undergraduate Neuroscience 2007-06-15 /pmc/articles/PMC3592650/ /pubmed/23493038 Text en Copyright © 2007 Faculty for Undergraduate Neuroscience
spellingShingle Articles
Rubin, Leah H.
Witkiewitz, Katie
Andre, Justin St.
Reilly, Steve
Methods for Handling Missing Data in the Behavioral Neurosciences: Don’t Throw the Baby Rat out with the Bath Water
title Methods for Handling Missing Data in the Behavioral Neurosciences: Don’t Throw the Baby Rat out with the Bath Water
title_full Methods for Handling Missing Data in the Behavioral Neurosciences: Don’t Throw the Baby Rat out with the Bath Water
title_fullStr Methods for Handling Missing Data in the Behavioral Neurosciences: Don’t Throw the Baby Rat out with the Bath Water
title_full_unstemmed Methods for Handling Missing Data in the Behavioral Neurosciences: Don’t Throw the Baby Rat out with the Bath Water
title_short Methods for Handling Missing Data in the Behavioral Neurosciences: Don’t Throw the Baby Rat out with the Bath Water
title_sort methods for handling missing data in the behavioral neurosciences: don’t throw the baby rat out with the bath water
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592650/
https://www.ncbi.nlm.nih.gov/pubmed/23493038
work_keys_str_mv AT rubinleahh methodsforhandlingmissingdatainthebehavioralneurosciencesdontthrowthebabyratoutwiththebathwater
AT witkiewitzkatie methodsforhandlingmissingdatainthebehavioralneurosciencesdontthrowthebabyratoutwiththebathwater
AT andrejustinst methodsforhandlingmissingdatainthebehavioralneurosciencesdontthrowthebabyratoutwiththebathwater
AT reillysteve methodsforhandlingmissingdatainthebehavioralneurosciencesdontthrowthebabyratoutwiththebathwater