Cargando…
Methods for Handling Missing Data in the Behavioral Neurosciences: Don’t Throw the Baby Rat out with the Bath Water
Missing data are a major problem in the behavioral neurosciences, particularly when data collection is costly. Often researchers exclude cases with missing data, which can result in biased estimates and reduced power. Trying to avoid the deletion of a case because of a missing data point can be cond...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Faculty for Undergraduate Neuroscience
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592650/ https://www.ncbi.nlm.nih.gov/pubmed/23493038 |
_version_ | 1782262151005601792 |
---|---|
author | Rubin, Leah H. Witkiewitz, Katie Andre, Justin St. Reilly, Steve |
author_facet | Rubin, Leah H. Witkiewitz, Katie Andre, Justin St. Reilly, Steve |
author_sort | Rubin, Leah H. |
collection | PubMed |
description | Missing data are a major problem in the behavioral neurosciences, particularly when data collection is costly. Often researchers exclude cases with missing data, which can result in biased estimates and reduced power. Trying to avoid the deletion of a case because of a missing data point can be conducted, but implementing a naïve missing data method can result in distorted estimates and incorrect conclusions. New approaches for handling missing data have been developed but these techniques are not typically included in undergraduate research methods texts. The topic of missing data techniques would be useful for teaching research methods and for helping students with their research projects. This paper aimed to illustrate that estimating missing data is often more efficacious than complete case analysis, otherwise known as listwise deletion. Longitudinal data was obtained from an experiment examining the effects of an anorectic drug on food consumption in a small sample (n=17) of rats. The complete dataset was degraded by removing a percentage of datapoints (1–5%, 10%). Four missing data techniques: listwise deletion, mean substitution, regression, and expectation-maximization (EM) were applied to all six datasets to ensure that each approach was applied to the same missing data points. P-values, effect sizes, and Bayes factors were computed. Results demonstrated listwise deletion was the least effective method. EM and regression imputation were the preferred methods when more than 5% of the data were missing. Based on these findings it is recommended that researchers avoid using listwise deletion and consider alternative missing data techniques. |
format | Online Article Text |
id | pubmed-3592650 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | Faculty for Undergraduate Neuroscience |
record_format | MEDLINE/PubMed |
spelling | pubmed-35926502013-03-14 Methods for Handling Missing Data in the Behavioral Neurosciences: Don’t Throw the Baby Rat out with the Bath Water Rubin, Leah H. Witkiewitz, Katie Andre, Justin St. Reilly, Steve J Undergrad Neurosci Educ Articles Missing data are a major problem in the behavioral neurosciences, particularly when data collection is costly. Often researchers exclude cases with missing data, which can result in biased estimates and reduced power. Trying to avoid the deletion of a case because of a missing data point can be conducted, but implementing a naïve missing data method can result in distorted estimates and incorrect conclusions. New approaches for handling missing data have been developed but these techniques are not typically included in undergraduate research methods texts. The topic of missing data techniques would be useful for teaching research methods and for helping students with their research projects. This paper aimed to illustrate that estimating missing data is often more efficacious than complete case analysis, otherwise known as listwise deletion. Longitudinal data was obtained from an experiment examining the effects of an anorectic drug on food consumption in a small sample (n=17) of rats. The complete dataset was degraded by removing a percentage of datapoints (1–5%, 10%). Four missing data techniques: listwise deletion, mean substitution, regression, and expectation-maximization (EM) were applied to all six datasets to ensure that each approach was applied to the same missing data points. P-values, effect sizes, and Bayes factors were computed. Results demonstrated listwise deletion was the least effective method. EM and regression imputation were the preferred methods when more than 5% of the data were missing. Based on these findings it is recommended that researchers avoid using listwise deletion and consider alternative missing data techniques. Faculty for Undergraduate Neuroscience 2007-06-15 /pmc/articles/PMC3592650/ /pubmed/23493038 Text en Copyright © 2007 Faculty for Undergraduate Neuroscience |
spellingShingle | Articles Rubin, Leah H. Witkiewitz, Katie Andre, Justin St. Reilly, Steve Methods for Handling Missing Data in the Behavioral Neurosciences: Don’t Throw the Baby Rat out with the Bath Water |
title | Methods for Handling Missing Data in the Behavioral Neurosciences: Don’t Throw the Baby Rat out with the Bath Water |
title_full | Methods for Handling Missing Data in the Behavioral Neurosciences: Don’t Throw the Baby Rat out with the Bath Water |
title_fullStr | Methods for Handling Missing Data in the Behavioral Neurosciences: Don’t Throw the Baby Rat out with the Bath Water |
title_full_unstemmed | Methods for Handling Missing Data in the Behavioral Neurosciences: Don’t Throw the Baby Rat out with the Bath Water |
title_short | Methods for Handling Missing Data in the Behavioral Neurosciences: Don’t Throw the Baby Rat out with the Bath Water |
title_sort | methods for handling missing data in the behavioral neurosciences: don’t throw the baby rat out with the bath water |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592650/ https://www.ncbi.nlm.nih.gov/pubmed/23493038 |
work_keys_str_mv | AT rubinleahh methodsforhandlingmissingdatainthebehavioralneurosciencesdontthrowthebabyratoutwiththebathwater AT witkiewitzkatie methodsforhandlingmissingdatainthebehavioralneurosciencesdontthrowthebabyratoutwiththebathwater AT andrejustinst methodsforhandlingmissingdatainthebehavioralneurosciencesdontthrowthebabyratoutwiththebathwater AT reillysteve methodsforhandlingmissingdatainthebehavioralneurosciencesdontthrowthebabyratoutwiththebathwater |