Cargando…

The (Ir)Responsibility of (Under)Estimating Missing Data

It is practically impossible to avoid losing data in the course of an investigation, and it has been proven that the consequences can reach such magnitude that they could even invalidate the results of the study. This paper describes some of the most likely causes of missing data in research in the...

Descripción completa

Detalles Bibliográficos
Autores principales: Fernández-García, María P., Vallejo-Seco, Guillermo, Livácic-Rojas, Pablo, Tuero-Herrero, Ellian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5920143/
https://www.ncbi.nlm.nih.gov/pubmed/29731731
http://dx.doi.org/10.3389/fpsyg.2018.00556
_version_ 1783317775365701632
author Fernández-García, María P.
Vallejo-Seco, Guillermo
Livácic-Rojas, Pablo
Tuero-Herrero, Ellian
author_facet Fernández-García, María P.
Vallejo-Seco, Guillermo
Livácic-Rojas, Pablo
Tuero-Herrero, Ellian
author_sort Fernández-García, María P.
collection PubMed
description It is practically impossible to avoid losing data in the course of an investigation, and it has been proven that the consequences can reach such magnitude that they could even invalidate the results of the study. This paper describes some of the most likely causes of missing data in research in the field of clinical psychology and the consequences they may have on statistical and substantive inferences. When it is necessary to recover the missing information, analyzing the data can become extremely complex. We summarize the experts' recommendations regarding the most powerful procedures for performing this task, the advantages each one has over the others, the elements that can or should influence our choice, and the procedures that are not a recommended option except in very exceptional cases. We conclude by offering four pieces of advice, on which all the experts agree and to which we must attend at all times in order to proceed with the greatest possible success. Finally, we show the pernicious effects produced by missing data on the statistical result and on the substantive or clinical conclusions. For this purpose we have planned to lose data in different percentage rates under two mechanisms of loss of data, MCAR and MAR in the complete data set of two very different real researchs, and we proceed to analyze the set of the available data, listwise deletion. One study is carried out using a quasi-experimental non-equivalent control group design, and another study using a experimental design completely randomized
format Online
Article
Text
id pubmed-5920143
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-59201432018-05-04 The (Ir)Responsibility of (Under)Estimating Missing Data Fernández-García, María P. Vallejo-Seco, Guillermo Livácic-Rojas, Pablo Tuero-Herrero, Ellian Front Psychol Psychology It is practically impossible to avoid losing data in the course of an investigation, and it has been proven that the consequences can reach such magnitude that they could even invalidate the results of the study. This paper describes some of the most likely causes of missing data in research in the field of clinical psychology and the consequences they may have on statistical and substantive inferences. When it is necessary to recover the missing information, analyzing the data can become extremely complex. We summarize the experts' recommendations regarding the most powerful procedures for performing this task, the advantages each one has over the others, the elements that can or should influence our choice, and the procedures that are not a recommended option except in very exceptional cases. We conclude by offering four pieces of advice, on which all the experts agree and to which we must attend at all times in order to proceed with the greatest possible success. Finally, we show the pernicious effects produced by missing data on the statistical result and on the substantive or clinical conclusions. For this purpose we have planned to lose data in different percentage rates under two mechanisms of loss of data, MCAR and MAR in the complete data set of two very different real researchs, and we proceed to analyze the set of the available data, listwise deletion. One study is carried out using a quasi-experimental non-equivalent control group design, and another study using a experimental design completely randomized Frontiers Media S.A. 2018-04-20 /pmc/articles/PMC5920143/ /pubmed/29731731 http://dx.doi.org/10.3389/fpsyg.2018.00556 Text en Copyright © 2018 Fernández-García, Vallejo-Seco, Livácic-Rojas and Tuero-Herrero. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychology
Fernández-García, María P.
Vallejo-Seco, Guillermo
Livácic-Rojas, Pablo
Tuero-Herrero, Ellian
The (Ir)Responsibility of (Under)Estimating Missing Data
title The (Ir)Responsibility of (Under)Estimating Missing Data
title_full The (Ir)Responsibility of (Under)Estimating Missing Data
title_fullStr The (Ir)Responsibility of (Under)Estimating Missing Data
title_full_unstemmed The (Ir)Responsibility of (Under)Estimating Missing Data
title_short The (Ir)Responsibility of (Under)Estimating Missing Data
title_sort (ir)responsibility of (under)estimating missing data
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5920143/
https://www.ncbi.nlm.nih.gov/pubmed/29731731
http://dx.doi.org/10.3389/fpsyg.2018.00556
work_keys_str_mv AT fernandezgarciamariap theirresponsibilityofunderestimatingmissingdata
AT vallejosecoguillermo theirresponsibilityofunderestimatingmissingdata
AT livacicrojaspablo theirresponsibilityofunderestimatingmissingdata
AT tueroherreroellian theirresponsibilityofunderestimatingmissingdata
AT fernandezgarciamariap irresponsibilityofunderestimatingmissingdata
AT vallejosecoguillermo irresponsibilityofunderestimatingmissingdata
AT livacicrojaspablo irresponsibilityofunderestimatingmissingdata
AT tueroherreroellian irresponsibilityofunderestimatingmissingdata