Cargando…

Underestimation of Pearson’s product moment correlation statistic

Pearson’s product moment correlation coefficient (more commonly Pearson’s r) tends to underestimate correlations that exist in the underlying population. This phenomenon is generally unappreciated in studies of ecology, although a range of corrections are suggested in the statistical literature. The...

Descripción completa

Detalles Bibliográficos
Autores principales: Humphreys, Rosalind K., Puth, Marie-Therese, Neuhäuser, Markus, Ruxton, Graeme D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6323088/
https://www.ncbi.nlm.nih.gov/pubmed/30062565
http://dx.doi.org/10.1007/s00442-018-4233-0
_version_ 1783385688957255680
author Humphreys, Rosalind K.
Puth, Marie-Therese
Neuhäuser, Markus
Ruxton, Graeme D.
author_facet Humphreys, Rosalind K.
Puth, Marie-Therese
Neuhäuser, Markus
Ruxton, Graeme D.
author_sort Humphreys, Rosalind K.
collection PubMed
description Pearson’s product moment correlation coefficient (more commonly Pearson’s r) tends to underestimate correlations that exist in the underlying population. This phenomenon is generally unappreciated in studies of ecology, although a range of corrections are suggested in the statistical literature. The use of Pearson’s r as the classical measure for correlation is widespread in ecology, where manipulative experiments are impractical across the large spatial scales concerned; it is therefore vital that ecologists are able to use this correlation measure as effectively as possible. Here, our literature review suggests that corrections for the issue of underestimation in Pearson’s r should not be adopted if either the data deviate from bivariate normality or sample size is greater than around 30. Through our simulations, we then aim to offer advice to researchers in ecology on situations where both distributions can be described as normal, but sample sizes are lower than around 30. We found that none of the methods currently offered in the literature to correct the underestimation bias offer consistently reliable performance, and so we do not recommend that they be implemented when making inferences about the behaviour of a population from a sample. We also suggest that, when considering the importance of the bias towards underestimation in Pearson’s product moment correlation coefficient for biological conclusions, the likely extent of the bias should be discussed. Unless sample size is very small, the issue of sample bias is unlikely to call for substantial modification of study conclusions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s00442-018-4233-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6323088
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-63230882019-01-22 Underestimation of Pearson’s product moment correlation statistic Humphreys, Rosalind K. Puth, Marie-Therese Neuhäuser, Markus Ruxton, Graeme D. Oecologia Highlighted Student Research Pearson’s product moment correlation coefficient (more commonly Pearson’s r) tends to underestimate correlations that exist in the underlying population. This phenomenon is generally unappreciated in studies of ecology, although a range of corrections are suggested in the statistical literature. The use of Pearson’s r as the classical measure for correlation is widespread in ecology, where manipulative experiments are impractical across the large spatial scales concerned; it is therefore vital that ecologists are able to use this correlation measure as effectively as possible. Here, our literature review suggests that corrections for the issue of underestimation in Pearson’s r should not be adopted if either the data deviate from bivariate normality or sample size is greater than around 30. Through our simulations, we then aim to offer advice to researchers in ecology on situations where both distributions can be described as normal, but sample sizes are lower than around 30. We found that none of the methods currently offered in the literature to correct the underestimation bias offer consistently reliable performance, and so we do not recommend that they be implemented when making inferences about the behaviour of a population from a sample. We also suggest that, when considering the importance of the bias towards underestimation in Pearson’s product moment correlation coefficient for biological conclusions, the likely extent of the bias should be discussed. Unless sample size is very small, the issue of sample bias is unlikely to call for substantial modification of study conclusions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s00442-018-4233-0) contains supplementary material, which is available to authorized users. Springer Berlin Heidelberg 2018-07-30 2019 /pmc/articles/PMC6323088/ /pubmed/30062565 http://dx.doi.org/10.1007/s00442-018-4233-0 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Highlighted Student Research
Humphreys, Rosalind K.
Puth, Marie-Therese
Neuhäuser, Markus
Ruxton, Graeme D.
Underestimation of Pearson’s product moment correlation statistic
title Underestimation of Pearson’s product moment correlation statistic
title_full Underestimation of Pearson’s product moment correlation statistic
title_fullStr Underestimation of Pearson’s product moment correlation statistic
title_full_unstemmed Underestimation of Pearson’s product moment correlation statistic
title_short Underestimation of Pearson’s product moment correlation statistic
title_sort underestimation of pearson’s product moment correlation statistic
topic Highlighted Student Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6323088/
https://www.ncbi.nlm.nih.gov/pubmed/30062565
http://dx.doi.org/10.1007/s00442-018-4233-0
work_keys_str_mv AT humphreysrosalindk underestimationofpearsonsproductmomentcorrelationstatistic
AT puthmarietherese underestimationofpearsonsproductmomentcorrelationstatistic
AT neuhausermarkus underestimationofpearsonsproductmomentcorrelationstatistic
AT ruxtongraemed underestimationofpearsonsproductmomentcorrelationstatistic