Cargando…
Grouped circular data in biology: advice for effectively implementing statistical procedures
ABSTRACT: The most common statistical procedure with a sample of circular data is to test the null hypothesis that points are spread uniformly around the circle without a preferred direction. An array of tests for this has been developed. However, these tests were designed for continuously distribut...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7373216/ https://www.ncbi.nlm.nih.gov/pubmed/32728310 http://dx.doi.org/10.1007/s00265-020-02881-6 |
_version_ | 1783561459944390656 |
---|---|
author | Landler, Lukas Ruxton, Graeme D. Malkemper, E. Pascal |
author_facet | Landler, Lukas Ruxton, Graeme D. Malkemper, E. Pascal |
author_sort | Landler, Lukas |
collection | PubMed |
description | ABSTRACT: The most common statistical procedure with a sample of circular data is to test the null hypothesis that points are spread uniformly around the circle without a preferred direction. An array of tests for this has been developed. However, these tests were designed for continuously distributed data, whereas often (e.g. due to limited precision of measurement techniques) collected data is aggregated into a set of discrete values (e.g. rounded to the nearest degree). This disparity can cause an uncontrolled increase in type I error rate, an effect that is particularly problematic for tests that are based on the distribution of arc lengths between adjacent points (such as the Rao spacing test). Here, we demonstrate that an easy-to-apply modification can correct this problem, and we recommend this modification when using any test, other than the Rayleigh test, of circular uniformity on aggregated data. We provide R functions for this modification for several commonly used tests. In addition, we tested the power of a recently proposed test, the Gini test. However, we concluded that it lacks sufficient increase in power to replace any of the tests already in common use. In conclusion, using any of the standard circular tests (except the Rayleigh test) without modifications on rounded/aggregated data, especially with larger sample sizes, will increase the proportion of false-positive results—but we demonstrate that a simple and general modification avoids this problem. SIGNIFICANCE STATEMENT: Circular data are widespread across biological disciplines, e.g. in orientation studies or circadian rhythms. Often these data are rounded to the nearest 1–10 degrees. We have shown previously that this leads to an inflation of false-positive results when testing whether the data is significantly different from a random distribution using the Rao test. Here we present a modification that avoids this increase in false-positives for rounded data while retaining statistical power for a variety of tests. In sum, we provide comprehensive guidance on how best to test for departure from uniformity in non-continuous data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s00265-020-02881-6) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-7373216 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-73732162020-07-27 Grouped circular data in biology: advice for effectively implementing statistical procedures Landler, Lukas Ruxton, Graeme D. Malkemper, E. Pascal Behav Ecol Sociobiol Methods Papers ABSTRACT: The most common statistical procedure with a sample of circular data is to test the null hypothesis that points are spread uniformly around the circle without a preferred direction. An array of tests for this has been developed. However, these tests were designed for continuously distributed data, whereas often (e.g. due to limited precision of measurement techniques) collected data is aggregated into a set of discrete values (e.g. rounded to the nearest degree). This disparity can cause an uncontrolled increase in type I error rate, an effect that is particularly problematic for tests that are based on the distribution of arc lengths between adjacent points (such as the Rao spacing test). Here, we demonstrate that an easy-to-apply modification can correct this problem, and we recommend this modification when using any test, other than the Rayleigh test, of circular uniformity on aggregated data. We provide R functions for this modification for several commonly used tests. In addition, we tested the power of a recently proposed test, the Gini test. However, we concluded that it lacks sufficient increase in power to replace any of the tests already in common use. In conclusion, using any of the standard circular tests (except the Rayleigh test) without modifications on rounded/aggregated data, especially with larger sample sizes, will increase the proportion of false-positive results—but we demonstrate that a simple and general modification avoids this problem. SIGNIFICANCE STATEMENT: Circular data are widespread across biological disciplines, e.g. in orientation studies or circadian rhythms. Often these data are rounded to the nearest 1–10 degrees. We have shown previously that this leads to an inflation of false-positive results when testing whether the data is significantly different from a random distribution using the Rao test. Here we present a modification that avoids this increase in false-positives for rounded data while retaining statistical power for a variety of tests. In sum, we provide comprehensive guidance on how best to test for departure from uniformity in non-continuous data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s00265-020-02881-6) contains supplementary material, which is available to authorized users. Springer Berlin Heidelberg 2020-07-20 2020 /pmc/articles/PMC7373216/ /pubmed/32728310 http://dx.doi.org/10.1007/s00265-020-02881-6 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Methods Papers Landler, Lukas Ruxton, Graeme D. Malkemper, E. Pascal Grouped circular data in biology: advice for effectively implementing statistical procedures |
title | Grouped circular data in biology: advice for effectively implementing statistical procedures |
title_full | Grouped circular data in biology: advice for effectively implementing statistical procedures |
title_fullStr | Grouped circular data in biology: advice for effectively implementing statistical procedures |
title_full_unstemmed | Grouped circular data in biology: advice for effectively implementing statistical procedures |
title_short | Grouped circular data in biology: advice for effectively implementing statistical procedures |
title_sort | grouped circular data in biology: advice for effectively implementing statistical procedures |
topic | Methods Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7373216/ https://www.ncbi.nlm.nih.gov/pubmed/32728310 http://dx.doi.org/10.1007/s00265-020-02881-6 |
work_keys_str_mv | AT landlerlukas groupedcirculardatainbiologyadviceforeffectivelyimplementingstatisticalprocedures AT ruxtongraemed groupedcirculardatainbiologyadviceforeffectivelyimplementingstatisticalprocedures AT malkemperepascal groupedcirculardatainbiologyadviceforeffectivelyimplementingstatisticalprocedures |