TY - JOUR
T1 - Grouped circular data in biology
T2 - advice for effectively implementing statistical procedures
AU - Landler, Lukas
AU - Ruxton, Graeme D.
AU - Malkemper, E. Pascal
N1 - Open access funding provided by Austrian Science Fund (FWF). LL was partially funded by the Austrian Science Fund (FWF, Grant Number: P32586).
PY - 2020/7/20
Y1 - 2020/7/20
N2 - The most common statistical procedure with a sample of circular data is
to test the null hypothesis that points are spread uniformly around the
circle without a preferred direction. An array of tests for this has
been developed. However, these tests were designed for continuously
distributed data, whereas often (e.g. due to limited precision of
measurement techniques) collected data is aggregated into a set of
discrete values (e.g. rounded to the nearest degree). This disparity can
cause an uncontrolled increase in type I error rate, an effect that is
particularly problematic for tests that are based on the distribution of
arc lengths between adjacent points (such as the Rao spacing test).
Here, we demonstrate that an easy-to-apply modification can correct this
problem, and we recommend this modification when using any test, other
than the Rayleigh test, of circular uniformity on aggregated data. We
provide R functions for this modification for several commonly
used tests. In addition, we tested the power of a recently proposed
test, the Gini test. However, we concluded that it lacks sufficient
increase in power to replace any of the tests already in common use. In
conclusion, using any of the standard circular tests (except the
Rayleigh test) without modifications on rounded/aggregated data,
especially with larger sample sizes, will increase the proportion of
false-positive results—but we demonstrate that a simple and general
modification avoids this problem.
AB - The most common statistical procedure with a sample of circular data is
to test the null hypothesis that points are spread uniformly around the
circle without a preferred direction. An array of tests for this has
been developed. However, these tests were designed for continuously
distributed data, whereas often (e.g. due to limited precision of
measurement techniques) collected data is aggregated into a set of
discrete values (e.g. rounded to the nearest degree). This disparity can
cause an uncontrolled increase in type I error rate, an effect that is
particularly problematic for tests that are based on the distribution of
arc lengths between adjacent points (such as the Rao spacing test).
Here, we demonstrate that an easy-to-apply modification can correct this
problem, and we recommend this modification when using any test, other
than the Rayleigh test, of circular uniformity on aggregated data. We
provide R functions for this modification for several commonly
used tests. In addition, we tested the power of a recently proposed
test, the Gini test. However, we concluded that it lacks sufficient
increase in power to replace any of the tests already in common use. In
conclusion, using any of the standard circular tests (except the
Rayleigh test) without modifications on rounded/aggregated data,
especially with larger sample sizes, will increase the proportion of
false-positive results—but we demonstrate that a simple and general
modification avoids this problem.
KW - Gini test
KW - Hermans-Rasson test
KW - Rao’s spacing test
KW - Rayleigh test
KW - Rounding error
KW - Type I error
U2 - 10.1007/s00265-020-02881-6
DO - 10.1007/s00265-020-02881-6
M3 - Article
AN - SCOPUS:85088256830
SN - 0340-5443
VL - 74
JO - Behavioral Ecology and Sociobiology
JF - Behavioral Ecology and Sociobiology
IS - 8
M1 - 100
ER -