First, as mentioned, our null results cannot be explained by a
lack of power because we can detect significant differences up to
r2 = 0.07, which according to Cohen (
1988) is a medium effect size. Moreover, we would have a priori expected the tests to be highly correlated with
r2 much higher than 0.07. Second, a Bayesian analysis showed that the null hypothesis was more likely than the alternative hypothesis for each of the null findings. Third, our null results cannot be explained by high intraobserver variance because we found high and significant correlations in test–retest conditions in the range of
r2 = 0.42 to 0.68 (
Table 2). In addition, BM5 and BM25 are strongly correlated with an
r2 of 0.304. Fourth, a PCA showed that there is no common
multivariate factor behind the tasks. Fifth, null correlations can occur when unmeasured confounding variables correlate positively with one measured variable and negatively with another. It is in general impossible to rule out that there is a hidden cause behind variables. In our case, however, we would have expected the various vision tests to be directly correlated because of their basic nature. Sixth, zero correlations may occur when data are not linearly related. However, inspection of our data shows that with few exceptions (e.g., FrACT vs. vernier) the test pairs were bivariate-normally distributed and did not show any salient nonlinearities. Kolmogorov–Smirnov tests on the univariate distributions confirmed their normalcy (vernier:
p = 0.07; BM5:
p = 0.25; BM25:
p = 0.20; bisection:
p = 0.28; Gabor:
p = 0.79; FrACT:
p = 0.79). In addition, our results cannot be explained by outliers. Seventh, we have sufficient variance in our student data to avoid ceiling effects. The variance in our student sample is similar to other well-sighted populations. In the visual acuity test, the mean was 1.35 and
SD was 0.32 (
Table 1), which is comparable with a much larger sample of 817 student observers from our laboratory database (mean = 1.31,
SD = 0.34) and with 138 healthy participants from the general population who served as age- and education-matched controls in experiments researching schizophrenia (mean = 1.37,
SD = 0.36). Hence, our student sample can be considered to be representative of the population of normal, well-sighted or corrected-to-normal observers.