We deliberately did not provide explicit instructions to our participants about how to do the task. It is therefore possible that observers were using different strategies in different conditions, which might explain our pattern of results. To test for this possibility, we computed a measure of precision as the standard deviation of the fitted psychometric function for each participant in each experiment. If participants used the same strategy in all conditions, precision should be the same across conditions. We ran the same statistical tests on precision as those that were run on the PSEs. For experiment 1, we ran a repeated measures ANOVA on precision, which revealed significant differences across conditions (F(2,10) = 5.917, p = 0.020). Post hoc tests revealed that precision was higher for test arrays with medium circles than for test arrays of small circles (t(5) = 2.561, adjusted p = 0.057) and higher for test arrays with medium circles than test arrays with large element circles (t(5) = 3.512, adjusted p = 0.025). There was no significant difference between large and small element circles (t(5) = 0.708, adjusted p = 0.495). There are several reasons why reduced precision may have been observed when the elements in the reference and test arrays were different sizes (reference medium and test small or large). One possibility is that participants were using a mixed strategy for making their judgments. For example, on some trials using the centroids, and on other trials using the inner or outer edges. Across multiple trials, such a mixed strategy could on average yield equivalent results as simply using the centroid on every trial. Alternatively, it could be that the process of extracting and comparing the centroids for larger and/or smaller elements, or making comparison between different elements more generally, is simply more difficult than doing so when all elements are the same. These alternative hypotheses can be more specifically examined in experiments 2 to 5 in which the array elements all had the same size. For experiments 2 to 5, paired t-tests comparing precision between the two conditions in each experiment, revealed no significant differences across four experiments (smallest p = 0.077). This shows that, for the array size judgments our participants are doing, comparing array sizes of triangles of the same or different orientation does not influence precision, but does influence PSE, as demonstrated above.
In summary, although we have some evidence that strategy used by participants may vary among the conditions in experiment 1, we have no such evidence for any of the other experiments, and it is difficult to come up with a reasonable model for how differences in strategy would produce the pattern of results observed for the PSE. We conclude that our results are likely not driven by a difference in strategy, and that in the absence of explicit instruction, observers appear to adopt the centroid strategy.