Abstract
In everyday scenes, objects rarely appear in isolation. Clustered objects in a scene are less readily identified than objects in isolation. We form a summary representation of the group as a whole. What properties characterize this summary? Likely candidates are statistical descriptors including the mean, the range and the variance. Understanding how these statistics are extracted is one goal for the study of perception. In a series of experiments, we investigated the accuracy of judgments of the mean size of circles. First, participants adjusted a test circle to the estimated mean size of two target circles. Judgments of the perceived mean size followed a power function with an exponent of 0.76, consistent with results obtained by Teghtsoonian (1965). We used this psychological estimate of the mean in the remaining experiments. Next we measured thresholds for discriminating the mean size of a heterogeneous array, and compared them with thresholds for discriminating the size of elements in a homogeneous array, and the size of a single element. We found little difference between these three measures, suggesting that extracting the mean size might be an automatic process. We varied the exposure duration, and found much less Improvement with duration in thresholds for the mean size than in those for single elements or homogeneous displays. There may be internal noise in the averaging process that sets a ceiling on the improvement that is possible. In another experiment, we varied the distributions of sizes (normal, uniform, two peaks, and homogeneous). Thresholds were only slightly higher for comparing the means across two different distributions compared to within the same distribution, confirming that subjects were indeed averaging sizes. Finally, we investigated how many elements participants used in averaging by running a simulation, randomly sampling different numbers of elements. We found that samples of six elements gave the closest match to human performance.