Figure 2 shows classification images for the first (blue) and last (red) learning sessions. The y-axis corresponds to the estimated relative weights an observer assigned to the stimulus elements. The x-axis corresponds to the elements' position away from the horizontal meridian. Bottom rows show individual profiles, fitted with an exponential function. The top row shows the grand average classification images, averaged across individuals. For the grand average classification images, we show both average decision weights (A) and normalized decision weights for comparison with the ideal observer (B). Panel C plots classification images for all five sessions and shows the progressive change of the perceptual template across sessions. All observers initially relied highly (in some cases, such as observers AM and JR, almost solely) on the most informative elements (highest SNR; SNR hot spot), which are the farthest from the meridian (top and bottom elements). Learning changed the perceptual template to also include elements distant from the top and bottom elements. Width of the fitted template grew on average 124%,
t(9) = 3.45,
p = 0.007. Amplitudes of the fits did not show systematic changes,
t(9) = −0.561,
p = 0.59. Average classification images show that observers' perceptual templates changed across sessions, expanding from the initial SNR hot spot toward the central meridian. Hotelling's
T2 test, a multivariate generalization of the univariate Student's
t, showed that the template change was significant,
T2(8, 9) = 3490,
p = 0.01. About five out of 10 observers showed significant (
p < 0.05) template changes at the individual level. Multiple univariate two-tailed
t-tests corrected for multiple comparisons using the false discovery rate correction (Benjamini & Hochberg,
1995) at
α = 0.05 show that, across sessions, the perceptual weights changed at locations close to the initial SNR hot spot (locations 4–7) while no significant change in weighting was found for the top and bottom element (location 8).
We further analyzed any potential effects of baseline tilt (signal strength) on the classification images (see
Appendix B). No significant differences were found when using a false discovery rate of
α = 0.05. Because the weights for the two mirror-symmetric parts of the stimulus (bottom and top halves) did not differ significantly,
T2(8, 9) = 14.29,
p = 0.845, they were averaged. Last, we compared classification images that were analyzed from trials with line baseline tilt toward the right versus left and found no significant differences due to left/right baseline tilt,
T2(8, 9) = 65.54,
p = 0.4, or toward center and periphery,
T2(8, 9) = 384.45,
p = 0.09 (see
Appendix B).