The target was a 4-c/° horizontal sine wave, windowed with a radial raised-cosine function whose diameter was 0.8° at 120 pixels/°. The background was a square patch of white noise with a width of 3°. The target was presented at the center of the display together with the white-noise background. The stimuli and the experimental conditions were optimized for the psychophysical experiment. First, the white-noise background consisted of 4 × 4-pixel squares (2 × 2 minute), whose gray level was independently sampled from a Gaussian distribution.
1 Using a virtual pixel size of 2 × 2 minute approximately matched the noise energy to the optics of the eye, thus allowing the noise to have a larger effect on performance. Second, there were only eight levels of target amplitude and background contrast instead of the 50 levels used in the simulations. We confirmed that the computational results extend to this sparser sampling. The eight logarithmically spaced levels of target amplitudes were 4, 5, 6, 8, 10, 12, 15, and 19 gray levels (with a mean gray level of 128, the corresponding RMS contrasts were 0.9%, 1.1%, 1.3%, 1.7%, 2.1%, 2.6%, and 4.0%). The eight levels of background gray levels were 5.12, 8.96, 12.80, 16.64, 20.48, 24.32, 28.16, and 32.00 (corresponding RMS background contrasts were 4%, 7%, 10%, 13%, 16%, 19%, 22%, and 25%) (
Figure 5A).
We also conducted the same experiment with natural-scene backgrounds.
Sebastian et al. (2017) binned millions of natural-image patches along three dimensions: mean luminance, RMS contrast, and amplitude-spectrum similarity to the target, all computed from the target region. On average, every natural background in a bin has almost the same luminance, contrast, and similarity to the target (with minor variations). We created RMS background contrast uncertainty by sampling across eight bins, each with a different mean RMS contrast. However, the luminance and similarity were held at their median value (sixth luminance bin and fifth similarity bin out of 10 bins along each dimension). The backgrounds had a mean gray level of 59. The standard deviations of the gray level of bins in the experiment were 3.25, 3.91, 6.83, 8.22, 9.90, 11.92, 14.35, and 17.28 (the corresponding RMS contrasts were 6%, 7%, 12%, 14%, 17%, 20%, 24%, and 29%). Target amplitudes were adjusted so that the average accuracy was approximately 75% correct. The amplitudes were 2, 3, 4, 5, 7, 9, 13, and 17 gray levels (corresponding RMS contrasts were 0.9%, 1.4%, 1.8%, 2.3%, 3.2%, 4.1%, and 7.8%). Each background was sampled from appropriate bins without replacement, so participants never saw the same background twice. We made sure that the target with the smallest amplitude contained at least four different discrete gray levels and that the discretization has negligible effects on model observers’ discriminability.
The stimuli were presented at a distance of 171 cm with a resolution of 120 pixels/°. In the white-noise experiment, the mean luminance of the background was 44.87 cd/m
2. For natural backgrounds, the mean luminance was 20.7 cd/m
2. Images were gamma corrected based on the calibration of the display device (GDM-FW900; Sony, Tokyo, Japan) and quantized to 256 gray levels. The screen refresh rate was 85 hertz. All experiments and analyses were done using custom code written in MATLAB (MathWorks, Natick MA), using the Psychophysics Toolbox (
Brainard, 1997;
Pelli, 1997).
Four experienced observers completed the experiment with the white-noise backgrounds, and three of them completed the experiment with natural-scene backgrounds. They had normal (corrected) spatial acuity. Written, informed consent was obtained for all observers in accordance with The University of Texas at Austin Institutional Review Board.
The procedures in both experiments were the same. Participants were asked to press a key to deliver their decision about whether the target was present or absent in each trial. Each trial started with a dim fixation point at the center, presented for 400 ms, followed by a 100-ms blank window. The stimulus was presented for 250 ms (
Figure 5C). Participants were given 1 second to indicate their decision. If a participant failed to respond in this time window, the trial was excluded from the analysis. The number of excluded trials was less than 1% of the total trials. The subsequent trial began after an additional 500 ms. The stimulus presentation time was selected to match the average human fixation duration during a visual search. A high-frequency feedback tone was provided if the participant's response was correct, and a low-frequency if the response was incorrect. On each trial, the target amplitude and the background contrast were sampled randomly from the 64 possible combinations (see
Figure 5A); however, the sampling was counterbalanced so each condition had the same number of trials.
In each block of the experiment, the prior probability of the target was either 0.5 or 0.2, and participants were informed before starting the block. Every participant completed three blocks for each target prior condition (one participant only joined the white-noise experiment and completed two blocks). The order of the blocks was randomized. In a single block, there was a total of 640 trials. When the target prior probability was 0.5, the number of trials for each unique combination of contrast and amplitude level (total of 64 conditions in which the target was present) was 5. When the target prior probability was 0.2, it was 2.