Perceptually inferring what is behind occlusion is called amodal completion (Kanizsa,
1979). The representation of a partially occluded shape, presumably the result of amodal completion, was investigated by Sekuler and Palmer (
1992) in a priming study. A partially occluded shape, such as a
pie chart (or a pacman) that is perceived as a corner occluding a disk, was first presented. Two shapes were then presented, and observers were asked whether they were the same or different from each other. Discrimination speed was found to be approximately equal when the two shapes were both pacmen and when they were both disks. The authors suggested that the representation of a pacman was its perceived counterpart, the disk, which facilitated subsequent comparison between two disks. However, given that the test sequence was repeated, there was a confound: the pacman may have been simply associated in memory with the disks that followed, independent of any perceptual completion. Note also that the discrimination speed of the two disks was equal to, but not greater than, that of the two pacmen. In the present study, each trial was shown only once, thereby avoiding the confound. More accurate, rather than equally fast, recognition was sought in the present study when the study and test items were different, relative to when they were the same.
Nakayama, Shimojo, and Silverman (
1989) studied the role of amodal completion in face recognition by either allowing or disabling amodal completion (see also He & Nakayama,
1992; Kellman & Shipley,
1991). These researchers stereoscopically switched the relative depth between a face image and a curtain blind. Face recognition deteriorated when a fragmented face was perceived in front of a wall, as compared to when an amodally completed face was perceived behind curtain blind. In the present study, because of the presumed strong perceptual completion of faces, faces were also chosen as the initial class of experimental objects.
The boundary extension effect discovered by Intraub and Richardson (
1989) can also be considered as amodal completion of a photo's boundary. What is unique about this effect is that the photo of a natural scene needs to be a close-up view. For instance, after seeing a close-up photo of a plate of spaghetti, an observer's drawing from memory recall expanded the boundary of the original photo, as if the mind's eye view zoomed out. In terms of signal detection theory (Green & Swets,
1974), however, bias and sensitivity were not teased apart. Accordingly, it remains unresolved whether this effect may be entirely due to observers' preference for a farther view when a photo appears too close up. In the present study, the issue of sensitivity versus bias would be specifically addressed.
An observer's ability to amodally complete was quantified by Kersten (
1987) using natural images, including faces. A grayscale image was partially occluded by randomly positioned large pixels that were easily distinguishable from the natural image behind. Observers repeatedly estimated the grayscale value of an occluded pixel with feedback, until they were correct. The number of guesses was then used to quantify an observer's ability to predict an occluded pixel value from the rest of the image. It was found that, when occlusion was sparse (1%), the value of an occluded pixel was well predicted by its nearest neighbors. The prediction was robust regardless of whether the mean or median was computed. The model in the present paper was inspired by the Kersten study, in that the value of an occluded pixel was also estimated by the model using the nearest unoccluded pixels.
The overall aim of the current study, that is, determining whether perceptual abstraction via amodal completion may better characterize the internal representation of a seen image than the image itself, can be traced back to the classic study of Posner and Keele (
1970). There, participants were trained to categorize random-dot configurations, each of which was created by randomly perturbing the dot positions from a predetermined configuration, termed a prototype. During training, the prototypes were never shown. Participants were tested immediately after training and one week afterwards. Classification errors for the trained exemplars were found to increase in one week's time (.14 to .39), whereas those for the prototypes changed little (.35 to .38). Posner and Keele suggested that the representations of the categories were not simply the trained exemplars; rather, the average of the trained exemplars, or the prototypes, also seemed represented, and in a more stable manner. However, as indicated by the numerical error rates, the never-seen prototypes were never categorized more accurately than the trained exemplars. In the present study, we sought to find conditions under which unseen stimuli may be recognized more accurately than the studied stimuli.
In a companion paper to the present study, Lu and Liu (
2008) designed old-new rating experiments to study the effect of occlusion on recognition memory. In contrast to the same-different matching task in the present study, a standard technique in memory research was used to study longer-term internal representations. Participants first rated the attractiveness of a face or natural scene, and then rated how likely a scene had been seen. In comparison to the present study's red pixel occluders, red rectangles were used. Lu and Liu (
2008) demonstrated that an “old” face or scene whose image had not been seen but less occluded was more accurately recognized as “old” than a more occluded image, identical to that previously seen.