In any case, to interpret our findings that common-onset masking altered not the discriminability but the suprathreshold appearance of the target, it is necessary to extend the notion of object updating to a temporally evolving process of suprathreshold appearance as follows. Here we tentatively focus on the representational dynamics at V1, the first stage of explicit orientation representations. First, the direct responses to the target onset initiate the formation of a representation of target orientation faithful to the light distribution on the retina. If the target orientation is vertical, for example, a population of orientation-selective V1 neurons initially represents the vertical orientation. This formation process is not disrupted by backward masking (see
Enns, 2004). Next, since the initial internal representation is subliminal due to its weakness and instability, the signal-to-noise ratio of the target must be increased by some time-consuming process, perhaps involved in reentrant activities between V1 and higher-order visual areas, as assumed in the object substitution theory (
Di Lollo et al., 2000; see also
Lamme & Roelfsema, 2000). Parallel to this, contextual modulation from inducers coming up later causes the internal representation to evolve slowly and be repelled from the inducer orientation. The divisive normalization model (
Goddard et al., 2008;
Schwartz et al., 2009) dictates that, if the target orientation is vertical and the inducer orientation is 20° (clockwise from the vertical), for example, the responses of V1 neurons that prefer the target location and 20° orientation are selectively normalized within a certain period; this tuned normalization process renders the orientation represented at a population level tilted counterclockwise from the vertical. The responses of V1 neurons are also known to be sluggishly modulated by contextual stimuli outside their classical receptive fields through feedback from higher-order visual areas (see
Lamme, 1995;
Zipser et al., 1996). The former signal-enhancement process involved in discriminability evolution is completed earlier because of two possible reasons: the discrimination of target orientation may be performed without a time-consuming feature integration process (see
Bouvier & Treisman, 2010); and object individuation between the target and mask may be easy and fast because of their dissimilarity (see
Goodhew et al., 2015). In any case, our stimulus configuration, including the use of a Gabor target that could be discriminable only based on the orientation information and looked dissimilar to the four-dot mask, might contribute to the fast evolution of discriminability. Conversely, if a target was more complex and looked more similar to a mask, it might take more time to render a visual content discriminable and thus become affected more severely by the mask. Here we assume that discriminability evolution takes less time than appearance evolution involving the latter contextual modulation process. In the simultaneous-offset condition, both the discriminability and appearance evolution are completed because there is no interruption by backward masking; thus, the maximal repulsion is reported. In contrast, in the delayed-offset condition, the establishment of object correspondence between the target and mask triggers the process of object updating (see e.g.,
Goodhew, 2017). Object updating compulsorily terminates appearance evolution, rendering contextual modulation weaker. If backward masking interrupts the reentrant activities between V1 and higher-order visual areas (
Boehler, Schoenfeld, Heinze, & Hopf, 2008;
Di Lollo et al., 2000;
Lamme et al., 2002), the termination of evolution might occur in visual areas as early as V1. However, the target is sufficiently discriminable because discriminability evolution is completed prior to object updating. In other words, the target orientation has already been represented as a preconscious form beyond a subliminal form by the time the updating occurs. Consequently, the premature representation is consciously accessed; thus, reduced repulsion is reported. We should note that
Clifford and Harris (2005) reported a superficially related finding that orientation repulsion was reduced when not a target, but an inducer was masked and gone unnoticed. Despite their intriguing claim that orientation repulsion involves a multilevel processing hierarchy, their manipulation of masking did not reveal a temporally evolving process underlying repulsion and masking, whereas we did for the first time by masking the target.