The intersections where the grid lines terminate on the target contour would stimulate “end-stopped” cells (
Hubel & Wiesel, 1965) that have an inhibitory zone at one end of their oriented receptive fields. These cells respond to oriented contours as long as the contour does not extend into their inhibitory zone.
Pack, Livingstone, Duffy, and Born (2003) demonstrated that many end-stopped V1 neurons responded to the motion direction of the contour endpoint independently of the contour's orientation. In MT,
Pack, Gartland, and Born (2004) found that cells were more strongly influenced by motion signals from terminators, more so than by the ambiguous signals from the contours themselves, although this dominance took some time (60 ms) to appear (
Pack & Born, 2001). The end-stopped motion selective cells can respond to corners and other two-dimensional features such as the intersections of superimposed gratings (e.g,
Movshon & Newsome, 1996) producing pattern, rather than component directional responses. These responses may disambiguate an object's motion when the 2D features are intrinsic to the object (
Shimojo et al., 1989). However, when they are extrinsic terminators, such as the points of intersection between the grid lines and the target contours in the furrow stimulus—they do not belong to the object and are instead accidents of occlusion that should be ignored—but are not. Indeed, the MT neurons recorded by
Pack et al. (2004) did responded to terminators that were intrinsic to a moving object more than to those that were extrinsic, accidents of occlusion. However,
Pack et al. (2004) were recording mostly from receptive fields in central vision where the furrow illusion is eliminated. Therefore one plausible explanation for the furrow illusion is that in the periphery, the parsing of motion signals arising from intrinsic versus extrinsic features does not occur. The two motion signals are then simply averaged to produce an illusory direction for the target (
Shapiro et al., 2010;
Kwon, Tadin, & Knill, 2015). We are currently exploring other stimuli to test this assumption of the breakdown of parsing in the periphery and for brief presentations as well (less than 60 ms,
Pack & Born, 2001).