Although second-order biological motion appears to be weaker than performance for first-order when both stimuli are presented at maximum contrast (
Experiment 1), we hypothesized that this was a result of differential visibility of the local motions in the stimulus, resulting in differential weighting at higher stages of the motion system, rather than any qualitative difference between the pathways or a preference for first-order input to the biological motion system. We directly tested this idea by combining both first- and second-order dots within the same stimulus (
Experiment 2). This technique has previously been used to show cue-invariance in global motion processing (Ledgeway et al.,
2002). We show in this paper that there are visibility-dependent interactions between the two pathways for biological motion, strongly supporting the idea of a cue-invariant biological motion system.
In addition, naive observers could discriminate the direction of both normal and scrambled second-order walkers, provided the stimuli were not inverted (
Experiment 3). This pattern of results is identical to that obtained for first-order motion and suggests that biological motion of either type is subject to an inversion effect that operates on the local motion of individual dots, particularly those of the feet (Troje & Westhoff,
2006).
Our results suggest that biological motion is carried out by cue-invariant mechanisms and that differences in performance reported between first- and second-order biological motion are a result of the differential sensitivity of the visual system to these two different types of local motion, rather than any difference in first- or second-order biological motion processing
per se. This is consistent with the level at which biological motion analysis is thought to be taking place in the visual system. Several imaging and rTMS studies have highlighted the importance of posterior superior temporal sulcus (pSTS) to biological motion perception (Grèzes et al.,
2001; Grossman et al.,
2000,
2005; Pelphrey et al.,
2005). This is an area that is likely to receive input from V5. One study has suggested a role for area VP/V3v (Servos et al.,
2002) an extrastriate area with connections to V5. Area V5 appears to be active during biological motion perception, but not preferentially. There is much evidence for cue-invariance in V5/MT (Albright,
1992; Stoner & Albright,
1992) and MST (Aaen-Stockdale, Ledgeway, & Hess,
2007; Geesaman & Andersen,
1996) so it would be surprising if biological-motion selective areas higher than this, such as pSTS, responded differentially to first- and second-order motion.
This leaves open the question of why exactly second-order local motion is weaker than first-order. The difference in sensitivity could result from the fact that second-order detectors respond to first-order motion in addition to second-order motion. This would mean that there are more mechanisms detecting first-order motion than second-order motion. First-order thresholds may therefore reflect a pooling of the responses from both first-order and second-order mechanisms and may be lower due to probability summation.
It has been suggested that the differences between first- and second-order processing could be the result of a delay in processing second-order motion, caused by the additional rectification necessary (Yo & Wilson,
1992). Other authors suggest that it is a result of more severe low-pass temporal filtering of second-order motion (Derrington, Badcock, & Henning,
1993). However, neither of these explanations fully explain the data. A more complete explanation may be that second-order motion mechanisms are simply less direction selective. Ledgeway and Hess (
2002) showed that second-order mechanisms have poorer direction selectivity than first-order mechanisms.
Our study does not really contribute to this debate, as we are dealing with processing at a global level, by which point it appears that first- and second-order signals have been combined. The direction-selectivity hypothesis may, however, explain the relative weight of motion signals from first- and second-order stimuli. It is entirely possible that by reducing the luminance of the first-order dots, we are smearing the temporal energy in the first-order dots to the point where discrimination of their motion direction is equivalent to that obtained for second-order dots.
Both first- and second-order stimuli show an inversion effect for detection and discrimination. The inversion effect was originally considered to be caused by a failure of configural processing (Dittrich,
1993; Proffitt & Bertenthal,
1990; Reed, Stone, Bozova, & Tanaka,
2003), but the fact that an inversion effect occurs with scrambled stimuli suggests that, in addition to global form inversion effects, there is a second inversion effect mediated by local motion trajectories, which reveals a mechanism tuned to the distinctive motion of the lower extremities of an animal in locomotion (Troje & Westhoff,
2006). Previous studies have used the presence of inversion effects to infer intact biological motion processing in clinical populations (Neri, Luu, & Levi,
2007; Thompson, Troje, Hansen, & Hess,
2008). Our data show the presence of both configural and local motion trajectory-based inversion effects for second-order biological motion, a pattern of results that is identical to that obtained with first-order stimuli. This suggests that the same mechanism is responsible for processing both types of stimulus.
Our results are consistent with, and substantially extend, the findings of Mather et al. (
1992). There are, however, inconsistencies between our results (especially
Experiment 1) and those of Ahlström et al. (
1997). While Ahlström et al. (
1997) argue that there is no difference in performance between first- and second-order, we have demonstrated that there is a difference, but this is dependent upon the relative visibility of the local motion signals. The difference between our data and theirs can be explained by differences in stimulus construction.
Our second-order stimuli are composed of dots within which the contrast of the noise background is increased. The comparable stimulus used by Ahlström et al. (
1997) (the “texture human”) was constructed from dots composed of patches of random noise that moved across a background noise field of the same average luminance. Whilst in any single frame, the dots were indistinguishable from the background, the movement of these regions across a background would still have generated first-order motion signals, making it difficult to separate the relative influences of the two systems.
A further consideration is the ceiling effect evident in the data presented by Ahlström et al. (
1997). The
d′ scores reported for most conditions were ∼3, which signifies near-perfect performance (Macmillan & Creelman,
1991). This high level of performance may have been due to the absence of masking dots, the use of stimuli in which actors were performing highly informative actions and long presentation durations (>1 sec).
In the present study, we demonstrated that the differences between first- and second-order motion evident in previous studies, and replicated here (
Experiment 1), can be explained by differential visibility of first- and second-order motion signals to an otherwise cue-invariant biological motion mechanism. When the visibility of local motion signals are equated, the two pathways interact predictably.