Although it has been reported that each cue contributes to the perception of motion-in-depth, there has been significant controversy about relative contributions of these cues. Some studies have claimed that the contribution of CD is dominant and that of IOVD is very weak or nonexistent (e.g., Cumming & Parker,
1994; Nefs & Harris,
2010; Nefs, O'Hare, & Harris,
2010; Regan,
1993), while others have reported the opposite results (e.g., Brooks,
2002; Czuba, Rokers, Guillet, Huk, & Cormack,
2011; Sakano, Allison, & Howard,
2012; Sakano, Allison, Howard, & Sadr,
2006; Wardle & Alais,
2013). One reason why past studies seem to be at odds may be that the mechanisms processing the two cues have different temporal frequency tuning (Shioiri, Nakajima, Kakehi, & Yaguchi,
2008) and the studies used stimuli with different temporal characteristics. Another reason, not necessarily excluding the first one, might be that multiple stages of processing or multiple mechanisms with different characteristics could be involved in the perception of motion-in-depth, even for one cue, as is the case with the mechanisms for the perception of lateral motion (Culham, Verstraten, Ashida, & Cavanagh,
2000; Nishida & Ashida,
2000). In the latter case, if the researchers used different methods and/or stimuli, the results could differ depending on the mechanism stimulated. In the present study, we focused on possible mechanisms specialized to detect motion-in-depth based on binocular cues. Specialized mechanisms imply that motion-in-depth is processed by dedicated low-level detector networks rather than by a succession of static depth percepts derived from binocular disparity (i.e., tracking of position in depth; Patterson,
1999). In other words, they are mechanisms selective for the direction of motion-in-depth (i.e., tuned to motion-in-depth; Czuba et al.,
2011).