This study aimed to clarify whether superior estimation of arrival time (
Experiment 1) and the performance of SPEM (
Experiment 2) for a descending object require directionally congruent information between the object's motion and the observer's leg side. In
Experiment 1, the arrival time estimation at the goal for the downward motion of 1 G and 0 G targets exhibited higher accuracy than that for the upward motion of 1 G and 0 G targets in the upright and supine postures. In
Experiment 2, the performance of SPEM for the downward motion of 1 G and 0 G targets was more accurate than that for the upward motion in both postures. These results suggest that congruence between the direction of target motion detected by visual cues and the direction of the observer's leg side along the frontoparallel plane is required to perform accurate arrival time estimation and SPEM for objects in downward motion. In other words, the direction of gravity from vestibular cues does not assist in estimating the arrival time or the performance of SPEM for objects in downward motion. These findings represent the first investigation to demonstrate that vertical asymmetry in the arrival time estimation and in SPEM performance depends not on the gravity axis, but on the egocentric vertical (observer's longitudinal body) axis.
Both arrival time estimation and SPEMs demonstrated superior performance under 1 G and 0 G conditions when the target moved downward, as compared with when it moved upward (
Figures 2 and
5). These results suggest that the superior arrival time estimation for the downward motion of targets under 1 G and 0 G in
Experiment 1 may be induced by the better performance of downward SPEM. Better SPEM enhances visual clarity, particularly when tracking moving objects (
Fooken, Yeo, Pai, & Spering, 2016). Additionally, previous studies have demonstrated that interception performance for a moving object is more accurate when participants actively track the object (
Spering et al., 2011). Thus, the vertical asymmetry in SPEM performance may be a major factor in inducing the vertical asymmetry in arrival time estimation, TTC estimation, or interception performance for the upward and downward motion of objects.
Previous studies noted that the response to TTC for an approaching object was observed before the object reached the contact area (
Benguigui, Ripoll, & Broderick, 2003;
Mcleod & Ross, 1983;
Schiff & Oldak, 1990;
Neuhoff, 2001). In contrast, our result from
Experiment 1 under 1 G and 0 G conditions showed that participants responded after a target arrived at the goal. The difference between the current study and previous studies may be attributed to whether the target motion is approaching, in conjunction with the requirements of SPEM and its performance. The early response (underestimation) of TTC for an approaching object has been interpreted by the “margin of safety theory” as an adaptive response that allows the observer to have enough time to engage in an appropriate response to the approaching object (
Neuhoff, 1998,
Neuhoff, 2001;
Vagnoni, Lingard, Munro, & Longo, 2020). This underestimation is advantageous for survival because a response for an approaching object that is too late is far more dangerous than responding too early (
Haselton & Nettle, 2006;
Vagnoni, et al., 2020). In our study, the target was presented along the frontoparallel plane of a participant, suggesting that the “margin of safety theory” did not contribute to the arrival time estimation. In contrast, SPEM performance during tracking might be a cause of delayed responses. The mean PDs in
Experiment 2 were negative, indicating that SPEM performance was insufficient and the participant's gaze did not catch up to the moving target (
Figures 4 and
5). A similar situation must have occurred in
Experiment 1, where the target velocities under the 1 G and 0 G conditions near the goal were faster than those in
Experiment 2. Thus, the insufficient SPEM performance may have caused the late key response under the 1 G and 0 G conditions in
Experiment 1. In the case of the looming target motion to which observers do not have to make SPEM to track the object, it is conceivable that the eyes were on the target. Taken together, underperformance of SPEM, which potentially causes faster motion perception under 1 G and 0 G conditions of the current experiment, may be the major factor in late responses.
The results of
Experiments 1 and
2 suggest no impact of vestibular gravitational information on the performance of arrival time estimation and SPEM for the downward motion of a target. However, several previous studies have indicated that vestibular information can be used for TTC estimation (
Le Séac'h et al., 2010;
Senot et al., 2005;
Zago et al., 2011). The experimental setup used in these studies differs from that used in the present study, including whether the vestibular information is used for TTC estimation or not. In previous studies, the participants looked up to see the downward motion of a target and looked down to see its upward motion, with the target approaching their faces. The relative position of the approaching target on the observer's retinal fovea was the same for both upward and downward motion. Moreover, the participants could not visually discriminate between the upward and downward motion. In contrast, vestibular signals are the information available to differentiate between upward and downward motion because they are different when looking up to see the downward motion of the target and down to see the upward motion of the target. In our study, the upward and downward motion of targets were presented in the participant's frontoparallel plane. Because the participants did not move their heads to see both targets, the vestibular signals were the same when they saw both targets. Therefore, they could not discriminate the target direction from the vestibular information. In contrast, visual information indicated that the targets moved toward the observer's head direction for the upward motion and toward the observer's leg direction for the downward motion. Participants in our study might not have relied on vestibular cues, but instead used visual cues to estimate the arrival time for upward and downward motion of targets.
Previous studies have suggested that vestibular inputs influence representational gravity (
de sá Teixeira & Hecht, 2014;
Nagai et al., 2002). These previous studies demonstrated that representational gravity was observed in the upright posture (i.e., gravity-congruent condition) in some conditions, but not in the other posture. Although our study parallels some of the methods and designs in previous studies on representational gravity, there are critical differences in the method. Participants in our study estimated the arrival time at the goal or tracked the moving target, whereas participants judged the final (vanishing) location for the moving target in the previous study. These methodological differences may produce divergent results.
Some previous studies have considered that the body's longitudinal axis seems to be the main vertical reference in the representational gravity literature (
de sá Teixeira et al., 2017).
de sá Teixeira et al. (2017) have shown that people perceive the final position of a downward motion of a target as being farther forward than the actual position of the object compared with an upward motion of a target. This vertical asymmetry in the final positional judgment of the upward and downward motion of targets depends on an egocentric vertical axis (
de sá Teixeira et al., 2017). Our study also observed that the vertical asymmetry in arrival time estimation and SPEM performance for upward and downward motion depends on the egocentric vertical axis. These results suggest that people might perceive the direction of ascending or descending motion based on their longitudinal body's axis and visual cues.
It is known that people perceive vertical direction and body orientation by multisensory inputs from visual, vestibular, and somatosensory organs (
Harris, Herpers, Hofhammer, & Jenkin, 2014;
Kersten, Mamassian, & Yuille, 2004;
Miwa et al., 2019). In multisensory inputs, sensory information is weighted depending on the reliability of each piece of information (
Harris et al., 2014;
Kersten et al., 2004;
Miwa et al., 2019). Our results on the arrival time estimation and SPEM performance for each target's motion did not show differences between upright and supine postures. This finding suggests that, under the current experimental conditions where tasks require visual information processing, the weight of visual information may be greater than that of vestibular information. Humans may learn that the visual information of a target moving in the direction of the observer's head or leg is sufficient sensory input to perceive the motion of ascending and descending objects through everyday life.
One study reported that people begin to learn the features of descending object motion in early childhood (
Kim & Spelke, 1992). This finding supports the idea that perceptual learning, rather than vestibular information, contributes to understanding the directions of a moving object throughout one's lifetime.
de sá Teixeira (2014) reported that the human reference frame for ascension and descension is the body-referenced downward direction (
Hubbard, 2020). Therefore, in a zero gravity environment, if people perceive an object's motion toward the leg direction, they may perceive it as the motion of a descending object.