We relied on optokinetic nystagmus for inferring perception and for identifying the moments when it switches. To our knowledge, no existing study has used this approach in the context of structure-from-motion stimuli. Instead, OKN has been used for this purpose primarily in the context of binocular rivalry (
Fox et al., 1975;
Naber et al., 2011;
Frässle et al., 2014;
Brascamp et al., 2021), as well as for moving plaids (
Wilbertz et al., 2018) and transparent motion (
Watanabe, 1999). Our results indicate that OKN can be used in this way to study bistable structure-from-motion, as well, but the approach certainly has shortcomings. In particular, for most of our observers, the OKN method identifies a larger number of switches than the observer reports (i.e., the majority of our observers are positioned above the identity line in
Figure 1C, left), suggesting that the method spuriously marks some non-switch moments as switches. Aside from spuriously marked switches, an additional reason for the discrepancy could be that some percepts were too brief for manual reports, yet did cause OKN direction reversals (cf.,
Naber et al., 2011). This is also likely part of the explanation for why the magnitude of the computed pupil dilation response is so much smaller for task-relevant perceptual switches (
Figure 2A blue curve) than for task-relevant triangle inversions (
Figure 2B orange curve): for the former, the contribution of actual switch moments to the response estimate has been diluted by that of spuriously marked ones. Although our OKN-based algorithm is overall similar to gaze-based methods used for identifying perceptual switches elsewhere, our approach is relatively basic compared to some other approaches, and it is quite possible that the correspondence with manually reported switches can be improved by incorporating components of those other approaches. This includes, rather than using just the gaze displacement angle of the OKN slow phase, combining both slow-phase and fast-phase metrics in a perception classification algorithm (
Wilbertz et al., 2018), or using polynomial splines to interpolate missing data in the gaze displacement velocity trace (
Aleshin et al., 2019) rather than our basic linear interpolation method.