Several parameters of the model, however, do end up being very different for various types of visual stimuli. On one hand, fitting the responses to disparity versus motion stimuli requires the use of quite dissimilar SF-dependent weight functions. The idea that the contributions of different components are weighted is not new (e.g.,
Meso, Gekas, Mamassian, & Masson, 2022;
Miura, Inaba, Aoki, & Kawano, 2014a). For disparity stimuli, the weights are a power function of SF (
Equation 6) with a negative power coefficient; that is, the weights of the components become smaller as the component SF goes up (
Figures 6E and
6F; see also figures 6C and 6D in
Sheliga et al., 2022). For motion stimuli, the weights are an inverted cumulative log-Gaussian function (for broadband stimuli) or the product of a power function and an inverted cumulative log-Gaussian function (for two or three combined sine-wave gratings). The coefficient of the power function, however, is positive; that is, the weights of the components remain the same or grow as the component SFs go up from low to intermediate values, followed by a steep decline when the component SFs increase further (
Figures 5C,
5F, and
5I;
Figure 6D;
Figures 7D to
7F;
Figure 8B; and
Figure 9C). As Quaia and colleagues noted, a reason underlying certain differences between disparity and motion processing might be “ecological: motion and disparity signals have different patterns of occurrence in the environment” (
Quaia, FitzGibbon, Optican, & Cumming, 2019). On the other hand, fitting the responses to the motion of two or three combined gratings versus broadband stimuli had its share of dissimilar best-fit parameters. The differences in the shape of the SF-dependent weight functions were just outlined above. Also, TF-based weight corrections were crucial when fitting gratings data but were not necessary when fitting broadband data. Finally, for both OFRs and DVRs, power summation coefficients (free parameter
m) were usually higher than 1 for gratings (median, 1.2; range, 1–1.5) and lower than the 1 for white noise stimuli (median, 0.78; range, 0.66–0.99). These grating and white noise discrepancies in the parameters of our model may be due to the fact that the model is descriptive: it operates with OFRs and DVRs to Fourier components of the stimuli, whereas the real-life primary cortical disparity and motion detectors are bandpass. The SF tuning of neurons in striate cortex of primates (
De Valois, Albrecht, & Thorell, 1982;
Schiller, Finlay, & Volman, 1976) and cats (
Kulikowski & Bishop, 1981;
Movshon, Thompson, & Tolhurst, 1978) was best described by a fixed bandwidth on a log scale, the median being around 1.4 octaves, decreasing slightly for detectors having higher central SFs (
De Valois et al., 1982). Thus, a single detector could be sensitive to a few (or many) Fourier components. When using broadband stimuli, many detectors are concurrently activated whose bandwidths may overlap or not. When using gratings, on the other hand, we usually chose SFs that were far apart; thus, such stimuli activated detectors that had little or no bandwidth overlap. We still have limited knowledge regarding the interactions between different detectors, but they potentially might be the reason for observed variations in best-fit parameters of our model for gratings versus broadband stimuli. We also know that normalization occurs at different stages of processing (retina, lateral geniculate nucleus, V1, MT, etc.). Because of the nonlinearities, cascading multiple normalization stages can give rise to peculiar overall functions, and even relatively small differences at a single stage might result in large differences downstream (for a discussion of these issues, see
Quaia et al., 2017b;
Quaia et al., 2018).