One goal of visual perception research is to characterize the relationship between visual experiences and the physical world. Mathematics and physics provide us with sophisticated tools to measure variables in the physical world, but we struggle to provide equally sophisticated tools to characterize the variables of visual experience.
The most widely used tool to assess people’s subjective experiences is still Fechner’s method of adjustment (
Koenderink, 2013), or simply matching. In matching, an observer adjusts the intensity of a test stimulus so that it looks identical to a given target stimulus. When the test stimulus varies only along a single dimension, the method can be likened to measuring the unknown length of a rod with a ruler.
The analogy is not exactly right, because matching procedures rely on a linking assumption whereby observers’ matches reflect the function that relates physical and visual magnitudes but do not reveal the shape of the function directly (see, e.g.,
Maertens and Shapley, 2013;
Wiebel et al., 2017).
Figure 1 illustrates the relationship between matches and internal scales. A target of a certain physical intensity
xT (i.e., luminance), evokes a response on the perceptual dimension of interest Ψ(
xT) (i.e., lightness). To perform a matching, the observer chooses a physical match intensity,
xM, which evokes a perceptual response, Ψ(
xM), that is as close as possible to the perceptual response to the target. The functions that relate Ψ(
x) and
x are known as perceptual scales, transducer functions (e.g.,
Kingdom and Prins, 2010), or transfer functions in lightness perception (
Adelson, 2000). It is evident from
Figure 1 that one and the same pattern of matching data (
Figure 1B) may be consistent with different combinations of internal response functions (
Figure 1A). Thus, matching data alone are insufficient to infer perceptual scales.
A more straightforward approach to measure perceptual scales are scaling procedures. A variety of scaling procedures has been developed in the history of psychophysical research, from Fechner’s integration of just noticeable differences (
jnds) to Stevens’s direct scaling techniques (for a review, see, e.g.,
Gescheider, 1997;
Marks and Gescheider, 2002), but their validity has been a topic of debate. For example, integrating
jnds is problematic, practically, because the error in each JND estimation propagates to the subsequent estimation, and theoretically, because the shapes of the estimated functions will differ as a function of the noise underlying the perceptual judgments (
Kingdom and Prins, 2010;
Kingdom, 2016). Stevens’s direct methods (e.g., magnitude estimation, ratio estimation) might be affected by the choice of the numerical categorization and hence are not guaranteed to provide a meaningful perceptual scale either (see, e.g.,
Treisman, 1964;
Krueger, 1989).
More recently,
Maloney and Yang (2003) presented a new type of psychophysical scaling method based on judgments of perceived differences, Maximum Likelihood Difference Scaling (MLDS). MLDS promises to reliably estimate perceptual scales and to be more robust when compared with other scaling methods (
Knoblauch and Maloney, 2008). The method uses a stochastic model of difference judgments, which allow maximum likelihood estimation of the underlying perceptual scale. Practically, an MLDS experiment can be executed with the “method of triads” or the “method of quadruples” (
Knoblauch and Maloney, 2012). In the method of triads, the observer is presented with three ordered stimuli and has to judge which of the two extremes is more different from the one in between, a procedure rather intuitive for the observer.
1 Using simulations, we showed that MLDS is able to recover different ground truth perceptual scales regardless of whether we assumed the underlying noise to be additive or multiplicative, that is, constant or proportionally increasing across the scale (
Aguilar et al., 2017).
MLDS is straightforward when the goal is to characterize a single perceptual scale. However, more often the goal is to characterize how the mapping between a physical and a perceptual variable changes when certain aspects of the viewing conditions are varied, that is, across viewing contexts. In matching, this has been identified as a problem in situations in which the context renders target and match so different that the best the observer can do is a minimum difference “match” (see, e.g.,
Logvinenko and Maloney, 2006;
Ekroll et al., 2004). MLDS avoids the problem of comparisons across contexts because all elements of a triad are always shown in one context. Perceptual scales are estimated from analogous triad comparisons in all contexts the experimenter is interested in. This raises the question of whether the scales measured in different contexts can be meaningfully compared. In this article, we evaluate whether MLDS allows for cross-context comparisons between perceptual scales. We also evaluate a second difference scaling procedure called Maximum Likelihood Conjoint Measurement (MLCM;
Knoblauch and Maloney, 2012). We will describe the details of the method below.