We used the technique of maximum likelihood difference scaling (MLDS) (Maloney & Yang,
2003) to estimate the evolution of perceived gloss as a function of the 10 samples of our gloss series. MLDS has been demonstrated to be a robust and reliable technique for estimating underlying perceptual scales. For example, it has been successfully applied in quantifying color differences along a line in tristimulus space (Maloney & Yang,
2003) and also for quantifying the perceived distortion of an image as a function of compression (Knoblauch, Charrier, Cherifi, Yang, & Maloney,
1998).
In this procedure, an ordered sequence of four surfaces, i, j, k, and l, is sampled from the full set. These are presented to the observer as two pairs, (i, j), (k, l), one pair chosen randomly to be placed above the other. The observer’s task is to select the pair whose elements display the greater difference in appearance. If the pair (i, j) is selected, the quadruplet is assigned the value R = 0, otherwise R = 1. With a collection of N stimuli, it is possible to present N!/((4)!(N − 4)!) paired-comparisons. For example, for a collection of 10 samples, 210 non-overlapping quadruplets can be formed.
It is assumed that each of the 4 stimuli,
i,
j,
k, and
l, generate in the observer a response indicated as
ψi,
ψj,
ψk, and
ψl, respectively. These perceptual values are unknown, but it is supposed that they satisfy
if and only if the pair (
i,
j) is judged to display a greater difference between its elements than the pair (
k,
l).
To estimate the underlying perceptual scale, it is assumed that the observer bases his judgments on a decision variable, Δ, computed from the underlying sensory responses to each of the physical samples as
When Δ > 0, the observer selects the pair (
i,
j), otherwise the other pair. The MLDS procedure permits the estimation of a perceptual scale that predicts the relative magnitudes of differences between pairs. With
ψ0 and
ψ9 fixed at values of 0 and 1, respectively, the values
ψi,
i = 1 – 8 are estimated by maximizing the likelihood,
where Φ is the cumulative normal distribution function,
q =
ijkl and
s is the standard deviation of the observer’s judgments. Including the value of
s, 9 parameters in total are estimated based on the 210 judgments. In practice, the logarithm of the likelihood is computed and its negative minimized. All calculations were performed in the Matlab computing environment.
The log likelihood was subsequently used to test differences between the estimated scales for different conditions using a nested hypothesis test (Hoel,
1984). In short, the log likelihoods were compared under two hypotheses, that a single perceptual scale sufficed to describe both conditions (9 parameters) or that a different perceptual scale was necessary for each condition (
m × 9 parameters, where
m is the number of conditions). The test can be described as
where
li is the log likelihood under the hypothesis of a single perceptual scale,
i = 0, or multiple perceptual scales,
i = 1, and the difference is distributed as
χ2 with 9 (
m − 1) deg of freedom.