The logic of metamerism as a criterion for testing models of scene perception (J. Freeman & Simoncelli,
2011) is that if a candidate model captures all the image features important for appearance under some condition, then two images that produce the same model output (i.e., the same values for all features the model encodes) should also appear identical to a human observer (
Figure 1). A candidate model can be used to predict which images should be metamers. This logic allows the simultaneous experimental testing of a large number of image-based cues (that is, all cues encoded by a given model), complementing approaches to natural scene perception in which at most a handful of cues are manipulated at once (e.g., Alam, Vilankar, Field, & Chandler,
2014; Bex,
2010; Bex, Mareschal, & Dakin,
2007; Bex, Solomon, & Dakin,
2009; Bradley, Abrams, & Geisler,
2014; Dorr & Bex,
2013; Haun & Peli,
2013; McDonald & Tadmor,
2006; Tadmor & Tolhurst,
1994; Thomson, Foster, & Summers,
2000; To, Gilchrist, Troscianko, & Tolhurst,
2011; Vilankar, Golden, Chandler, & Field,
2014; Vilankar, Vasu, & Chandler,
2011; Wallis & Bex,
2012; Wallis, Dorr, & Bex,
2015; F. Wichmann, Drewes, Rosas, Gegenfurtner,
2010; F. A. Wichmann, Braun, & Gegenfurtner,
2006). Because a potentially large set of features are available in natural scenes, and humans are very sensitive to many of them (Balas & Conlin,
2015; Emrith, Chantler, Green, Maloney, & Clarke,
2010; Gerhard, Wichmann, & Bethge,
2013; F. A. Wichmann et al.,
2006), demonstrating that human observers cannot discriminate modified (but model-matched) images from the original image is a strong test of the degree to which the transformations from retinal input to perception are captured by a candidate model. If images producing matching model responses also look the same, then any image properties not encoded by the model are perceptually unimportant.