Abstract
Meanings and qualities are fundamental attributes of visual awareness. We propose “eidolons” as a tool for establishing equivalence classes of appearance along meaningful dimensions. The “eidolon factory” is an algorithm that generates such stimuli in a meaningful and transparent way. The algorithm allows us to focus on the location, scale, and size of perceptually salient structures, proto-objects, and perhaps even semantics, rather than on global overall parameters such as contrast and spatial frequency. The eidolon factory is based on models of the psychogenesis of visual awareness. It affects the image by disrupting image structure across space and spatial scales. This is a very general method with many potential applications, of which we illustrate a few instances. We present results for the example of tarachopic amblyopia, showing that scrambled vision is indeed an apt interpretation.
Wouldn't it be nice to experience what colorblind people are seeing or what tarachopic amblyopes have to cope with? Viénot, Brettel, Ott, M'Barek, and Mollon (
1995) told us about the former by simulating the visual appearance of unilateral dichromats. Hess (
1982) informed us about the latter by requiring patients to draw what they see. This is by no means trivial. Unilateral dichromats are not all that clear in their reports (Sloan & Wollach,
1948), although they surely should be the first to know! It becomes clear that such introspective reports are less easy to come up with than it might seem if you try to describe, even to yourself, what you experience as you see a bookshelf in the periphery of your visual field. You may somehow be aware of the presence of books with titles written on their spines, yet you cannot identify the books or read the titles. The perceptual quality of peripheral vision appears contradictory and defies your ability to describe your sensations, yet surely
you are the first to know. Perhaps Titchener's (
1902) lab manual should be consulted; he explains in detail how to use introspection.
Phenomena such as visual crowding have been studied by measuring detection and discrimination thresholds. The appearance of suprathreshold stimuli is much harder to study. Exceptions are very specific situations. One instance is direct contamination between flanker and target objects (Greenwood, Bex, & Dakin,
2010). One really needs first-person reports—for instance, having observers draw what they see (Metzger,
1936; Sayim & Wagemans,
2013). However, the (in)ability of observers to reproduce their visual experience limits this technique to relatively simple stimuli. Another way is to use verbal descriptions (for review, see Lettvin,
1976; Metzger,
1936; Pelli,
2008). Of course, the use of first-person reports is beset with difficulties.
Another approach might take its lead from methods as proposed by Viénot et al. (
1995), who transformed color images to emulate dichromatic vision. Why not preprocess a stimulus in ways that emulate peripheral processing and present it foveally? Indeed, such methods have been proposed and used to good advantage by authors such as Rosenholtz and colleagues (e.g., Balas, Nakano, & Rosenholtz,
2009). In such cases, one really needs to test various methods of preprocessing to mimic the peripheral data. This might (at least) serve to mark various hypotheses about what goes on in peripheral vision as viable or perhaps worth pursuing.
Such methods have considerable potential. However, in order to wield them effectively, one needs to be able to explore a reasonable environment of the original stimulus. Because images can be transformed into other images in infinite ways, whereas empirical research can explore only limited ranges, there is a need for controlled variation based on our present understanding of visual processes. This is by no means an understood issue and has remained more of an art (Balas & Conlin,
2015). One really needs a much more transparent and intuitive way of parameterizing and perturbing stimuli.
Here we introduce a novel processing algorithm that produces stimuli differing from a given image in ways that are controlled by a limited number of clearly understandable parameters. Within this parametric space, we refer to the subset of stimuli that are equivalent along a given perceptual domain as
eidolons (see
Appendix D).
Of course, one could use a very simple parameter space. The variable contrast of sine-wave gratings used in a modulation transfer function (MTF; Schade,
1948,
1956) measurement draws on a one-parameter space of eidolons, the parameter being Michelson contrast. At the other extreme, one could use a very complex parameter space, to the point of parameterizing the luminance of every pixel in an image. Independently of this, one could use a very strict criterion of perceptual equivalence (e.g., defining two stimuli as equivalent only when they are metameric, that is, fully indistinguishable) or a wider-sense equivalence criterion based, for instance, on semantics. In this wider sense, all the well-known, even famous, variations on Leonardo's Mona Lisa by Marcel Duchamp, Fernando Botero, and many other artists are eidolons.
In experimental phenomenology, one aims at parameterizations that naturally fit generic visual presentations rather than imposed physical ones. An example of two physical parameters that do not map cleanly to perception is the pair contrast and sharpness. As photographers know, a high-contrast print often serves to save a slightly unsharp shot. Likewise, low-contrast prints are often considered unsharp. In such cases a natural space of eidolons may be a useful platform from which to launch research.
Another potential use of eidolons is in the study of visual anomalies and agnosias. Well-known examples are renderings of color photographs that are intended to suggest to the normal trichromat what the experiences of various dichromats might be like. While this use of eidolons might not directly answer scientific questions, such examples are useful because they offer the generic observer an opportunity to better understand more or less singular observers and thus interact more effectively with them. For instance, one might adapt one's printed or projected figures and text for more universally effective communication. Thus, the topic holds some genuine interest, even though mostly from an applied science point of view.
The body of this article is structured in three sections.
Theory details the theoretical framework in which we define our concept of eidolon.
Implementation contains the general description of an eidolon factory based on scale decomposition and spatial disarray.
Examples shows some examples of experiments in which stimuli produced by the eidolon factory could be used.
The notion of “local sign” (meaning “positional signature”;
Localzeichen in German
) is due to Hermann Lotze (
1852). Although it was considered of major importance at the time (mid-19th century) and frequently occupied people such as Helmholtz (
1892), it has largely been forgotten today. In order to appreciate what the local sign problem is, consider the following.
Axons carry spike trains, one action potential looking much like the other. A recording from any given axon will fail to reveal which location of the visual field the neuron is serving, what the specific property the neuron is messaging about might be, what its preferred direction or orientation is, what the current gain factor of the neuron is, and so forth. Of course, the brain scientist knows. This is possible because the scientist knows the current stimulus, has poked an electrode at a certain location, can look at the anatomy, and might have additional information concerning the neuron. But how about the brain itself? It does not know the stimulus, it cannot look at its own anatomy, and it is unlikely to maintain databases on all of its neurons.
Modern brain scientists consider the discovery of somatotopy to have rendered Lotze's problem a nonissue
(Kaas,
1990). We hold a rather different opinion on this and believe that the problem might be even more pressing than it was felt to be at the time.
There are indications that an apparently physiologically intact visual brain might still lack local sign. We refer to the condition of
tarachopia (“scrambled vision”) as reported by Hess (
1982). In a case of unilateral tarachopia, the tarachopic eye might have acuity and contrast sensitivity just as good as that of the normal eye, yet the amblyopia might be serious enough to prevent the owner from making out the headlines of today's newspaper. Apparently tarachopia involves lacking or disturbed local sign. The local structures are there, but the differential geometric connections are lacking. The physiological basis is (as yet) unknown, so to this day tarachopia has to be classified as an agnosia (
Seelenblindheit, or “soul blindness”). This shows that local sign is an ill-understood mechanism that lies at the basis of visual awareness and is not a mere philosophical fiction, as is sometimes suggested.
We will not enter into the possible mechanisms of local sign here (see elsewhere—e.g., Koenderink,
1984a), but we suggest that scrambling local sign, or local disarray of image elements, might be an apt model of certain forms of visual equivalences. For instance, the phenomenon of crowding in the peripheral visual field (Bouma,
1970; Pelli,
2008) is phenomenologically very similar to the tarachopic condition as it occurs in the focal vision of certain patients (e.g., Hess,
1982; Sayim & Wagemans,
2013).
From a phenomenological perspective, at least some aspects of local sign appear to be implemented on the fly in the psychogenesis of visual awareness. If one cuts an image into pieces and displaces the pieces randomly, the disarrayed image looks fairly normal (
Figure 2). Masking the seams between the pieces (e.g., with gray stripes) results in the experience of an undisturbed image, seen behind the mask (Koenderink, Richards, & van Doorn,
2012a). This also works in space time (Koenderink, Richards, & van Doorn,
2012b). In such cases the optical data are incoherent, whereas the visual awareness is coherent, a most remarkable fact! Apparently, perception constructs an orderly image where physically there is chaos. These effects work over large distances and time spans. They remain ill (euphemism for “not at all”) understood.
Such empirical facts suggest that the psychogenesis of visual awareness imposes spatiotemporal coherence in an active way. Possibly classical local sign might—at least partly—depend on this. The crowding phenomenon (Bouma,
1970; Pelli,
2008) further suggests that such a mechanism is confined to focal vision. We expect the eidolon factory to become an important tool in the study of such phenomena.
Every factory run necessarily starts with a fiducial image. This image is the basis for a huge data structure that is perhaps best thought of as a simulation of the cortical activity induced by that image. This is the structure that will eventually be used in the synthesis. Once set up, the fiducial image itself has become irrelevant. This is not so much an analysis as merely a dumb formatting stage.
Consider an image as a discrete sample of a scalar field (intensity say) defined over the Euclidean plane. The first chore is to represent it at multiple levels of resolution. This is the proper topic of scale space (Florack,
1997; Koenderink,
1984b; Lindeberg,
1994; ter Haar Romeny,
2008). It has long since become a de facto standard in image processing. In practice, one samples both the scale and the space domain discretely.
In order to understand the scale-space data structure one needs a number of important insights, all based on the basic scale-space structure. Although we will not prove it here (see the textbooks quoted previously), scale space is based on the Gaussian kernel as the point operator. A convenient scale parameter is the half-width (standard deviation) of the Gaussian blurring kernel. In
Figure 3 we show the (sampled) scale space for an image.
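Such a sampled scale space is easy to construct. Below is a minimal Python sketch (using NumPy and SciPy); the function name `scale_space` is ours, for illustration only:

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def scale_space(image, sigmas):
    """Sampled Gaussian scale space: one blurred copy of the image per
    scale level. The scale parameter is the standard deviation
    (half-width) of the Gaussian blurring kernel, as in the text."""
    return np.stack([gaussian_filter(image, sigma=s) for s in sigmas])

# Example: a random test image represented at four scale levels.
rng = np.random.default_rng(0)
img = rng.random((64, 64))
levels = scale_space(img, sigmas=[1, 2, 4, 8])
```

Each coarser level could equally well be computed by further blurring a finer level, which is the redundancy noted below.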
Because the image at some scale can be computed from any image at some finer scale, this representation is inconveniently—and quite unbrainlike—redundant. It is preferable to find the differences between adjacent scale levels. A stack of such difference images will be our basic data structure. It represents the simplest possible nontrivial structure that has the desirable properties listed above: local, isotropic, translation invariant, and self-similar.
The difference layers are just the fiducial image as represented in difference of Gaussian (DOG) receptive fields of various sizes (
Figure 4). Another way to understand these layers is as a stack of scale derivatives of the image. This explains the use of DOG filters in sharpening images (Margulis,
1998,
2005) in applications such as Adobe Photoshop (San Jose, CA).
The implication is that all layers added together simply recompose the image. This is indeed obvious from the definition of the differences. But although trivial, this is a crucial point. The eidolon factory analyzes the image and synthesizes it again. If done exactly like this, the synthesis would construct the perfect doppelgänger: the picture itself! The desirable fuzziness of the eidolons derives from perturbations applied to the parts before the synthesis. This is the eidolon factory in a nutshell.
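The exact recomposition from difference layers can be verified directly. A minimal sketch, assuming Gaussian blurs at a handful of scales (the helper name `dog_stack` is ours):

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def dog_stack(image, sigmas):
    """Differences between adjacent scale-space levels (DOG layers).

    The finest layer is the difference between the image itself and the
    first blur level; the sum of all layers plus the coarsest blur
    recomposes the image exactly (a telescoping sum)."""
    blurred = [image] + [gaussian_filter(image, s) for s in sigmas]
    layers = [blurred[i] - blurred[i + 1] for i in range(len(sigmas))]
    base = blurred[-1]  # residual coarse-scale image
    return layers, base

rng = np.random.default_rng(1)
img = rng.random((32, 32))
layers, base = dog_stack(img, sigmas=[1, 2, 4])
recon = base + sum(layers)  # synthesis: the layers add up to the image
```

Perturbing `layers` before the final sum is, in a nutshell, what the eidolon factory does.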
Yet another interpretation is important. The differences show local transition regions. This is different from edge finder images in that the local transitions have sides, whereas mere edginess—as yielded by programs such as Photoshop—fails to represent that. This is an important topic. It suggests that the integration over DOG activity can also be understood as a synthesis that combines all edgelet (or transition area) samples. This can indeed be proven formally (Koenderink et al.,
2016). It is somewhat intricate because the argument involves edge finders, line finders (directionally and orientationally tuned “simple cells”;
Figure 5; Hubel & Wiesel,
1968), and Laplacian operators. It allows one to use the DOG responses as summaries of the (far more numerous and involved!) simple cell responses. For details we refer to earlier work (Koenderink,
1990).
In perturbing local sign, we identify three conceptually different handles. These relate to the structure of the disarray fields. We denote them reach, grain, and coherence, and discuss each in sequence. All are based on disarray generated by Gaussian random fields that are statistically uniform and isotropic. Of course, it is sometimes desirable to consider nonuniform and/or nonisotropic perturbations. This is perfectly possible (we show an example later), but one then needs to forge a suitable parameterization that applies to such special cases.
Basic Gaussian random fields are easily obtained by blurring Gaussian white noise. Generating two independent instances at the same blur level and with the same power yields Gaussian random displacement vector fields. One simply treats the scalar fields as the Cartesian coordinates of displacement vectors. Here we use the fact that an isotropic normal distribution in the plane is separable into two mutually independent scalar normal distributions. This is an important, highly remarkable property that renders the normal distribution unique. In
Figure 8 we illustrate such random displacement fields.
The example of
Figure 8 already introduces one fundamental parameter, the grain. Another parameter, one that is independent of grain, we call the
reach. The reach of the disarray (
Figure 9) measures how much a pixel will be displaced in some suitable statistical sense. Thus, it is an amplitude, or intensity-like parameter. If one scales all vectors of a random vector field by the same factor, one changes the reach, whereas the grain is not affected.
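A hedged sketch of how grain and reach might be realized, following the recipe above: blur white noise to set the grain, then rescale the vectors so that the root-mean-square displacement equals the reach. The names `disarray_field` and `apply_disarray` are our own, and the RMS convention is just one possible "suitable statistical sense" of reach:

```python
import numpy as np
from scipy.ndimage import gaussian_filter, map_coordinates

def disarray_field(shape, grain, reach, rng):
    """Gaussian random displacement field.

    grain: blur width of the underlying white noise (spatial scale of
    the disarray); reach: RMS displacement amplitude in pixels. Two
    independent scalar fields serve as the x and y components."""
    dx = gaussian_filter(rng.standard_normal(shape), grain)
    dy = gaussian_filter(rng.standard_normal(shape), grain)
    # Normalize so the root-mean-square displacement equals `reach`.
    rms = np.sqrt(np.mean(dx**2 + dy**2))
    return reach * dx / rms, reach * dy / rms

def apply_disarray(image, dx, dy):
    """Resample the image at the displaced positions (local-sign disarray)."""
    rows, cols = np.indices(image.shape)
    return map_coordinates(image, [rows + dy, cols + dx],
                           order=1, mode='reflect')

rng = np.random.default_rng(2)
img = rng.random((64, 64))
dx, dy = disarray_field(img.shape, grain=4.0, reach=3.0, rng=rng)
eidolon = apply_disarray(img, dx, dy)
```

Scaling `dx, dy` by a common factor changes the reach only; changing `grain` changes the spatial granularity only, exactly the independence described in the text.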
The grain and the reach suffice to parameterize many disarrays of interest. A third parameter comes into play when one regards details at different scales. We illustrate this with
Figure 10.
The coherence over scale of the disarray is the third parameter. It defines the degree to which the displacement fields are correlated across scales. It takes values of zero (incoherent) if the random displacement field is generated independently for each scale; it takes a value of one (fully coherent) if the displacement fields at every scale are constructed by filtering the same Gaussian white noise samples. Coherence measures how the displacements of overlapping receptive fields of different sizes are mutually correlated. This is highly important in many applications. Coherent disarray retains the local image structure even when the global image structure is destroyed. As a result, coherent disarray appears like deformation, whereas incoherent disarray appears like diffusion or shuffling.
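One plausible way to realize the coherence parameter is to mix a single shared white-noise sample with per-scale independent samples before filtering. This is an assumed parameterization for illustration, not necessarily the one used in the factory:

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def multiscale_fields(shape, grains, coherence, rng):
    """One displacement component per scale, with adjustable coherence.

    coherence = 1: every scale filters the *same* white-noise sample, so
    displacements at overlapping locations are correlated across scales;
    coherence = 0: each scale gets independent noise. Intermediate
    values mix the two sources with the appropriate variance weights."""
    shared = rng.standard_normal(shape)
    fields = []
    for g in grains:
        own = rng.standard_normal(shape)
        noise = np.sqrt(coherence) * shared + np.sqrt(1.0 - coherence) * own
        f = gaussian_filter(noise, g)
        fields.append(f / f.std())  # equalize power across scales
    return fields

rng = np.random.default_rng(3)
coh = multiscale_fields((128, 128), [2, 4, 8], coherence=1.0, rng=rng)
inc = multiscale_fields((128, 128), [2, 4, 8], coherence=0.0, rng=rng)
```

With full coherence the fields at neighboring scales are strongly correlated (the disarray looks like a deformation); with zero coherence they are uncorrelated (the disarray looks like diffusion or shuffling).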
Each of these parameters—the grain, the reach, and the coherence—has a specific and characteristic effect. Of course, apart from the disarray, there is also the blur of the original image even before it is subjected to disarray. One may consider the blur as a fourth parameter, although it is one that is really distinct from the local sign variation. Blur and disarray can be combined in various ways.
In this article we do not so much present a fixed algorithm as a class of intimately related methods. Instead of a single algorithm with several (perhaps many) parameters, we suggest a toolbox from which one selects, for any given problem, the most appropriate method or combination of methods to implement certain desiderata in the simplest and most efficient way.
The toolbox method lets one zoom in on a problem in an intuitive manner, with the nature of the remaining parameters being immediately apparent. This allows one to focus on the phenomena themselves. No doubt most eidolon factory implementations will be ephemeral, constructed for a specific purpose. However, the toolbox will be fixed; at most it will be added to, although only sparingly.
Related to this is that we do not consider the eidolon factory to be a straight emulation of the cortex at all. What we try to provide is an interface to the phenomenal complexity, whether in the mind, the brain, or the world. Good interfaces hide irrelevant complexity and are therefore very different from emulations (Hoffman,
2009; Koenderink,
2011). The toolbox preferably implements summary accounts rather than simulations of complexities that are irrelevant to the task at hand. Ideally, the eidolon factory should be trivial and thus conceptually transparent.
Here are some examples of this type of approach. First-order directional simple cells are similar to edge finders, or edge detectors. In our formalism, they are tangent vectors at some specific scale. Because the tangent space is two dimensional, a basis of just two of these—most conveniently with orthogonal direction preferences—suffices at each location.
This is functionally equivalent to the overdetermined continuous basis in the cortical implementation, for any directional sensitivity can be implemented on the fly (often known as
steerable filters; W. T. Freeman & Adelson,
1991). This is a natural consequence of formal differential geometry (Koenderink & van Doorn,
1992). Of course, in real life one needs to consider the effects of perturbations. For instance, removing one basis vector hardly matters to the cortical overdetermined basis but would render the simple formal system inoperative.
Something similar goes for the second-order structure. In the cortex this is the system of line finder simple cells. In the formal treatment, one requires a basis of three items: either the components of the Hessian (a symmetric tensor) or second-order directional derivatives at 60° orientation increments. Thus, a cortical column can be summarized by such a triple. Again, any orientation can be implemented on the fly. There is no loss of computational possibilities, despite this being a mere summary account.
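The first-order "on the fly" synthesis can be sketched concretely: two orthogonal Gaussian derivative filters form the basis, and any direction is obtained as a linear combination, the steering formula being D(θ) = cos(θ)·Dx + sin(θ)·Dy. The function names below are ours, and this is a formal sketch rather than a claim about cortical wiring:

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def gradient_basis(image, sigma):
    """Two orthogonal first-order Gaussian derivatives: the minimal basis."""
    gx = gaussian_filter(image, sigma, order=(0, 1))  # derivative along x (axis 1)
    gy = gaussian_filter(image, sigma, order=(1, 0))  # derivative along y (axis 0)
    return gx, gy

def steer(gx, gy, theta):
    """Synthesize the derivative in direction theta on the fly (steering)."""
    return np.cos(theta) * gx + np.sin(theta) * gy

rng = np.random.default_rng(4)
img = gaussian_filter(rng.random((64, 64)), 2.0)
gx, gy = gradient_basis(img, sigma=2.0)
g45 = steer(gx, gy, np.pi / 4)  # a 45-degree "edge finder" from just two filters
```

Removing either of `gx` or `gy` would render this minimal formal system inoperative, whereas the cortical basis, being overdetermined, would hardly notice, as the text points out.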
Starting with these differential operators of order less than three, there are a number of relations that are of immediate importance to possible synthesis and thus the eidolon factory. The addition of line finders at all orientations at a given location yields the Laplacian operator, which is the DOG profile for infinitesimal (small) size difference. Indeed, the DOG layers give a visually intuitive edge representation at a given scale. This makes intuitive sense because the difference of sharp and blurred instances of an image retains just the boundary regions. The edges are drawn as the Pinna watercolor illusion double lines (Pinna,
1987,
2008; Pinna et al.,
2001,
2003). Because all DOG layers add up to the fiducial image, one sees that the image can be regarded as synthesized from its edgelets. This is indeed a formal theorem (Koenderink et al.,
2015).
With such relations in mind, it is understandable that perturbations in a complete cortexlike emulation can be captured by summary methods at various levels, from that of the simple cells all the way to the fiducial image. When this is indeed possible, one should opt for it because it allows one to ignore much detail that is actually causally ineffective.
This is the very notion behind the proposal of the eidolon factory as a toolbox. In psychophysical experiments it would make good sense to start with the simplest eidolons and progress to more complicated instances—perhaps eventually going all the way to the simple cells—when the empirical data cannot be described in the simpler way. After all, understanding vision means understanding it
conceptually, not building an emulation of brain events that can itself hardly be understood because of an overdose of causally ineffective complexity.
The visual field is roughly scale invariant. Not being dedicated to any specific scale rules out the single-scale disarray discussed above. One needs to acknowledge the existence of many scales. This cannot be done by mere blurring. Fourier-based methods cannot deal with this. The method somehow has to recognize the spectrum of scales. The simplest method that does this is fractal disarray. It simply disorganizes the fiducial image, but it does this in a scale-independent manner.
There are numerous ways in which one might implement scale-dependent disarray. We implemented a very simple parameterization for demonstration purposes. One distributes monotonically over the scale domain the degree to which large fields drag smaller ones with them, that is, the degree to which the random displacement at any given location is correlated across scales. The strength of this coupling then becomes the control parameter.
This control effectively controls the fractal dimension of the local sign displacement field. It has a huge influence on pictorial structure. This is the coherence. It is a parameter of considerable conceptual interest and, in our view, a major aspect of human vision that is hardly documented in the standard textbooks.
Here we illustrate the use of eidolons to simulate, in generic observers, deficits akin to those observed in the amblyopic condition of tarachopia. Tarachopia is a visual agnosia described by Hess in 1982. Hess described a form of amblyopia in which an observer had two perfectly good eyes according to standard criteria. That is, the MTFs for sine-wave grating contrast thresholds were virtually identical for both eyes, with good contrast detection thresholds and good high spatial frequency roll-off. Yet, in the visual awareness of the observer, one eye was normal, whereas the other was unfit to read the headlines of a newspaper. It was classified as amblyopic, which is why the patient had sought treatment. So what was wrong? Hess had the bold idea to let the patient report on the nature of immediate visual awareness. Compared with the “good” eye, the visual awareness of the “bad” eye appeared spatially scrambled to the patient. Hence Hess' term tarachopia, “scrambled vision,” for this form of amblyopia.
Tarachopia is a paradigmatic case because it shows the categorical difference between psychophysics proper and experimental phenomenology (Albertazzi,
2013; Koenderink,
2015a). In psychophysics proper, one uses only objective methods. But awareness is about subjective facts, a topic of phenomenology. When Hess measured contrast detection thresholds for sine-wave gratings, he was practicing psychophysics. But when he asked the patients what they saw, he switched over to experimental phenomenology, where objectivity is replaced with intersubjectivity.
Thus, the topic is a conceptually highly interesting one with important potential implications for neuroscience, psychophysics, and phenomenology. This is why we propose to study tarachopia in generic observers by imposing controlled spatial disarray on the stimulus.
We attempt to create artificial tarachopia in normal observers using eidolons. This yields a way to classify possibly distinct forms of tarachopia, which can be explored by varying the parameterization of the eidolon factory. It thus opens up a novel field of endeavor.
We depart from the standard MTF analysis of spatial vision (Cornsweet,
1970; Van Nes & Bouman,
1967), the detection thresholds for sine-wave grating modulations of an otherwise uniformly gray field. Generic human observers fail to detect gratings at spatial frequencies higher than about 50 cycles/degree and detect a range of intermediate spatial frequencies at Michelson modulations somewhat below 1%. Such analysis was introduced in television engineering by Otto Schade (
1948,
1956), who measured the first MTF.
One typically records the detection of a grating as compared to a uniform field. A more objective method records the ability of observers to discriminate between horizontal and vertical gratings. For generic observers, these methods yield very similar results. However, in the case of tarachopic amblyopes, the difference is categorical. Such observers detect the presence of gratings just as well as anyone else, yet they fail to discriminate the orientation. They detect a pattern but fail to identify it. Hess' suggestion that this might be due to their scrambled visual field makes intuitive sense.
In order to study this suggestion in generic observers, we produce eidolons of sine-wave gratings using a coherent local sign disarray. Representative examples of stimuli are shown in
Figure 15.
The technical details of the experiment are pretty standard. The field of view was 10° × 10°, the average luminance of the uniform surround (30° × 20°) was 400 cd/m², the viewing distance was 57 cm, and the experiment was conducted in a dark room. We used a 2°-wide smooth transition from the grating to the uniform field. The display was linearized. We used an 8-bit digital-to-analog converter, which is only marginally suited to the usual MTF measurements but yields ample resolution for the eidolons. Observers were the authors. They have good spatial vision and fixated the center of the screen. Notice that there is no need for observers to be naïve in threshold experiments.
We performed two measurements. In the first we determined the MTF for seeing “something” instead of “nothing.” In the second we determined the threshold for discrimination between horizontal and vertical gratings. In the first method we aimed at the 50% detection threshold; in the second we aimed at the 75% discrimination threshold. Thresholds were determined by way of simple up–down methods. Results do not differ among the three observers; we show the overall average in
Figure 16.
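For concreteness, here is a minimal sketch of the kind of simple up–down procedure referred to here, run on a simulated observer. The 1-up/1-down rule converges on the 50% point of the psychometric function (the 75% point would require, e.g., a transformed or weighted rule); all names and numbers are illustrative assumptions, not the actual experimental code:

```python
import numpy as np

def updown_staircase(respond, start, step, n_trials):
    """Simple 1-up/1-down staircase: step down after a correct response,
    up after an incorrect one; the threshold estimate is the mean of the
    last few reversal levels."""
    level = start
    reversals, last_dir = [], 0
    for _ in range(n_trials):
        direction = -1 if respond(level) else +1  # correct -> harder (down)
        if last_dir and direction != last_dir:
            reversals.append(level)
        last_dir = direction
        level = max(level + direction * step, step)  # keep level positive
    return float(np.mean(reversals[-6:]))

# Simulated observer: detection probability is a sigmoid of stimulus level,
# with a 50% point at true_threshold = 1.0 (arbitrary units).
rng = np.random.default_rng(5)
true_threshold = 1.0
def observer(level):
    p = 1.0 / (1.0 + np.exp(-4.0 * (level - true_threshold)))
    return rng.random() < p

estimate = updown_staircase(observer, start=2.0, step=0.1, n_trials=200)
```

The staircase settles where up and down steps are equally likely, i.e., at the 50% point of the simulated psychometric function.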
For the detection of contrast, irrespective of pattern, there is no appreciable difference between the sine-wave gratings and the eidolons. Both contrast sensitivity and high spatial frequency roll-off are normal. But in the case of recognition, here the overall impression of horizontal or vertical orientation, the eidolons suffer badly.
In the case of the true gratings, observers are aware of the orientation at threshold. But in the case of the eidolons, observers may have a hard time making out the orientation even if they see the contrast modulations well enough. Of course, this is hardly a surprise given the nature of the stimuli (
Figure 16)! The paradigm puts the generic observer in a position similar to that in which the tarachopic amblyope finds herself when viewing pure grating patterns.
Notice that the eidolon paradigm neatly emulates Hess' findings for a tarachopic observer in generic vision. To the extent that all psychophysical testing yields the same results, the subjective report of the amblyope can be replaced with the stimulus description (
Figure 15). Thus, in a certain sense, the eidolon paradigm yields an objective emulation of the subjective report. Of course, the direct proof of perceptual equivalence could be achieved only when tarachopic observers are confronted with intact images in the affected eye and eidolons in the healthy eye. We suggest this as an obvious development of this line of research.
There are infinitely many ways to generate eidolons. The cloud of acceptable variations on any fiducial image is huge, albeit nothing when compared with the space of all possible images. We're talking infinities here. One has to make choices. Any such choice had better be based on some fundamental considerations.
The most general methods assume nothing: neither physiological nor phenomenological nor ecological prior knowledge. Here is a simple example, but there are numerous others; most of the ones we can think of have been used on one occasion or another. Given a pixelated image, one simply interchanges two randomly selected pixels and repeats this a number of times. Replacing pixels with random values yields a similar result, technically known as “salt-and-pepper noise” (Jayaraman, Esakkirajan, & Veerakumar,
2009;
Figure 20). A good parameter would be the ratio of the number of swaps to the total number of pixels. A parameter value of zero returns the fiducial image; a value of one yields a totally random image. Such eidolons may well be useful in certain psychophysical contexts. From a phenomenological perspective they are trivial, and from an esthetic perspective they are appalling. They look exactly as they are: alien to the genesis of visual awareness. No doubt one could quantify this by computing various measures typical for cortical representations.
Other well-known examples of this general class involve adding some type of noise pattern to the image. Such methods have frequently been used in vision research. They work best with the abstract, nonsense images commonly used in psychophysics. Recognizable images are remarkably resistant because psychogenesis is expert at beating the “cocktail party effect” (Bronkhorst,
2000; Shinn-Cunningham,
2008), which is why the familiar signal-noise methods from engineering (Kailath, Sayed, & Hassibi,
2000; Kay,
1993; Scharf,
1991) are unlikely to apply.
Other methods base eidolon generation on the essentially arbitrary way digital images are conventionally stored. A case that has become famous in vision research uses blocking (Harmon,
1973; Harmon & Julesz,
1973). Such eidolons have nothing to do with the intrinsic structure of images (e.g., why not have a honeycomb array of hexagonal pixels instead of a Cartesian checkerboard?), the known physiology, or the phenomenology of the visual field (
Figure 21). They allow easy parameterization of structural complexity and are inherently local—both desirable properties.
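Blocking of the Harmon–Julesz type is easily sketched: replace each square tile by its mean luminance. The helper name `block_average` is ours, and image dimensions are assumed to be multiples of the block size for simplicity:

```python
import numpy as np

def block_average(image, block):
    """Blocking: replace each block-by-block tile by its mean luminance.
    The block size parameterizes the structural complexity."""
    h, w = image.shape
    tiles = image.reshape(h // block, block, w // block, block)
    means = tiles.mean(axis=(1, 3))
    # Expand the tile means back to the original resolution.
    return np.repeat(np.repeat(means, block, axis=0), block, axis=1)

img = np.arange(64.0).reshape(8, 8)
blocked = block_average(img, 4)  # 2 x 2 grid of constant tiles
```

Note how the operation is tied to the Cartesian pixel grid: nothing about it respects the intrinsic structure of the image, which is exactly the point made in the text.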
Well-known instances are phase-scrambled images (Oppenheim & Lim,
1981; Thomson,
1999; Vogels,
1999). Here, disarray is applied in the spectral domain. The methods were perhaps inspired by the notion that sine-wave gratings are natural parts of images or somehow special to what the primary visual cortex is up to. Both (mutually related) notions are false, but that is not the point here. What makes Fourier analysis special is that it is global. This has indeed some—though not much—relation to what might be desirable for a biologically viable optic sensor system. The eidolons one obtains look somewhat better than those from the previous example (
Figure 22), yet they don't look natural. Apparently, the global nature of the parts is problematic.
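The idea of phase scrambling can be shown in one dimension with a naive discrete Fourier transform (a hedged Python sketch with hypothetical names; production code would use an FFT on a 2-D image): every amplitude is kept, every phase is randomized, and Hermitian symmetry keeps the result real valued.

```python
import cmath, math, random

def dft(x):
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]

def idft(X):
    n = len(X)
    return [sum(X[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]

def phase_scramble(signal, rng=None):
    """Keep each Fourier amplitude, replace each phase by a random one."""
    rng = rng or random.Random()
    n = len(signal)
    X = dft(signal)
    Y = [0j] * n
    Y[0] = X[0]                          # DC keeps its (real) value
    for k in range(1, (n + 1) // 2):
        phi = rng.uniform(0, 2 * math.pi)
        Y[k] = abs(X[k]) * cmath.exp(1j * phi)
        Y[n - k] = Y[k].conjugate()      # mirror bin: conjugate phase
    if n % 2 == 0:
        Y[n // 2] = X[n // 2]            # Nyquist bin must stay real
    return [z.real for z in idft(Y)]
```

The amplitude spectrum (and hence the DC level and total power) is untouched, which is exactly why such eidolons are globally, not locally, constrained.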
Engineers do what is possible and most economical, science not being their first priority. Yet, historically, engineers have produced eidolons of considerable interest and importance. We simply mention a few—there are many—of their achievements. Early television engineering was much about bandwidth. Otto Schade (
1948) pioneered the MTF, enabling him to create eidolons that were acceptable to the public for many years. The development of color television tells a similar story. Eidolons were based on opponent channels, with the higher bandwidth devoted to the luminance channel. When digital images became common, the aforementioned JPEG eidolons became of major importance. They are based on rather intricate properties of spatial vision.
Of course, it is much more interesting to consider eidolons that are somehow constrained by scientific understanding of either the neurophysiology or the phenomenology of visual awareness (Stojanoski & Cusack,
2014). A well-known current model of eidolons is due to the work at Eero Simoncelli's lab at New York University (Portilla & Simoncelli,
2000). Their idea is to impose empirical knowledge of the structural complexity of neural activity in V1 as a constraint on the structure of the eidolons. Thus, an eidolon is an image that would ideally evoke neural activity statistically equivalent to that evoked by the fiducial image. This is an important notion, characterizing the bottleneck imposed by V1 (or anything up to V1), very similar to—but hugely more complicated than—the notion of metamerism in colorimetry.
Rather than just analyzing the fiducial image first and then trying random images until you hit on one that has the equivalent statistics, the actual implementation forces an initial noise image into complying with the descriptor values established in the analysis stage. Otherwise, it would be like waiting for the proverbial monkey randomly hitting the keys of a typewriter to finish a perfect transcript of Helmholtz's
Handbook of Physiological Optics. It is a sculpting of essentially random structures to be as equivalent to the fiducial statistics as can be. We describe these methods in
Appendix C.
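The "sculpting" step can be illustrated with a deliberately tiny toy: match just the mean and variance of a noise sample to those of the fiducial. (This is a hedged stand-in with hypothetical names; the actual Portilla-Simoncelli algorithm iterates projections onto a far richer set of wavelet-domain statistics.)

```python
import math

def match_mean_var(noise, target_mean, target_var):
    """Project a noise sample onto the set of signals with the given mean
    and (population) variance: the simplest possible 'sculpting' of noise
    toward the fiducial statistics."""
    n = len(noise)
    m = sum(noise) / n
    v = sum((x - m) ** 2 for x in noise) / n
    scale = math.sqrt(target_var / v) if v > 0 else 0.0
    return [target_mean + scale * (x - m) for x in noise]
```

The point of starting from noise and projecting, rather than sampling random images until one fits, is that the projection reaches the constraint set in a single step per statistic instead of by blind search.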
The Portilla-Simoncelli algorithm produces the “mongrels” that have been used in the highly interesting research in Ruth Rosenholtz's lab at Massachusetts Institute of Technology (Rosenholtz,
2011; Rosenholtz, Huang, & Ehinger,
2012; Rosenholtz, Huang, Raj, Balas, & Ilie,
2012). They have pioneered the use of mongrels as a novel and powerful tool in vision research. That is exactly the intended use of the eidolons proposed here.
Our eidolons are based on the phenomenology of vision rather than neurophysiology. However, on the formal level there are obvious tangencies. Our inspiration also came from the study of painting methods. Visual artists, throughout the centuries, have been involved in the production of eidolons. On the whole they have been very successful, their clients sometimes taking their eidolons for reality. However, they explored the territory thoroughly and reached the dark boundary regions where the eidolons fall apart, perhaps taxing the visual competence of some observers but leaving the majority of their public behind. We find that technical painting methods are closely related to the phenomenology of vision as studied academically (e.g., Cateura,
1995; Jacobs,
1986). Our eidolon factory is based on that.
The eidolon factory described here (technically in
Appendix B; a demonstration program is available—see
Appendix A) offers some desirable features for vision research:
It is formally simple and transparent. It is essentially just the mathematician's toolbox of differential geometry (Bell,
2005; Koenderink,
1990; Koenderink & van Doorn,
1992; Spivak,
1999; ter Haar Romeny,
2008).
It is overall linear, except where essential nonlinearities enter in a transparent manner.
It is algorithmically simple and transparent. No magical numbers. No iterative procedures (
Appendix C). No partial differential equations to solve (Elder & Zucker,
1998). All that happens is the accumulation of mass activity—our synthesis stage.
It is a nice summary account of what the cortex might be computing. Such summary accounts (actually caricatures, of course) might be more useful than an exhaustive description because they appeal to the intuition. Phonebooks are useful but hardly appeal to the understanding.
It is a powerful heuristic in that it is easily expandable. There are only a limited number of crucial elements to be understood.
Eidolons can be obtained in a straightforward manner; only a few (intuitively obvious) parameters need to be set.
This may sound like eidolon factories are just for squares—there being no surprises or challenges for the cool kids! But that would be too limited a picture. Being able to actually understand what is happening is really a source of freedom. The parameters at your disposal are meaningful, and their actions are largely independent of each other. So, you have an interface to the factory that is transparent. There will be no surprises once you understand the basic (simple) structure. This makes it possible to aim your investigation of visual awareness much more precisely. It puts you in control as a scientific investigator. When probing nature (including the human mind!), the surprises should be due to nature rather than the probing tool.
Because it is so simple and direct, the eidolon factory is very easily extended in various directions. For instance, one may apply disarray just as well to opponent color channels as to the orientation of edgelets and so forth. Disarray is also easy to apply in a spatially nonuniform way, opening up many directions of research. A simple example of such focused disarray is shown in
Figure 23.
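One way to obtain such focused disarray is to modulate the displacement amplitude by a window centered on a region of interest. Here is a hedged Python sketch (hypothetical names; a Gaussian window and nearest-pixel sampling are our choices, not prescriptions): pixels near the focus are strongly scrambled, while remote pixels are left untouched.

```python
import math, random

def focused_disarray(image, max_shift, cx, cy, sigma, rng=None):
    """Displace each pixel by a random offset whose amplitude falls off as
    a Gaussian of distance from the focus (cx, cy)."""
    rng = rng or random.Random()
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            weight = math.exp(-((x - cx) ** 2 + (y - cy) ** 2)
                              / (2 * sigma ** 2))
            dx = int(round(weight * rng.uniform(-max_shift, max_shift)))
            dy = int(round(weight * rng.uniform(-max_shift, max_shift)))
            sx = min(max(x + dx, 0), w - 1)   # clamp to the image border
            sy = min(max(y + dy, 0), h - 1)
            out[y][x] = image[sy][sx]
    return out
```

Any other spatial weighting (an annulus, an eccentricity-dependent ramp mimicking peripheral vision, and so forth) drops in with a one-line change.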
In conclusion, we have proposed an eidolon factory that is quite open ended in its potential applications and capable of almost endless development. It is also simple enough that one may adapt it for any specific application. Because of its simplicity, it allows one to tailor the nature of eidolons to specific problems.
KG and MV were supported by the Deutsche Forschungsgemeinschaft DFG SFB/TRR 135. MV was supported by the EU Marie Curie Initial Training Network “PRISM” (FP7—PEOPLE-2012-ITN; grant agreement 316746). JW, JK, and AvD were supported by a program of the Flemish Government (METH/14/02), awarded to JW. JK was supported by a Humboldt Award of the Alexander von Humboldt Foundation.
Commercial relationships: none.
Corresponding author: Karl R. Gegenfurtner.
Email: Karl.R.Gegenfurtner@psychol.uni-giessen.de.
Address: Justus-Liebig Universität Giessen, Abteilung Allgemeine Psychologie, Giessen, Germany.
We prepared a demo application (
for PC;
for Mac) that allows one to produce parameterized eidolons of various types for given images and save the results. The application also lets you conveniently look at the data structures that play decisive roles behind the screen. Thus, it shows more than is strictly necessary for eidolon generation. On the other hand, it is by no means a universal eidolon factory: There are things you might want to try that the demo doesn't allow. The reason is simply that we needed to keep the complexity of the interface—already considerable—within reasonable bounds. We concentrated on instances that might prove of immediate interest to vision research.
Because the demo required an extensive interface, we economized on its capabilities. It lets you process only monochrome images that are 512 pixels square. Anything else will be converted to that format. The eidolon machine synthesizes the image on the basis of a scale-space representation of edgelets: boundary representations that (unlike the edginess obtained from edge detectors) retain the polarity along the boundary.
The demo will run on most platforms because it was written in Processing. For the applications, we packaged the Java environment with the code so we can be certain that they will run properly regardless of what you may have installed on your machine.
The demo has an extensive interface that is convenient but perhaps takes getting used to. Most of the ins and outs are explained in the help that is always available under the “H” (or “h”) key.
We find ourselves somewhat in a quandary about how we should describe the algorithmic aspects of the eidolon factory. Because proprietary aspects should not figure in a scientific journal, we cannot use a high-level formal language such as Mathematica (which would be most appropriate and clear from a formal perspective) or an environment such as Matlab (which might appeal more to engineering minds). Differences are substantial—for instance, one line of Mathematica may correspond to a thousand lines of C. Obviously, the former is easier to read than the latter. This is due to the hiding of details that are conceptually irrelevant.
Our implementation is in Processing, which is an open source Java-derived environment that was especially aimed at creative minds. Modern artists and designers use it extensively. We use it all the time in our vision research because it saves us so much time and effort, but we notice that few others in our field even know about it. Java runs on all platforms and is a well-designed object-oriented language. Processing might be “Java without tears” (for suckers), but it has retained these advantages. Almost any other language you might happen to be familiar with would serve just fine. Simply implement our pseudocode in your favorite language. This might (if you are at all familiar with your language) involve a few hours at most (as we actually checked!).
Unfortunately (or not; it depends on your perspective), there is no such thing as a pseudocode standard. So we roll our own. It is perhaps something like formal pseudocode (i.e., not Fortran, Pascal, C, Basic, and so on style pseudocode). So you will see things like the following.
Notice that ignoring what is in between the COMMENT and END COMMENT braces is not going to hurt you. It will have no consequences for the eventual algorithmic implementation. The idea is that the notation is self-documenting, so perhaps occasionally taking notice of comments might be a good idea, as they may have been inserted for some reason. But, in principle, COMMENT means that the lines until END COMMENT can be safely skipped. On the other hand, the DO part “increment counter” is crucial. Failing to increment the counter—whatever that may mean—is surely going to hurt you. A lead to what it might mean can often be gleaned from the context or comments. DO is followed by something that has to be done. Other comments also come as pairs of braces, such as
and so forth. We use indentation to highlight the inclusion structure.
is a function that encapsulates a computation. Notice that the parameter section might be empty, as for a function COIN_FLIP() that returns heads or tails. A program is a collection of functions. One of these is supposed to deliver the final result.
Of course, there are numerous decisions in implementations that certainly may make a difference but can hardly be counted as our business. Here is a simple example. Images are represented in numerous ways in computer memories. But how the bit planes are ordered, and so forth, is not our business. A monochrome point (“pixel”) can be represented by a byte, integer, float, double, and so forth. This is not our business either, but it makes a difference. We will need Fourier-based methods. Whether one uses the latest fast Fourier transform (FFT; Heideman, Johnson, & Burrus,
1985) implementation, the old-fashioned Filon integration (Abramowitz & Stegun,
1972), or something rolled oneself is not our business. But, again, choices often make a difference! Often in computation time, sometimes in precision. They may have distinct limits of applicability. As said before, such technicalities will be skipped here.
2. The basic deterministic structures
The simplest representation is as a number of progressively blurred images at discrete scale levels separated by factors of two (coarse) or square root of two (almost always good enough). The highly blurred images may be subsampled spatially without significant loss, but this is usually inconvenient and memory is cheap.
For a 512 × 512 image (for example), the range of scales would run from 1 (pixel size) to about 128 (one quarter of the image size). With a square root of two factor, that implies more than a dozen levels.
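The level count follows directly from the scale range and the separation factor; a one-line Python check (hypothetical name) confirms the "more than a dozen" figure:

```python
import math

def n_scale_levels(finest, coarsest, factor):
    """Number of discrete scale levels from `finest` to `coarsest`
    (inclusive) when successive levels are separated by `factor`."""
    return int(round(math.log(coarsest / finest) / math.log(factor))) + 1
```

For scales 1 through 128 with a square root of two factor this gives 15 levels; with a factor of two, only 8.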
Building scale space implies
One typically uses FFT methods to do this (the demo uses JTransforms 2015), but Mathematica enables you to simply say “blur the image by so much,” which captures the conceptual content in a direct way. You may want to do some additional housekeeping here—for instance, handle boundary effects in some preferred way, subsample the highly blurred layers, and so forth. We put a few hints at such issues in the comments.
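As a rough one-dimensional stand-in for the pseudocode (Python, hypothetical names; border clamping is one of several reasonable boundary policies), building the scale space amounts to progressive Gaussian blurring:

```python
import math

def gaussian_kernel(sigma):
    """Normalized, truncated Gaussian kernel of standard deviation sigma."""
    r = max(1, int(3 * sigma))
    k = [math.exp(-i * i / (2 * sigma * sigma)) for i in range(-r, r + 1)]
    s = sum(k)
    return [v / s for v in k]

def blur(signal, sigma):
    """Convolve with a Gaussian; borders handled by clamping."""
    k = gaussian_kernel(sigma)
    r = len(k) // 2
    n = len(signal)
    return [sum(k[r + i] * signal[min(max(t + i, 0), n - 1)]
                for i in range(-r, r + 1)) for t in range(n)]

def build_scale_space(signal, n_levels, factor=math.sqrt(2)):
    """Level i is the fiducial signal blurred at scale factor**i."""
    return [blur(signal, factor ** i) for i in range(n_levels)]
```

The same structure carries over to 2-D images by blurring separably along rows and columns.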
Building the difference scale space implies
Notice that after constructing the difference scale space the scale space itself can be deleted because it can be regained from the difference scale layers.
One catch to be aware of is the DC level because DOG receptive fields are not sensitive to that. In practice, adding a constant suffices, so this is not a problem. You simply retain the coarsest scale-space layer. Thus:
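In the same spirit, a minimal Python sketch (hypothetical names; layers as plain lists) of the difference scale space and its trivial synthesis, with the coarsest layer retained so the DC content survives:

```python
def difference_scale_space(scale_space):
    """Successive differences of scale-space levels (DOG layers), plus the
    coarsest blurred level itself so the DC content is not lost."""
    diffs = [[a - b for a, b in zip(fine, coarse)]
             for fine, coarse in zip(scale_space, scale_space[1:])]
    return diffs, scale_space[-1]

def synthesize(diffs, coarsest):
    """Plain summation recovers the finest scale-space level exactly."""
    out = coarsest[:]
    for layer in reversed(diffs):
        out = [o + d for o, d in zip(out, layer)]
    return out
```

Because the sum telescopes, the scale space itself can indeed be deleted once the difference layers and the coarsest layer are stored.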
The basic ingredient is the Gaussian noise image:
The eidolon factory requires a great many of such images, all mutually independent. For instance, a displacement vector field requires two:
It is a simple matter to impose disarray on a given image:
Notice that the “image” here will usually be a difference scale-space layer. You will perturb many such layers before combining them in the final synthesis. Notice also that there are many additional uses for noise fields. For instance, instead of or in addition to spatial disarray, you may want to perturb the gain, orientation, and so forth of a receptive field. Although we do not consider this in this article, here lies an important field of enquiry.
This is how you construct fractal disarray:
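As one hedged reading of fractal disarray (a one-dimensional Python sketch with hypothetical names; the exact per-level weighting in the factory may differ), independent noise is generated at every scale level, smoothed to that level's scale, and summed: coarse levels shift whole regions coherently while fine levels jitter single pixels.

```python
import math, random

def blur1d(signal, sigma):
    """Gaussian smoothing with clamped borders."""
    r = max(1, int(3 * sigma))
    k = [math.exp(-i * i / (2 * sigma * sigma)) for i in range(-r, r + 1)]
    s = sum(k)
    n = len(signal)
    return [sum(k[r + i] / s * signal[min(max(t + i, 0), n - 1)]
                for i in range(-r, r + 1)) for t in range(n)]

def fractal_displacement(n, n_levels, amplitude, rng):
    """One component of a fractal displacement field: sum independent
    noise over scale levels, each smoothed to its own scale."""
    field = [0.0] * n
    for level in range(n_levels):
        sigma = math.sqrt(2) ** level
        noise = blur1d([rng.gauss(0.0, amplitude) for _ in range(n)], sigma)
        field = [f + x for f, x in zip(field, noise)]
    return field
```

Two such fields (one for each coordinate) give a roughly self-similar disarray across the whole scale range.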
It is easy enough to implement such eidolons (essentially just the regular CAPTCHA or XKCD-emulation method): simply blur the fiducial image and sample it at locally displaced positions.
Indeed, nothing more complicated than that. The resulting image is both blurred and locally scrambled. Such eidolons are extremely simple to generate yet are already an interesting class for vision research—in fact, most likely a good starting platform in many cases.
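A one-dimensional Python sketch of this recipe (hypothetical names; our reading of "blurred and locally scrambled", not the authors' exact formula):

```python
import math, random

def blur1d(signal, sigma):
    """Gaussian smoothing with clamped borders."""
    r = max(1, int(3 * sigma))
    k = [math.exp(-i * i / (2 * sigma * sigma)) for i in range(-r, r + 1)]
    s = sum(k)
    n = len(signal)
    return [sum(k[r + i] / s * signal[min(max(t + i, 0), n - 1)]
                for i in range(-r, r + 1)) for t in range(n)]

def simple_eidolon(signal, sigma, max_shift, rng):
    """Blur the fiducial signal, then sample it at randomly displaced
    (border-clamped) positions: blurred and locally scrambled at once."""
    blurred = blur1d(signal, sigma)
    n = len(signal)
    return [blurred[min(max(t + int(round(rng.uniform(-max_shift, max_shift))),
                            0), n - 1)]
            for t in range(n)]
```

Two intuitively meaningful parameters, the blur scale and the maximum displacement, span the whole family, which is what makes this such a convenient starting platform.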
Differential geometry: Local geometry, defined by application to regions of interest that have the same size as the operators (e.g., edge detectors).
Edgelet: Local component of an edge. In differential geometry, edges are considered as a string of spatially contiguous and aligned edgelets.
Eidolon: Class of stimuli that are equivalent to a given fiducial stimulus along a given perceptual continuum. Stimuli that are metameric are eidolons too, but the definition extends to stimuli that are perceptually equivalent along a given dimension while still being distinguishable in other aspects. Notice that equivalence is defined in a phenomenological sense, and consequently it is subjective in nature.
Eidolon factory: Algorithm that can be used to modify images. Its parameterization defines the physical space in which perceptual equivalence can be established through psychophysical methods. Many eidolon factories are possible beyond the one we introduce in this work.
Local sign: Positional signature (German Lokalzeichen). Psychophysical bridge between neural representation and awareness of position.
Metamer: Class of stimuli that are perceptually indistinguishable under some specific viewing condition.
Modulation transfer function (MTF): Let v be a given spatial frequency of a grating stimulus, C0 the physical contrast of the stimulus, and Ci the transferred contrast (i.e., the contrast after the stimulus has been transferred through an optical device, or the effective contrast in a visual system); then MTF(v) = Ci/C0.
Psychogenesis: Process by means of which a mental state comes to be. In the present study, we are primarily referring to the process leading to visual awareness when human observers view an image.
Tarachopia: Scrambled vision. Concept proposed by Hess (
1982) to characterize the phenomenology of amblyopia as well as the observation that amblyopia affects pattern discrimination to a larger extent than simple visual detection.
Translation invariance: Indicates that the measurement of a property is independent of the location at which the measurement takes place. Specifically, in the case of Fourier transform, it indicates that the amplitude spectrum is identical if the image is shifted.