In developing models for feature detection in machine vision and building on the pioneering work of Koenderink (
1984) and Koenderink and van Doorn (
1987), Lindeberg (
1998) showed elegantly how, for any given feature such as a blurred bar, blob, or edge, the amplitudes (gains) of a set of multiscale Gaussian derivative filters of suitable order (first, second, third…) could be chosen a priori to ensure that a peak response over filter scales would always occur in the filter whose scale equals that of the image feature in question. Thus, in Lindeberg's terminology, one can use
scale normalization (the setting of filter gains) to achieve automatic, image-driven
scale selection. This peak-finding scheme can identify the location, scale, and identity of image features—a crucial step in early visual coding. Georgeson et al. (
2007) showed how these principles could be used to model the process of localizing blurred edges and encoding edge blur in human vision. They evaluated two models called N1
+ and N3
+ that used multiscale first- and third-derivatives, respectively. (Note
: N denotes use of Lindeberg's scale-normalization; one or three indicates the order of derivatives used; + indicates the use of half-wave rectification on the output of the filters.) In the present paper we describe how this approach via Gaussian derivatives in scale-space can be extended to encode both bars and edges using even- and odd-symmetric filters, respectively (
Figure 1D; described in detail later). We suggest that these filters do not act as independent channels, nor are their responses combined (as in the local energy model; Morrone & Burr,
1988; Ross et al.,
1989), nor do they act competitively. Rather, it is a comparison of the even and odd filter responses that enables a decision about feature presence or absence. Such a comparison also formed part of the local energy model where it was used to evaluate local phase and classify energy peaks as either bars or edges. Here we make no use of the energy measure because it is (by definition) the smooth spatial envelope of the even and odd responses, and that turns out to exclude too many features that are actually perceived (Hesse & Georgeson,
2005).