Figure 2B is a scale–space map showing the gradient response pattern
L 1 across space (
x) and scale (
σ), given two Gaussian-blurred edges as input. How might the location, polarity, and blur of these edges be identified? The sign of the responses corresponds to the polarity of the edge (dark–light or light–dark), but the biggest response is always at the finest scale (1 pixel; top row of the map). This is true even for much larger blur (
Figure 2G). The degree of blur is implicit in how far the response extends through scale, and across space, but further analysis would be needed to quantify it. The edge would be explicitly identified, however, if the response pattern had a unique peak at the scale and location of the edge. Lindeberg (
1998) devised a powerful, general method of “scale selection” to achieve this goal. Lindeberg explored algorithms in which the response measures for localization and for scale selection could be different. We simplified his method by supposing that peaks in a single response surface (
Equation 4) should identify the position and scale of the features. The Gaussian derivative operators are multiplied by a scale-dependent gain factor
σ α to give a normalized scale–space representation,
N n :
By differentiating
N n with respect to
σ, it is straightforward to show that, for a given class of feature such as Gaussian edge, bar, or blob, one can choose the exponent
α such that the response always peaks at the true location and scale of the feature (Lindeberg,
1998). For Gaussian-blurred edges (
Figures 2A and
2F),
α =
n / 2 (where
n is an odd integer, implying odd-symmetric RFs).
Figures 2C and
2H show that with this multiscale representation of gradients,
N 1, there are unique response peaks at the correct locations and scales for the edges in the image. These edge features have been “made explicit” (Marr,
1982). Importantly, this rendering of edge finding as a simple peak-finding problem seems to reduce, or perhaps eliminate, the need for more elaborate algorithms that combine or “track” edge information across scales (Bergholm,
1987; Zhang & Bergholm,
1997).
Figure A1 (
1) shows how the normalization converts monotonic response profiles across scale into peaked ones that identify the edge blur.
1 analyzes some of the scaling properties of these scale-normalized derivatives.