(A) Left: The object-map model outperforms the ITTI* significantly above chance when using the classic AUC score (AUC Type-1;
Supplement) aligned with the original analysis of Einhäuser et al. (
2008) (
t test,
p < 0.05; See also
Figure S2A). Right: Using the shuffled AUC score (sAUC; AUC Type-3;
Supplement), which discounts center bias in eye data, every difference between the object map and other models is statistically significant except the ITTI* model. This result shows the importance of appropriate tackling of center bias in fixation data (our first analysis; no significant difference between the object-map model and the ITTI* model). With newer saliency models or even with the original ITTI98 model (with a different normalization scheme than the newer ITTI model resulting in smoother maps), image-based outliers predict fixations significantly better. AWS model scores the best. MEP and random models score lowest about 0.5. This supports our second analysis about choosing the right model for data analysis (i.e., dependency of conclusions on the used model). To build a Gaussian of sigma size 0.28° × 0.28° on this dataset, we used this Matlab script: myGauss = fspecial(‘gaussian',50,10) where 50 and 10 are image size and standard deviation of the Gaussian, respectively. To build the normal random model, we used the Gaussian shown in (A) and upsampled it to 1024 × 768 pixels (original image size presented at 29° × 22° in Einhäuser et al.'s,
2008, study, thus ∼ 35 pixel/°), resulting in 5.85° × 4.39°. (B) Gaussian blobs (kernels) of three different sizes were added to model prediction maps (shown here for five models). Gaussian kernels are built with size
x and image size 200 using this Matlab script: myGauss = fspecial(‘gaussian',200,
x) where
x ∈ {10, 30, 50} which leads to these sizes in degrees: 1.45° × 1.1°, 4.35° × 3.3°, and 7.25° × 5.5° for the used dataset. Each prediction map of a model was smoothed by convolving with a Gaussian filter (for the shown image). Gaussian sizes for smoothing are: 0.28° × 0.28°, 0.86° × 0.86°, and 1.43° ×1.41°. (C) Prediction accuracies of models using the sAUC score: (left) center bias added and (right) smoothed saliency maps. Significance values are according to Bonferroni-corrected
t test (
α = 0.05/5 = 0.01). By adding center bias, the object-map model is significantly above the ITTI model but not the ITTI98 model. Adding center bias does not dramatically change prediction power of models. With smoothing, the object-map model is significantly below the ITTI98 model. Smoothing more rises the accuracy of the ITTI model to the point that there is no longer a significant difference between this model and the object-map model. There is also no significant difference between the object-map model and ITTI* model with small amount of smoothing while with large amount of smoothing the ITTI* model outperforms the object-map model significantly. This supports our third analysis on parameterization. Error bars indicate standard error of the mean (
SEM):
σ /
m0.5, where
σ is the standard deviation and
m = 93 is the number of images. Please refer to Borji, Sihite, and Itti (
2013) for details on fixation prediction models used here. See main text and
Supplement for more details.