The purpose of the present study was to investigate whether clutter, an image-based measure of visual complexity, can serve as a proxy for set size in real-world scene search. Rosenholtz et al. (
2005; see also Bravo & Farid,
2008) observed a relationship between several measures of clutter and search in complex images of maps. We sought to extend this work to search in photographs of real-world scenes, using both traditional search measures and eye movement behavior as dependent measures.
Overall, although there was some variability across clutter measures depending on the specific analysis, our results demonstrate that clutter does correlate with both global search efficiency (measured by search time and search failure) as well as with eye movement behavior during search. The latter result is novel and provides the first direct evidence that eye movements during scene search are influenced by the degree of clutter present in the scene.
It is interesting that edge density, despite failing to capture color variability in an image, does not do significantly worse than feature congestion and sub-band entropy in predicting search efficiency. In fact, edge density was the only clutter measure to significantly correlate with all of the reported dependent measures. This effect is generally consistent with the observation that eye movements during scene viewing are very similar for color and gray-scale versions of the same pictures (Henderson & Hollingworth,
1998). The edge density result is also interesting in light of the finding that edges correlate with fixation location in real-world scenes better than luminance contrast (Baddeley & Tatler,
2006) and the finding that color is not a strong correlate of fixation location (Tatler, Baddeley, & Gilchrist,
2005). As in the case of feature congestion, visual saliency is a more complex measure than edge density because it takes additional image variability including color into account. The present results converge with those Baddeley and Tatler (
2006) and Tatler et al. (
2005) in suggesting that the most important image property for predicting search efficiency at both a macro (e.g., response time) and micro (e.g., eye movement) level of analysis may be edges.
The influence of clutter in the present experiment, though statistically significant, generally accounted for a relatively small amount of the variance in each of the measures of search. Also, the influence of clutter was relatively small compared to the influence of set size typically observed in standard visual search tasks (Wolfe,
1998). Why was the influence of clutter not more pronounced? We suspect that there may be several reasons. The measures of clutter proposed by Rosenholtz and used here may be only an approximation of perceived clutter, which may also take into account other factors beyond those included in the three measures. In addition, other image effects such as crowding, masking, and so forth are very likely to play a role in determining search difficulty. Higher level image features related to scenes, such as those proposed by Torralba and Oliva (
2003) in their spatial envelope theory (e.g., degree to which the scene is open–closed, natural–artificial, and near–far), could also be important for search in natural scenes. Finally, higher level cognitive factors like the semantic consistency of the objects in the scene, scene coherence, the degree to which the scene is familiar, and so on may all play a role in search efficiency (Henderson,
2007). These higher level factors are clearly more difficult to capture with image-based indexes. Both additional image features and higher level factors would contribute error variance to search performance and so reduce the correlations of search with clutter.
In summary, we found that clutter correlates with search performance in real-world scenes. Furthermore, we have provided the first evidence that clutter also predicts eye movement characteristics during real-world search. These data converge with those presented by Rosenholtz et al. in showing that an image-based proxy for search set size can be related to search performance in real-world scenes.