We must highlight that the results in
Figure 9 are presented as a proof of concept rather than as a full-fledged model of dimension differentiation. Such a theory would require additional details in the model (e.g., modeling neural noise and a choice rule capable of handling such noise). Ideally, the theory would be able to explain additional results in the literature. For that to happen, the model might require a better formalization of representations in face space than what is presented in
Figure 9. That is, the model presented in the figure works by simply focusing on a couple of dimensions embedded within a large multidimensional space of face features. If such space were explicitly modeled, it would allow, for example, generating predictions not only about the results at the global average, but also at the category average (i.e., the bottom rows of
Figures 7 and
8, which were not simulated here). There are multiple ways in which a multidimensional face space has been modeled in the previous literature. For example, one can use a relatively arbitrary face space (
Ross et al., 2013), a space created through principal component analysis of actual face images (
Turk & Pentland, 1991), or a space of face-selective units at the end of a hierarchical feedforward model of the ventral visual stream (
Giese & Leopold, 2005;
Jiang et al., 2006). Multiple channels should be modeled rather than only two, with each channel’s tuning function, as well as internal noise, defined within the high-dimensional face space (
Giese & Leopold, 2005;
Ross et al., 2013). Finally, implementing the
dimension enhancement hypothesis in a multidimensional space would be more complex than in the simple model presented in
Figure 9, with the scaling mechanism influencing multiple channels distributed around the parent faces. The development of a full computational theory is outside the scope of the current study, and the model presented in
Figure 9 should be taken only as a working hypothesis.