One possible concern with our results is that none of the observers in the experiment were completely naïve, and that their knowledge of the model might have influenced their color matches. Specifically, all of the observers knew that the model predicted six categories of color matches in the entire study. However, in a given noise condition, and thus in a given color matching session, the model actually predicts four, five, or six mechanisms contributing to thresholds (and thus four, five, or six clusters of color matches), with some individual differences. Two of the observers (CLM and SAF) did not know which noise conditions were expected to generate four, five, or six clusters of matches, and they were also not told which noise condition they were being exposed to in any given session (although they could probably guess when the 64° noise was used, since its appearance is quite different from the 42° and 48° noises). Nonetheless, in every noise condition, the number of color match clusters in a particular condition agreed perfectly with the number of mechanisms predicted from that observer's own detection model, whether that was four, five, or six clusters. Most importantly, none of the observers knew which test stimulus was presented at any given time, yet the color matches were almost perfectly consistent with the particular color mechanism that the model asserts detected a given test in a given noise condition.
Putting this all together, the worst-case effect of the observer's knowledge would be that, rather than actually matching the test, they selected a memorized color category that was most similar to the test. We do not believe this happened, but if it did the memorized color would have to be selected on the basis of the test that was seen, since the observers did not know which test was presented. Thus, the most extreme possible effect of their knowledge would be to reduce the variability of the color matches in the clusters, making the task more like color naming than color matching. Even in this case, the pattern of these hypothetical categorical matches would have to consistently change with noise condition in exactly the same way as the detection mechanisms. Our conclusion is that the knowledge of the observers is unlikely to have affected the outcome of this study in a major way.