Burnham and Anderson (2002) recommend a multimodel approach in which we use the information in
Figure 7 to eliminate features that have little support and find combinations of features that best account for the data. In single feature models, we found that the length of the longest repeating subsequence and the number of repetitions dominate the two other features regardless of the probability of disruption. Among two-feature models, 12, 13, and 23, are best supported by data in the LD and HD conditions. In the condition MD, the two-feature models 14 and 24 were also equally supported. Given these results, unsurprisingly, 123 is the best supported among three-feature models in all conditions.
In terms of model parsimony, to what extent the model can be improved by adding one more variable is important. The evidence ratios between the best single feature model and the best two-feature model for the LD, MD, and HD conditions were 59.2, 3.0, and 12.8, respectively, whereas the evidence ratios between the best two-feature model and the best three-feature model were 3.1, 2.0, and 5.4, respectively. At least two features are needed to adequately fit the data. Adding one more feature to the two-feature model does not substantially increase the model fitting performance. We tentatively conclude that observers overall are using two features of the four we consider. To summarize, F1 (the longest repeating subsequence) and F3 (number of repetitions) are the critical features to capture human performance in discriminating patterns.