The data from two participants were removed from the analyses because they were deemed extreme outliers. One participant from each condition deviated more than 3 standard deviations from the mean in rate of change in average speed scores and were therefore removed from all analyses. The final number of participants used in the analyses was 38; 19 participants in the Feedback condition and 19 participants in the No Feedback condition.
A one-way between-subjects ANOVA revealed that a difference in baseline MOT capability was statistically detectable between the two feedback conditions. Specifically, the group of participants that was assigned to the No Feedback condition could track targets faster (
M = 98.92,
SD = 25.01) compared to the group that was assigned to the feedback condition (
M = 81.55,
SD = 20.09):
F(1, 36) = 5.57,
p = 0.024, partial η
2 = .13, 95% CI [0.00, 0.37] (
Figure 3a).
To account for these a priori differences in baseline MOT capability, even though participants were randomly assigned to feedback conditions, we calculated the standardized change between baseline and D4. For instance, the standardized change consists of the difference between the new score and the baseline score, divided by the standard deviation of baseline scores for all participants: standardized change in MOT = (D4 MOT score − baseline MOT score) / standard deviation of MOT at baseline. A standardized change score considers the variance from the sample and can therefore account for an imbalance in baseline means between the two conditions. This descriptor is used throughout the field of cognitive training to examine improvement or change on cognitive measures (see
Jaeggi, Buschkuehl, Jonides, & Perrig, 2008, for example).
We conducted a one-way between-subjects ANOVA using this metric to assess whether the presentation of feedback positively impacted learning the attention-based task. Our results revealed that the feedback group had a larger standardized change (
M = 1.17,
SD = 1.26) compared to the group that did not receive feedback (
M = 0.19,
SD = 1.35):
F(1, 36) = 5.33,
p = 0.027, partial η
2 = .13, 95% CI [0.00, 0.36] (
Figure 3b). A change of this magnitude suggests that those that received feedback improved by more than one full standard deviation, relative to the entire sample. Furthermore, the baseline means between conditions are separated by a z-score of 0.25. Therefore the change from baseline to D4 for the feedback group was more than four times larger than the distance between conditions at baseline.
Next, two paired samples
t-tests (one per condition) were conducted to examine whether MOT capability improved for either condition. The results revealed that the those in the Feedback condition significantly improved from baseline (
M = 81.55,
SD = 20.09) to D4 (
M = 109.63,
SD = 33.26):
t(18) = 4.03,
p < 0.001, Cohen's
d = 0.92, 95% CI [0.37, 1.46], whereas those in the No Feedback condition did not improve from baseline (
M = 98.92,
SD = 25.01) to D4 (
M = 103.47,
SD = 32.86):
t(18) = 0.61,
p = 0.549, Cohen's
d = 0.14, 95% CI [−0.31, 0.59] (see
Figure 3a). Notably, this study would not be the first to demonstrate an improvement in MOT performance with the inclusion of feedback.
Previous research has examined the change in MOT capability across multiple sessions (
Faubert, 2013;
Legault, Allard, & Faubert, 2013;
Tullo, Guy, Faubert, & Bertone, 2018). Although the current study is the first to isolate the effect of feedback on MOT performance, the aforementioned studies demonstrated a learning effect for MOT with the inclusion of feedback. For instance,
Tullo, Guy et al. (2018) examined the efficacy of MOT to improve performance on a separate, clinically validated measure of attention for children and adolescents diagnosed with neurodevelopmental conditions. This study demonstrated that repeated practice on the MOT paradigm improved performance by a standardized change of 0.62 (i.e., 41% increase as reported in the paper) after 15 sessions. The data from
Tullo, Guy et al. (2018) suggests an improvement as early as four sessions. Moreover,
Faubert (2013) demonstrated that the learning rates for MOT differed across levels of athleticism in a typically developed adult population. Similarly, the data from
Faubert (2013) suggests a significant improvement after 15 sessions and as early as four sessions. Lastly,
Legault et al. (2013) examined the differences in learning MOT between younger (i.e., ages 22–34) and older adults (i.e., ages 61–74). The study concluded that both groups significantly improved after five total training sessions. Therefore the contextualization of performance through the use of the standardized change score and the evidence of improvement on MOT performance with the presence of feedback from related work suggests that tracking capability can be improved or learned.
To compare the learning trajectories of the MOT task across the four testing sessions: (i) we mapped average speed scores onto a logarithmic curve, which has been characterized as a typical learning curve by previous research describing learning effects on a MOT task (
Faubert, 2013;
Tullo, Guy, et al., 2018) (ii) compared the trends using regression models. The feedback condition's trajectory followed this pattern by the equation: y = 20.019ln(x) + 85.152 at
R2 = 0.85, whereas the group that did not receive feedback did not map onto this pattern as clearly: y = 6.1488ln(x) + 96.456 at
R2 = 0.26. This discrepancy suggests that the MOT capability across a period of four days followed a pattern similar to a typical learning curve, whereas the group that did not receive feedback did not (
Figure 3c).
To examine whether learning trajectories differed between feedback conditions, we conducted two multiple regression analyses. The first model, consisted of (i) the log of testing day (i.e., one through four), (ii) feedback condition, and (iii) the interaction between feedback condition and log of testing day predicting the log of the average speed score. However, this model was not statistically detectable: F(3, 148) = 2.61, p = 0.05, R2 = 0.05, Adjusted R2 = 0.03. Next, we conducted a hierarchical regression to examine the effect of feedback condition above and beyond baseline performance. The first step of regression model was statistically detectable: F(1, 36) = 6.10, p = .018, R2 = .14, Adjusted R2 = .12; where baseline threshold was a significant predictor of the standardized change in MOT performance from D4 to baseline: b = −0.02, t(36) = −2.47, p = 0.018. Adding condition as the next step resulted in a statistically detectable model: F(2, 35) = 4.39, p = 0.020, R2 = 0.20, Adjusted R2 = 0.16. However, the value added to the model with feedback condition was not statistically detectable: F(1, 35) = 3.14, p = 0.085, nor were the predictors baseline: b = −0.02, t(35) = −1.77, p = 0.085, and condition: b = −0.69, t(35) = −1.56, p = 0.127. The third and final step of the model examined whether (i) baseline speed scores, (ii) feedback condition, and (iii) the interaction between baseline scores and condition predicted the standardized change in MOT performance. This model was statistically detectable: F(3, 34) = 2.93, p = 0.047, R2 = 0.20, adjusted R2 = 0.14; nevertheless, the value added by the interaction predictor was not statistically significant: F(2, 34) = 5.40, p = 0.021, nor were there any statistically detectable predictors.
The results from the regression models demonstrate some inconsistency in the findings for Study 2. For instance, baseline tracking capability was associated with improvement in MOT performance. This finding coupled with the imbalance in tracking capability favoring the no feedback condition, suggests that more evidence is required to conclude that feedback improves performance on the attention-based task. Moreover, comparing the change in performance between testing sessions adds to the mixed findings.
We compared the rate of learning across daily sessions between the two conditions (see
Figure 3d). We conducted a two-way mixed method ANOVA, where the standardized change interval of (i) D2 versus baseline, (ii) D3 versus baseline, and (iii) D4 versus baseline as the within-subjects factor and feedback condition as the between-subjects factor. The results of the ANOVA did not reveal a statistically detectable interaction between standardized change and condition
F(2, 72) = 1.22,
p = .301, partial η
2 = .03, 95% CI [0, .13], nor a main effect of standardized change:
F(2, 72) = 1.39,
p = .255, partial η
2 = .04, 95% CI [0, .14]; however, there was a between-subjects main effect of condition:
F(1, 36) = 6.04,
p = .019, partial η
2 = .14, 95% CI [.00, .38]. Moreover, we conducted two separate post-hoc one-way ANOVAs examining the differences in standardized change intervals between conditions. Our results revealed that those receiving feedback had a greater standardized change between D2 and baseline:
F(1, 36) = 6.60,
p = .015, partial η
2 = .15, 95% CI [.01, .39] and D4 and baseline (see above). There was no statistically detectable difference in the standardized change of D3 and baseline between conditions:
F(1, 36) = 1.53,
p = .225, partial η
2 = .04, 95% CI [0, .23] (
Figure 3d). Once again, these results support the notion that feedback is beneficial for learning on an attention-based task and that it took the group that did not receive feedback two testing days to achieve a statistically similar rate of change as the individuals that received feedback.
Taken together, these results, which indicate that exposure to feedback positively influenced learning on the MOT task, must be interpreted with caution, and this research question warrants further investigation. For example, future work is encouraged to explore the effect of this exposure by adding more testing sessions. Adding more than four training sessions would allow for an examination of the learning trajectories between conditions with higher-level statistical modelling. Overall, exploring the benefits of feedback in an attention-based task such as MOT would contribute to the knowledge that feedback aids performance from previous research examining this effect in perceptual-learning to language- and memory-based tasks (
Ashby et al., 1998;
Ashby & O'Brien, 2005;
Herzog & Fahle, 1998,
1998;
Maddox et al., 2004;
Patalano et al., 2001;
Roelfsema et al., 2010;
Shibata et al., 2009;
Waldron & Ashby, 2001;
Wilbert et al., 2010;
Zeithamova & Maddox, 2006). Therefore Study 2, which is the first to isolate feedback with repeated practice on an attention-based task, creates an avenue for future research to explain how attention is needed to consolidate the information provided by feedback.
Compared to Study 1, where there was no time to consolidate feedback between testing days, the 4-day learning period may have provided the participants with the opportunity to consolidate the feedback they were provided between daily sessions. The day long gap between testing can allow for the proper processing of
Zeithamova and Maddox's (2006) steps for feedback integration. This is perhaps best evidenced by the sharp increase in capability once feedback is introduced to half the participants. Although the time between the presentation of feedback and the start of the next trial and/or testing session was not controlled for here, and given MOT's ability to highlight feedback-specific load, future research could explore what the optimal condition is for the presentation of feedback and the start of the next trial.