This study involved 32 participants across two sites (mean age = 31.38 ± 11.07; mean ± SD), with a 50:50 split of males and females. The Exercise group had nine males and seven females, and the Non Exercise group had nine females and seven males. Data were combined across both sites for analyses. The mean %HR for the step test of the Exercise and Non Exercise groups were 0.62 ± 0.14 and 0.68 ± 0.11 (mean ± SD), respectively. The mean starting letter sizes of the Exercise and Non Exercise groups were 0.94 ± 0.02 and 0.92 ± 0.02 (mean ± SD), respectively.
Analyses were performed using R Version 4.3.1 (
R Core Team, 2023). A visual inspection of
Q-
Q plots indicated that some portions of the dataset were not normally distributed—baseline measures: top condition ratios (
W = 0.95,
p = 0.13) and bottom condition ratios (
W = 0.75,
p < 0.001); post-measures: top condition ratios (
W = 0.93,
p = 0.04) and bottom condition ratios (
W = 0.78,
p < 0.001). Transformation did not help to normalize the dataset so non-parametric analyses were used, supplemented by Bayes factors where possible (calculated using JASP) (
JASP, 2024).
Two-sample paired Wilcoxon tests were run to compare performance between baseline measures and post-measures of both groups in the trained location (top). Both the Exercise group (
W = 66,
p < 0.001, effect size
r = 0.98 [
BF10 = 313.77,
W = 134,
R-hat = 1.10], median = 2.28 [pre-] and 1.57 [post], interquartile range [IQR] = 1.47–2.63 [pre-] and 1.29–1.79 [post]) and the Non Exercise group (
W = 54,
p = 0.003, effect size
r = 0.73 [
BF10 = 22.72,
W = 122,
R-hat = 1.05], median = 2.02 [pre-] and 1.60 [post], IQR = 1.84–2.41 [pre-] and 1.33–1.92 [post]) showed a difference between baseline and post measures indicating an effect of PL. However, a Wilcoxon rank-sum test showed no statistical difference between the pre- and post differences of the two groups (
W = 112,
p = 0.56, effect size
r = 0.10 [
BF10 = 0.45,
W = 112,
R-hat = 1.00]), indicating no effect of alternating exercise and visual training. See
Figure 3 for a visual representation.
Linear regression models were fitted to each participant's daily sessions to obtain slopes determining the rates of learning (see
Figure 4). Slope values representing task performance across daily sessions were calculated using the daily mean. A comparison between the groups did not show a statistical difference between the rates of learning (mean slope and SD values for the Exercise and Non Exercise groups were −0.09 ± 0.14 and −0.14 ± 0.08, respectively;
t = −0.31,
p = 0.76).
To determine if there was an effect of biological sex, the dataset was split by sex, and two-sample paired Wilcoxon tests were run for each group (see
Figure 5). There was no difference in the amount of task improvement between males and females for either the Exercise group (
W = 20,
p = 0.25, effect size
r = 0.29 [
BF10 = 0.74,
W = 43.00,
R-hat = 1.00], median= −0.63 [male] and −0.43 [female], IQR = −1.17 to −1.37 [male] and −0.70 to −0.18 [female]) or the Non Exercise group (
W = 26,
p = 0.61, effect size
r = 0.13 [
BF10 = 0.51,
W = 37.00,
R-hat = 1.00], median = −0.51 [male] and −0.44 [female], IQR = −0.78 to −0.31 [male] and −0.59 to −0.22 [female]).
To determine if the extent of VPL transfer was affected by exercise, performance for the bottom condition was analyzed (see
Figure 6). Both the Exercise group (
W = 28,
p = 0.007, effect size
r = 0.67 [
BF10 = 33.027,
W = 96,
R-hat 1.03], median = 1.50 [pre-] and 1.17 [post], IQR = 1.21 to 1.79 [pre-] and 1.11 to 1.50 [post]) and Non Exercise group (
W = 62,
p = 0.001, effect size
r = 0.80 [
BF10 = 122.476,
W = 130,
R-hat = 1.02], median = 1.86 [pre-] and 1.52 [post], IQR = 1.59 to 1.92 [pre-] and 1.34 to 1.75 [post]) showed a difference between baseline and post measures indicating a transfer of VPL. However, a Wilcoxon rank-sum test indicated that there was no statistical difference in improvement between the two groups (
W = 148,
p = 0.462, effect size
r = 0.13 [
BF10 = 0.38,
W = 148,
R-hat = 1.00], median = −0.17 [Exercise] and −0.29 [Non Exercise], IQR = −0.34 to −0.06 [Exercise] and −0.47 to −0.14 [Non Exercise]).
To determine whether the amount of VPL differed across location, the pre- and post differences in VPL for the top (trained location) and bottom (untrained condition) were compared with Wilcoxon rank-sum tests. A larger VPL (median difference = −0.30) was observed for the Exercise group compared with the Non Exercise group (median difference = −0.12). For the Exercise group, W = −55.0, p = 0.003, effect size r = 0.75 [BF10 = 31.47, W = 13.0, R-hat = 1.03], median = −0.61 (top) and −0.17 (bottom), IQR = −0.87 to −0.23 (top) and −0.34 to −0.06 (bottom). For the Non Exercise group, two zero-difference pairs were removed from analyses and the Wilcoxon signed-rank test was completed with 14 pairs, such that W = −17.5, p = 0.003, effect size r = 0.28 [BF10 = 0.48, W = 35.0, R-hat = 1.00], median = −0.46 (top) and −0.29 (bottom), IQR = −0.76 to −0.18 (top) and −0.36 to −0.13 (bottom). However, these differences between the Exercise and Non Exercise groups (16 pairs) were not statistically significant (W = 81.0, p = 0.08, effect size r = 0.44 [BF10 = 0.766, W = 81.0, R-hat = 1.00], median = −0.30 [Exercise] and −0.05 [Non Exercise], IQR = −0.60 to −0.15 [Exercise] and −0.27 to −0.06 [Non Exercise].