After successful validation of our analysis pipeline, we next turned to our empirical confidence ratings. Prior to applying our analysis pipeline to investigate insight into reality monitoring errors, we first checked whether there were any differences in metacognitive efficiency between the conditions using the standard hierarchical meta-
d′ model (
Fleming, 2017). This analysis revealed no significant differences in metacognitive efficiency (meta-
d′/
d′) between no imagery (95% HDI = 0.53 to 0.67), congruent imagery (HDI = 0.47 to 0.65), and incongruent imagery (HDI = 0.48 to 0.65), indicating that metacognitive sensitivity was similar between the conditions (
Supplementary Fig. S1). We next investigated the effects of condition on both confidence and the asymmetry in confidence criteria to characterize metacognitive insight (
Figure 2).
We first ran a repeated-measures analysis of variance (ANOVA) test on the confidence ratings from all conditions, including only data from participants with misses and false alarms in all conditions (
N = 59). This revealed no significant effects of condition (
Figure 4A). Furthermore, directly comparing confidence in presence and absence responses, irrespective of input, between no imagery to congruent imagery qualitatively followed the no-insight pattern, with higher confidence for presence responses and lower confidence for absence responses during congruent imagery (
Figures 4A,
4B), but there was no significant interaction between condition and response (
t(101) = 0.94,
p = 0.35). Fitting our extension of the hierarchical meta-
d′ model to our data provided a good fit (
Supplementary Fig. S2). The results showed that the asymmetry between the negative and positive confidence criteria was centered around 0, in line with no insight (
Figure 4C;
M = −0.07, 95% HDI = −0.33 to 0.22). Moreover, the relative confidence criteria were comparable between the imagery and no-imagery conditions, showing that they indeed shifted in tandem with the perceptual criterion shift (95% HDI Δ
c2 +
no_imagery−Δ
c2 +
congruent_imagery = −0.3 to 0.28 and 95% HDI Δ
c2 –
no_imagery−Δ
c2 –
congruent_imagery = −0.29 to 0.26). Finally, given that reaction time often (negatively) correlates with confidence (
Rahnev et al., 2020), we performed an exploratory analysis to investigate the effects of condition on reaction time. There was a significant interaction between condition and response (
t(101) = −2.07,
p = 0.041,
d = 0.26,
CI = −0.067 to −0.001;
Figure 4D), with faster presence responses during congruent imagery (
t(101) = 2.17,
p = 0.032,
d = 0.22;
CI = 0.282–0.696).
Given that first-order performance was higher than anticipated in Experiment 1, potentially due to our staircasing procedure, and that some results were unclear, we decided to repeat the experiment in an independent sample with a slightly different staircasing procedure (see Materials and Methods for more details) to ensure that our findings are replicable. The experimental procedure was mostly identical to the first but also included global insight questions at the end (see section below). After exclusion, data from 111 participants were analyzed. We first again replicated the decision-level effect, showing a decrease in criterion for congruent imagery (
M = 0.23,
SD = 0.50) compared to no imagery (
M = 0.35,
SD = 0.53,
t(110) = 2.48,
p = 0.015,
d = 0.22, 95%
CI difference = 0.02–0.21) but not for incongruent imagery (
M = 0.32,
SD = 0.56,
t(110) = 0.813,
p = 0.42,
d = 0.06, 95%
CI difference = −0.04 to 0.10). In contrast, there was a significant decrease in
d′ for incongruent imagery (
M = 1.56,
SD = 1.25), compared to no imagery (
M = 1.72,
SD = 1,
t(110) = 2.16,
p = 0.033,
d = −0.14, 95%
CI difference = 0.01–0.32), but not for congruent imagery (
M = 1.75,
SD = 1.27,
t(110) = −0.27,
p = 0.79,
d = 0.02, 95%
CI difference = −0.2 to 0.15). These results again show that participants were more likely to indicate perceptual presence when imagining the same stimulus, whereas imagining a different stimulus merely decreased performance. Finally, similar to Experiment 1, there were no significant differences in metacognitive efficiency between no imagery (HDI = 0.44 to 0.59), congruent imagery (HDI = 0.37 to 0.55), and incongruent imagery (HDI = 0.45 to 0.61;
Supplementary Fig. S1).
We next again first ran a repeated-measures ANOVA on the confidence ratings from all conditions, including only data from participants with misses and false alarms in all conditions (N = 59). This time, we did find a significant main effect of condition (F(182, 2) = 6.64, p = 0.002, ηp2 = 0.068). Post hoc tests revealed that this effect was driven by a decrease in confidence in the incongruent imagery condition (M = 52.1, SD = 15.2) compared to both no imagery (M = 53.5, SD = 15.5, t(91) = 2.28, p = 0.79, d = 0.02, 95% CI difference = −0.2 to 0.15) and congruent imagery (M = 1.75, SD = 1.27, t(110) = −0.27, p = 0.79, d = 0.02, 95% CI difference = −0.2 to 0.15). There was no significant difference between congruent imagery and no imagery in confidence ratings (t(91) = 1.322, p = 0.188, d = −0.138, 95% CI difference = −0.25 to 0.74). This decrease in confidence during the incongruent condition presumably reflected the decrease in performance in that condition. Interestingly, there was now also a significant interaction between condition and response (Huyn–Feldt corrected F(172.48, 1.9) = 3.937, p = 0.023, ηp2 = 0.041). Post hoc analyses revealed that confidence of presence responses, not absence responses, was higher for congruent imagery (M = 53.94, SD = 17.2) compared to no imagery (M = 51.19, SD = 17.83, t(91) = 2.71, p = 0.008, CI = 0.73–4.78, d = 0.16) and incongruent imagery (M = 50.15, SD = 17.9, t(91) = 4.3, p = 0.0004, CI = 2.04–5.55, d = 0.22).
In line with this, directly comparing confidence for presence and absence responses irrespective of input between only no imagery and congruent imagery revealed a significant interaction between condition and response on confidence ratings (
t(110) = 2.302,
p = 0.023,
d = −0.30,
CI = 0.30 to 4.05). Post hoc paired
t-tests revealed that confidence of presence responses, not absence responses, was higher for congruent imagery compared to no imagery (
t(110) = −2.37,
p = 0.019,
d = −0.23,
CI = −2.71 to −0.24;
Figures 4E,
4F), in line with no insight (cf.
Figure 2C). We next quantified metacognitive insight using our extension of the hierarchical meta-
d′ model. In line with the results from Experiment 1, we again found that the HDI was centered around 0, indicating that the confidence criteria moved along with the decision-level criterion, in line with no insight (
Figure 4G,
M = −0.14, 95% HDI = −0.44 to 0.15). Furthermore, the relative confidence criteria were again comparable between the imagery and no-imagery conditions (95% HDI Δ
c2 +
no_imagery−Δ
c2 +
congruent_imagery = −0.26 to 0.27 and 95% HDI Δ
c2 –
no_imagery−Δ
c2 –
congruent_imagery = −0.26 to 0.33). Finally, there was also a significant interaction between condition and response on reaction time (
t(110) = −3.65,
p = 0.004,
d = 0.37,
CI = −0.065 to −0.019), with faster presence responses during congruent imagery compared to no imagery (
t(110) = 2.14,
p = 0.035,
d = 0.20,
CI = 0.002 to 0.042), again in line with no insight.
Taken together, even though the results from the frequentist analyses of the confidence ratings were inconclusive, the qualitative pattern of results over the two experiments is in line with a no-insight model (cf.
Figure 2C).