To assess neurocognitive functionality, neuropsychologists challenge individuals in certain aspects of their performance using neuropsychological tests. Using these tests for neuropsychological diagnostics in applied research and clinical settings requires that measures of test performance can be obtained and interpreted efficiently and easily. Therefore, most readily available measures, such as reaction times and errors, are the variables most frequently used to derive test scores quantifying performance. One test widely used as an easily applicable screening of cognitive performance is the Trail Making Test (TMT) (
Reitan, 1958). In this test, participants connect a sequence of numbers (TMT-A) or alternating numbers and letters (TMT-B) via pencil on paper. Based on the time needed to complete the sequence, the participant’s level of executive functioning is evaluated (
Bowie & Harvey, 2006;
Salthouse, 2011;
Sánchez-Cubillo et al., 2009). This powerful but broad measure is frequently applied to examine and diagnose multiple neuropsychiatric disorders (e.g.,
Ashendorf, Jefferson, O'Connor, Chaisson, Green, & Stern, 2008;
Muir et al., 2015;
O'Rourke et al., 2011;
Wölwer & Gaebel, 2002) or investigate executive functions of healthy individuals (e.g.,
Hwang et al., 2016). However, a number of different cognitive functions are implicated in completing the sequence, so that completion time cannot differentiate between specific neurocognitive processes that underlie performance. To address this problem, recent studies have introduced measures beyond completion times that more specifically reflect cognitive processes and mechanisms and therefore offer a more detailed understanding of impaired and intact cognitive functionality. That is, they introduced a computerized version of the test that included eye tracking (
Linari, Juantorena, Ibáñez, Petroni, & Kamienkowski, 2022;
Recker, Foerster, Schneider, & Poth, 2022;
Wölwer & Gaebel, 2002). This addition allows, for example, assessing the number and length of participants’ eye fixations or the amplitude of their saccadic eye movements, both of which offer new information about cognitive processes, such as attentional selection of visual information and the allocation of cognitive processing resources (e.g.,
Hutton, 2008;
Liversedge & Findlay, 2000;
Salthouse & Ellis, 1980). Ongoing technological advancements such as portable eye trackers or virtual reality goggles (
Foerster, Poth, Behler, Botsch, & Schneider, 2016;
Foerster, Poth, Behler, Botsch, & Schneider, 2019) and their combination with eye tracking make it more and more feasible to apply experimental tests with eye tracking in neuropsychological testing scenarios. In the context of the TMT, several recent studies have already demonstrated the utility of eye tracking to understand task performance in terms of specific cognitive functions (
Linari et al., 2022;
Recker et al., 2022;
Wölwer & Gaebel, 2002). However, to use such new measures for individual diagnostics in research and applied settings, it is important to first establish that the measures are reliable (
Mollon, Bosten, Peterzell, & Webster, 2017;
Wilmer, 2017). This seems particularly important, because the reliability determines how strong the measures could maximally correlate with measures from other tests or diagnostic assessments (e.g., neurological assessments). As such, the reliability of the measures lays a necessary foundation for later validations based on correlations with other tests and clinical diagnostics (
Cronbach & Meehl, 1955). Therefore, here we aim to assess the test–retest reliability of a computerized TMT including a detailed neurocognitive profile of different eye-tracking-based measures first introduced by
Recker et al. (2022).