Open Access
Article  |   October 2024
The dichoptic contrast ordering test: A method for measuring the depth of binocular imbalance
Author Affiliations
  • Alex S. Baldwin
    McGill Vision Research, Department of Ophthalmology and Visual Sciences, McGill University; McGill University Health Center, Montreal, QC, Canada
    alexsbaldwin@googlemail.com
  • Marie-Céline Lorenzini
    McGill Vision Research, Department of Ophthalmology and Visual Sciences, McGill University; McGill University Health Center, Montreal, QC, Canada
    marie-celine.lorenzini@umontreal.ca
  • Annabel Wing-Yan Fan
    McGill Vision Research, Department of Ophthalmology and Visual Sciences, McGill University; McGill University Health Center, Montreal, QC, Canada
    annabel.fan@mail.mcgill.ca
  • Robert F. Hess
    McGill Vision Research, Department of Ophthalmology and Visual Sciences, McGill University; McGill University Health Center, Montreal, QC, Canada
    robert.hess@mcgill.ca
  • Alexandre Reynaud
    McGill Vision Research, Department of Ophthalmology and Visual Sciences, McGill University; McGill University Health Center, Montreal, QC, Canada
    alexandre.reynaud@mcgill.ca
Journal of Vision October 2024, Vol.24, 2. doi:https://doi.org/10.1167/jov.24.11.2
  • Views
  • PDF
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Alex S. Baldwin, Marie-Céline Lorenzini, Annabel Wing-Yan Fan, Robert F. Hess, Alexandre Reynaud; The dichoptic contrast ordering test: A method for measuring the depth of binocular imbalance. Journal of Vision 2024;24(11):2. https://doi.org/10.1167/jov.24.11.2.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

In binocular vision, the relative strength of the input from the two eyes can have significant functional impact. These inputs are typically balanced; however, in some conditions (e.g., amblyopia), one eye will dominate over the other. To quantify imbalances in binocular vision, we have developed the Dichoptic Contrast Ordering Test (DiCOT). Implemented on a tablet device, the program uses rankings of perceived contrast (of dichoptically presented stimuli) to find a scaling factor that balances the two eyes. We measured how physical interventions (applied to one eye) affect the DiCOT measurements, including neutral density (ND) filters, Bangerter filters, and optical blur introduced by a +3-diopter (D) lens. The DiCOT results were compared to those from the Dichoptic Letter Test (DLT). Both the DiCOT and the DLT showed excellent test–retest reliability; however, the magnitude of the imbalances introduced by the interventions was greater in the DLT. To find consistency between the methods, rescaling the DiCOT results from individual conditions gave good results. However, the adjustments required for the +3-D lens condition were quite different from those for the ND and Bangerter filters. Our results indicate that the DiCOT and DLT measures partially separate aspects of binocular imbalance. This supports the simultaneous use of both measures in future studies.

Introduction
When operating under normal healthy conditions, the brain combine the visual input from the two eyes to give a unified binocular percept. This combination process is only partially understood, but it is known to involve both excitatory and inhibitory influences between the inputs from the two eyes (Ding & Sperling, 2006; Meese, Georgeson, & Baker, 2006). With basic psychophysics, we can explore these interactions by looking at performance on tasks involving binocular summation and interocular suppression. Although these interactions are typically “balanced” (e.g., suppression from the left eye to the right is similar to that from the right eye to the left), they can be severely imbalanced in some visual conditions such as amblyopia (Hess, Thompson, & Baker, 2014). 
An imbalance in binocular vision may be observed due to a weaker input from one eye, reducing its contribution to the binocular percept. This may also have a compounding effect, where that weaker signal also leads to that eye exerting a weaker suppression on the other eye. Alternatively, the two eyes may provide an equal strength of input, with the weight of suppression applied from each eye to the other being imbalanced (Wang, Baldwin, & Hess, 2021). The resulting imbalances can be observed through masking studies by measuring interactions between stimuli at corresponding locations in the two eyes (dichoptic overlay masking) or with stimuli at nearby non-overlapping locations (dichoptic surround masking). The two methods have been shown to reveal separable aspects of binocular combination and suppression (Huang, Baker, & Hess, 2012). Even the suppression effects have been shown to be complex, as demonstrated by both the tuning functions for dichoptic masking (Baker & Meese, 2007) and differential effects of short-term monocular deprivation that depend on mask orientation (Baldwin & Hess, 2018). It is important to note that, in some measurements, it may be difficult to distinguish between a situation where an eye is providing a weaker input and one where an eye is providing an equally strong input but is penalized by greater interocular suppression. For the brief measurements we discuss here, the two will likely have equivalent effects. 
There can also be a dynamic component to the imbalance between the eyes where the dominance of each eye's percept varies over time. This phenomenon is studied with the binocular rivalry paradigm, where incompatible spatially overlapping stimuli seen by the two eyes compete. This results in the percept switching between the inputs of the two eyes (Blake & Camisa, 1979). The binocular rivalry phenomenon is thought to involve a combination of low-level (eye-based suppression) and higher level (perhaps attentional) factors. There is also a further dynamic component of plasticity on rivalry, where perceptual experience can affect the degree of exclusivity of the monocular percepts (Klink, Brascamp, Blake, & van Wezel, 2010). For an expert review of binocular rivalry, the reader is directed to Alais (2012). In the current study, we are interested in measuring imbalance in the effective strength of stable monocular percepts, whatever the cause. 
Although our understanding of the mechanisms underlying binocular vision has advanced in recent years, it is far from complete (Hess et al., 2014). Myriad laboratory approaches have been devised to quantify imbalances in binocular vision (such as those induced by short-term deprivation of one eye; see various methods compared in Min, Gong et al., 2021); a non-exhaustive list is provided in Table A1. Each depends on a contrast-based metric that is consistent with the current view that the site of interocular suppression is located early in the cortical pathway where contrast gain control occurs (Baker, Meese, & Hess, 2008; Meese et al., 2006). These studies have provided information on the factors that influence suppression, such as the spatial and temporal properties, and the visual locus. A number of these different laboratory-validated approaches have been simplified to enable their use in the clinic, including motion coherence (Black, Thompson, Maehara, & Hess, 2011), Dichoptic Letters Test (DLT; Kwon, Wiecek, Dakin, & Bex, 2015), Worth 4 Dot test app (Webber, Mandall, Molloy, Lister, & Birch, 2020), phase combination (Ding & Sperling, 2006), and orientation combination (Wang et al., 2019). Of these, only the Worth 4 Dot test app does not require elaborate computer equipment, as it is implemented on a self-contained handheld digital device. It has been shown to be effective in measuring the size of the suppression scotoma, although it may fall short in the measurement of depth of suppression, possibly due to the added complication of having chromatic as well as achromatic contrast defining the stimuli (Webber et al., 2020). 
We have developed the Dichoptic Contrast Ordering Test (DiCOT) as a tool for making quick measurements of the relative contribution of the two eyes to binocular vision. A prototype application has been implemented on a handheld digital device (iPad tablet; Apple, Cupertino, CA) for convenience and portability. The subject views the screen through red/green anaglyph glasses, allowing for a dichoptic presentation of stimuli. The principle of the test is that an imbalance between the two eyes (i.e., due to imbalanced suppression) will cause the apparent contrast seen by one eye to be greater than that seen by the other (Bagolini, 1957; Reynaud & Hess, 2016). The goal of the measurement is to find a contrast scaling factor that equalizes the contrast seen by the two eyes. 
The contrast scaling factor is found by showing a set of images (Figure 1A) to the two eyes of various contrasts (three to the left eye and three to the right eye, in the current version of the method) and asking the participant to rank them in order from most to least “visible” (having previously trained the participant to understand that “visibility” refers to contrast). A single scaling factor adjusts the balance of contrasts seen by the two eyes, searching for a compensation that counteracts any binocular imbalance. To find the scaling factor, several “trials” are tested. On each trial, a new set of contrasts is shown to each eye, with the stimulus set being selected by an entropy-minimizing procedure that allows the tool to efficiently determine the most likely scaling factor in a small number of trials. This is achieved through a look-up table that maps the relative likelihood of each ranking order to a quantized list of possible scaling factors (Figure 1B). Bootstrapping is used to deliver this most likely value along with a 95% confidence interval (CI). An associated measurement error is a fundamental requirement of any clinical test that is involved with therapeutic assessment/endpoint evaluation as it allows statistical evaluations on a single-case basis. 
Figure 1.
 
Stimulus display for a single DiCOT trial. (A) Screenshot of a trial with an arrangement of left eye stimuli (in red) and right eye stimuli (in cyan), with each set having a high, medium, and low contrast stimulus determined by the algorithm to provide information on the binocular imbalance of the participant. Depending on the degree of imbalance, in some cases the lower contrast stimuli may not be visible to the participant. (B) Illustration of the look-up tables (generated by Monte Carlo simulation, one per stimulus condition). Each column is a probability distribution giving the probability of a particular ranking response being made for that stimulus condition by an individual with that interocular ratio. Each row is a likelihood function, where the ranking made in response to a stimulus condition indicates the likelihood of the participant's visual system being balanced by some interocular ratio parameter (from –18 to +18 dB).
Figure 1.
 
Stimulus display for a single DiCOT trial. (A) Screenshot of a trial with an arrangement of left eye stimuli (in red) and right eye stimuli (in cyan), with each set having a high, medium, and low contrast stimulus determined by the algorithm to provide information on the binocular imbalance of the participant. Depending on the degree of imbalance, in some cases the lower contrast stimuli may not be visible to the participant. (B) Illustration of the look-up tables (generated by Monte Carlo simulation, one per stimulus condition). Each column is a probability distribution giving the probability of a particular ranking response being made for that stimulus condition by an individual with that interocular ratio. Each row is a likelihood function, where the ranking made in response to a stimulus condition indicates the likelihood of the participant's visual system being balanced by some interocular ratio parameter (from –18 to +18 dB).
In the prototype application, three small images (cartoon pictures of fruit) are shown to each eye. A simpler version of the measurement would present only one image to each eye and ask which is of higher apparent contrast; however, we have a few reasons for preferring a ranking of multiple images. The first is that, on each trial, one of the images (the high-contrast image) shown to each eye is always at the maximum (100%) contrast. The other two images (medium and low contrast) vary in their contrast as determined by the algorithm. The goal of the fixed high contrast image is to prevent any eye-based contrast adaptation or renormalization that might possibly occur over the course of the measurement. The second reason is that the participant implicitly makes multiple comparisons of apparent contrast (among all of the available images on the display) at the same time. This provides more information about the correct scaling from a single presentation. The stimuli are shown continuously until the participant has completed the response for that trial. The outcome of this approach is a simple, game-like measurement involving touching elements on the screen in order of their perceived visibility. This task can be accomplished by children at least as young as 8 years old (Reynaud, Lorenzini, Koenekoop, Hess, & Baldwin, 2024). 
Various approaches have been evaluated to measure the strength of interocular suppression with a contrast-based metric (see Table A1). However, we would not consider any of these yet as a widely accepted clinical gold standard for quantitative measurement of the strength of suppression. In this study, we therefore chose to compare our test to the recent DLT (Kwon et al., 2015) because, like ours, it is designed as a clinical tool to quantify the binocular imbalance that may be due to suppression, and it has recently come into common use in the literature (Birch et al., 2016; Martín, Portela, Ding, Ibarrondo, & Levi, 2020; Sauvan et al., 2019). In this study, we introduced the new DiCOT method to measure imbalances in binocular vision. We analyzed the test–retest reliability of the DiCOT measurements and compared that result to the DLT. We also compared the results obtained by each measurement to the other. To simulate a range of binocular imbalances, measurements were taken from participants with normal vision, making use of three methods to introduce interocular imbalances: neutral density (ND) filters, Bangerter filters, and optical blur (Li et al., 2012). 
Materials and methods
Participants
The experiments were conducted in accordance with the procedures and protocol approved by the institutional research ethics board of the McGill University Health Center and complied with the tenets of the Declaration of Helsinki. Written informed consent was obtained. A total of 12 adult participants (age range, 19–71 years) took part in this study, with normal or corrected-to-normal vision. This was a number sufficient to estimate test–retest intraclass correlation coefficient (ICC) scores for each filter or lens condition with an expected precision of ±0.1 (Bonett, 2002). Four of the participants were study authors (AR, AW-YF, M-CL, RFH). The remaining participants were recruited from the local McGill University Health Centre community through word of mouth and posters. They were naïve to the expected impact of the experimental manipulations on their behavior in the DiCOT task. 
All participants were tested wearing their best correction. Screening was performed by the experimenters responsible for the study. Visual acuity was measured using a Snellen chart, and stereoacuity was measured with the Randot Preschool Stereotest (Stereo Optical Company, Chicago, IL). Normal vision was defined as better than or equal to 0.0 logMAR visual acuity in each eye with stereoacuity < 400 arcsec. Participants reporting any known visual, cognitive, or neurological impairments would have been excluded; however, this was not the case for any of our recruits. 
Experiment design
The relative strength of the two eyes, taken as an index of the imbalance of the contribution of the two eyes to binocular vision, was measured using the existing DLT (Kwon et al., 2015) and our newly developed DiCOT. Procedures and apparatus for these two tests are detailed below. To produce a wide range of imbalances, measurements were conducted under different lens or filter viewing conditions designed to affect interocular balance. The strength of these manipulations was determined based both on the previous experience of the experimenters and on our initial informal explorations (to try to avoid choosing a penalization for one eye that could be strong enough to render either task impossible). Vision was deprived by placing a Bangerter filter (0.8 power), 1-ND filter (10% transmittance), or a +3-D lens in front of one eye. Deprivation of both the left eye and the right eye was performed for each condition. We also measured the baseline with no filter or lens. This made for a total of seven conditions to be tested per subject. To measure test–retest reliability, all measures were performed twice in a randomized order. 
Dichoptic contrast ordering test
Stimuli and apparatus
The DiCOT stimuli were displayed dichoptically using a color anaglyph presentation. Cyan (with contrast only in the green and blue color channels) and red (with contrast only in the red color channel) cartoon drawings representing fruit (i.e., apple, cherry, or grape) with normalized root-mean-square (RMS) contrast (normalized for luminance contrast across the different fruits and different anaglyph eye presentations) were displayed on an Apple iPad Mini display (5th generation, A2133; resolution, 2048 × 1536 pixels; 128 pixels/cm) with a uniform white background (561 cd/m2, measured with an LS-150 Luminance Meter; Konica Minolta Sensing Americas, Ramsey, NJ) with varying contrasts. The software was written in the Unity Engine (Unity Technologies, Copenhagen, Denmark). The screen was viewed through red/green anaglyph glasses (Gulden Ophthalmics, Elkins Park, PA) at a distance of 40 cm. The fruit pictures each filled a 2.5-cm (3.6 degrees of visual angle, or “deg”) square box and were positioned in a square grid arranged 4 cm (5.7 deg) apart center-to-center. 
Through the red filter, the screen luminance was 26 cd/m2, whereas it was 52 cd/m2 through the green filter. Across this relatively modest range of luminances, we would expect contrast constancy to hold (Ashraf, Mantiuk, Martinovic, & Wuerger, 2022), otherwise this would result only in a constant bias of the “balance” point of our measurement. Through the red filter, the cyan stimuli appeared darker than the background. Through the green filter, the red stimuli appeared darker than the background. A custom-written shader was used to apply luminance gamma correction for linear contrast control and to balance the contrast seen by the two eyes through the red/green anaglyph presentation. This means that the contrasts we defined for our stimuli were the effective luminance contrast after the image had passed through the colored anaglyph filter. Spectral measurements from the iPad display made with an X-Rite i1Pro (VPixx Edition; VPixx Technologies, Saint-Bruno-de-Montarville, QC, Canada) showed that the red and green color channels were not entirely independent. Attempts to show green stimuli also introduced a small red component. The result of this for our anaglyph presentation was a cross-talk image at 20% of the presented contrast when viewed through the red filter as compared with only 3% contrast when viewed through the green filter. The expected outcome of this cross-talk was a small bias favoring the green-filter eye. 
Procedure
Testing began with a few example pages where participants had to select some binocularly presented cartoon images of fruit in order from high to low contrast (although the on-screen instructions used the term “visibility,” the examples provided served the purpose of linking the idea of visibility to contrast). Selecting the fruit in the correct order (by tapping the screen with a finger) was a requirement to pass through this training and ensured that participants understood how to perform the task required. The participant then proceeded to the measurement trials. 
On each measurement trial, participants saw six cartoon images of fruit (Figure 1A) at various contrasts. These were presented dichoptically so three were seen by each eye. Their task was to select the fruits in order from the highest to lowest apparent contrast by tapping on them with a finger (“Please tap fruits from most to least visible”). When a fruit was tapped, there was both visual feedback (the fruit became crossed-out) and auditory feedback. If the participant decided they made a mistake and tapped the wrong fruit, they could choose to reset the trial by tapping a “reset” button. The trial was completed when all six fruits had been tapped, initiating a 0.5-second delay before the next trial was presented. It is possible that on some trials the lower contrast images may not have been visible (e.g., if a measurement was being made under conditions that heavily affected the strength of the input from one eye). Provided the participant had at least one eye with good contrast sensitivity we would always expect a minimum of three fruits (seen by that better eye) to be visible. If any fruits were still not visible, then the participant was instructed to guess. To prevent the measurement from stalling if the participant would not guess, a skip button became available 15 seconds after the third fruit was selected. This had the effect of selecting the remaining fruit in random order on behalf of the participant (similar to the “guess button” introduced by García‐Pérez, 2010), thus ending the trial. 
For the first trial, the stimulus set showed the same range of contrasts to the two eyes (100%, 53%, and 28%). After this, the contrast on each trial was set by an entropy-minimizing procedure that attempted to determine the best value for the interocular contrast ratio in the fewest trials. Interpretation of the rankings provided by the participant depends on a set of look-up tables that give the probability of different ranking responses for the different stimulus conditions under various assumed interocular contrast gain ratios (Figure 1B). These were generated through Monte Carlo simulation (100,000 simulations per stimulus condition and per parameter value, which stepped from –18 dB to +18 dB in 0.5-dB intervals). In the simulations, the input gain of the two eyes varied according to the interocular gain ratio, with the judgment of contrast being linear but affected by a 10% internal additive Gaussian contrast “noise.” Varying this noise parameter had the effect of making the likelihood functions overall broader (for large values) or narrower (for small values). The value of 10% was not selected on the assumption that it is a good estimate of the amount of noise affecting this process in human vision. Instead, it is a value that gives reasonably broad likelihood functions that do not challenge the precision limit for representing and manipulating probability values (which may become very small) in the software. In the stimulation, the participant simply selects the targets in order of the amplitude of the noisy response to each target. In the tables, each column gives the relative frequency of the 720 (6! for the ranking of six targets) possible ranking outcomes generated on the basis of a specific stimulus condition and interocular gain ratio. Therefore, the values in each column must add up to 1. 
When a participant completed their ranking in response to the stimulus condition, the corresponding row in the relevant look-up table was selected (Figure 2A). This indicates the relative likelihood of each of the possible (quantized) parameter values on the basis of the ranking given in that trial. We say relative likelihood here, as we are ignoring the possibility of the parameter taking a value either between those we sampled (our quantization was chosen to give a precision that would be realistic based on the length of the measurement) or outside that range (for such large imbalances, the algorithm returns its maximum of either –18 dB or +18 dB). The log of this value is added to the sum of all of the previous log likelihoods (Figure 2B) to obtain an overall log-likelihood function, the peak of which indicates the most likely parameter value given the data (under the assumption that the prior for this parameter is flat). A 1% probability of lapsing (responding randomly) on each trial was also incorporated in this calculation. After each trial, the updated likelihoods of the different parameter values were used to select the stimulus condition for the next trial. The selected stimulus condition was the one that minimized the expected posterior entropy from the response of that trial (see Appendix B). In other words, the algorithm considers all of the possible conditions that could be tested on the next trial and (based on the previous responses) chooses the one that will be most informative regarding the imbalance. 
Figure 2.
 
Post-response analysis performed in the DiCOT. (A) The ranking given by the participant selects a row in the look-up table for that trial. (B) The likelihoods from this row are converted to log values before being added to the cumulative history of the log likelihoods from all previous trials. The curves in this illustration show relative log likelihoods of the different parameter values based on the cumulative history up to either trial (i – 1), in gray, or up to trial i, in dark blue. This example demonstrates the possible impact of adding trial i, which is to sharpen the estimate of the best parameter value, as well as to vertically translate the function downward.
Figure 2.
 
Post-response analysis performed in the DiCOT. (A) The ranking given by the participant selects a row in the look-up table for that trial. (B) The likelihoods from this row are converted to log values before being added to the cumulative history of the log likelihoods from all previous trials. The curves in this illustration show relative log likelihoods of the different parameter values based on the cumulative history up to either trial (i – 1), in gray, or up to trial i, in dark blue. This example demonstrates the possible impact of adding trial i, which is to sharpen the estimate of the best parameter value, as well as to vertically translate the function downward.
The algorithm terminated when a certain confidence interval limit (3 dB) was reached: after a minimum of 10 trials or after a maximum of 20 trials if the confidence intervals were not sufficiently narrow by that time. The 95% CI was determined through non-parametric bootstrapping (1000 samples), randomly resampling from the individual trials and selecting the most likely value from each bootstrapped dataset. A subset of the measurements was timed; from these, the typical duration was approximately 2 to 3 minutes. 
Dichoptic letters test
Stimuli and apparatus
The DLT was adapted from Kwon et al. (Birch et al., 2016; Kwon et al., 2015). This method consists of reading dichoptically displayed bandpass 2-cyc/deg filtered Sloan letters arranged in a layout similar to the Early Treatment Diabetic Retinopathy Study (ETDRS) acuity chart on a gray background (60 cd/m2). The stimuli were generated and controlled using MATLAB 9.7.0 (MathWorks, Natick, MA) with Psychophysics Toolbox extensions for Windows 10 (Microsoft, Redmond, WA), running on a PC desktop computer. Stimuli were presented dichoptically on an active three-dimensional (3D) display setup using NVIDIA 3D Vision 2 shutter glasses (NVIDIA, Santa Clara, CA) running at a 60-Hz frame rate per eye and a liquid-crystal display (LCD) monitor (VS278H-E; refresh rate, 120 Hz; resolution, 1920 × 1080; maximum luminance, 250 cd/m2; ASUSTeK Computer, Taipei, Taiwan). Participants were seated 50 cm away from the screen in a dimly lit room. Two different sets of five letters of different contrast were presented dichoptically overlapping to each eye. At each position, the identity and interocular contrast ratio of the letter on each chart differed. 
Procedure
The balance point between the two eyes, the interocular contrast ratio required to report the letter in each eye with equal probability, was determined via an adaptive procedure described in detail by Kwon et al. (2015). In each trial, participants were instructed to read aloud the more visible letter at each of the five positions in the chart from left to right, and the experimenter then manually input the letters. The relative contrast of overlapping letters in each eye was adjusted via a customized adaptive procedure adapted from QUEST (Watson & Pelli, 1983) for a total of 60 data points (12 charts × 5 contrast ratios) (Kwon et al., 2015). The task typically took 1 to 2 minutes. 
Statistical analysis
Different filters and lenses (i.e., ND filter, Bangerter filter, and +3-D blurring lens) were tested twice separately on each eye of the participants performing both tests. In each test, the interocular suppression ratio was expressed in decibels, where 0 dB indicated balance (equal suppression from the two eyes) and each 6 dB in either direction indicated a 2:1 ratio in the strength of suppression favoring one eye or the other. The degree of suppression, which relies on a contrast-based metric, is plotted logarithmically in decibels:  
\begin{eqnarray}{\rm{dB\ ratio}}\, &=& 20\, \times {\rm{lo}}{{\rm{g}}_{10}}\nonumber\\ && \left( {\frac{{{\rm{Effective\ }}{{\bf right}}{\rm{\ eye\ input\ strength}}}}{{{\rm{Effective\ }}{{\bf left}}{\rm{\ eye\ input\ strength}}}}} \right)\qquad \end{eqnarray}
(1)
 
A measurement of 0 dB would indicate perfect binocular balance, with each 6-dB step above or below that equivalent to a factor of 2 ratio in either direction. Penalizing the input from the left eye would therefore be expected to have the effect of making the decibel ratio more positive. 
Statistical analysis was performed in R (R Core Team, 2023), with additional packages providing ICC analysis (Gamer, Lemon, Fellows, & Singh, 2022) and Bland–Altman analysis (Datta, 2024). We assessed the test–retest reliability of each method using the agreement version of the ICC and a Bland–Altman plot (Bland & Altman, 1999) where the difference between the two values (test–retest) was plotted as a function of the mean of the two values for each subject. We then compared the DiCOT against the DLT. When we compared the two tests against each other, the consistency version of the ICC was used. We also looked for the best linear relationship between the decibel values obtained on the two tests and used this to perform a transformation of the DiCOT values to put them in their best agreement. The relationship between these values and the DLT were also assessed through a scatterplot and a Bland–Altman plot (taking the DLT as the reference value for the x-axis). 
Results
Influence of simulated monocular visual loss on interocular suppression ratios
In total, 168 measurements were completed with the DiCOT. For 15 of these measurements, the information regarding the confidence interval width was lost. For the remaining 153 measurements, we found that the 3-dB CI width criterion was satisfied in 93% of measurements. The remaining 7% reached 20 trials without the CI width reaching 3 dB or lower. For the DLT, two participants in the +3-D lens condition could not perform the measurement on either the test or the retest measure. For the DiCOT, we looked for a relationship between the absolute degree of imbalance and the associated error (95% CI width). We found no significant relationship (Pearson r = 0.00, p = 0.98). 
In Figure 3, we compare the imbalance that was introduced in the results of the two tests for the various filter and lens manipulations we used. Each method introduced an imbalance that could be quantified by both the DiCOT and DLT. These imbalances tended to result in more extreme results on the DLT than on the DiCOT (comparing the standard deviation of all measurements pooled together: 3 dB vs. 10 dB). Figure 3A presents the average measurements for the 12 participants obtained with the DiCOT as a function of the type of filter worn. Deprivation of the right eye would lead to negative values, whereas deprivation of the left eye would lead to positive values. Figure 3B displays the same measurements but obtained using the DLT. A similar pattern is observed; however, the effect of the lens (+3 D) was more exaggerated in the DLT compared to the DiCOT. We believe this is due to the DLT being tested at a farther distance than the DiCOT (50 cm vs. 40 cm). At this distance, the lens has a more dramatic effect on visual acuity (20/125 reduction compared with 20/50). 
Figure 3.
 
Binocular imbalance without lens/filter and under three lens/filter conditions. The bar graphs represent the mean interocular suppression ratio parameter at baseline and in the three conditions where the filter or lens is placed over either the left (OS) or right eye (OD). (A) Results from the DiCOT. (B) Results from the DLT. Error bars represent the bootstrapped 95% CI of the mean. Individual participant data are shown for each bar with the cross symbols.
Figure 3.
 
Binocular imbalance without lens/filter and under three lens/filter conditions. The bar graphs represent the mean interocular suppression ratio parameter at baseline and in the three conditions where the filter or lens is placed over either the left (OS) or right eye (OD). (A) Results from the DiCOT. (B) Results from the DLT. Error bars represent the bootstrapped 95% CI of the mean. Individual participant data are shown for each bar with the cross symbols.
Test–retest reliability of the interocular suppression assessment methods
The test–retest comparisons made with the two measurement tools are shown in Figure 4. The plots in the left column are the test–retest scatterplots. Figure 4A shows this comparison for the DiCOT, with the agreement between the test and retest measurements quantified by the ICC agreement score. This is a specific test of the extent to which the second measurement differs from the first (i.e., it is not a simple correlation measure looking for an unconstrained linear relationship between the two). With this, we obtained a value of 0.91, which can be interpreted as excellent (Cicchetti, 1994). A marginally better result (0.97) was found for the DLT, as shown in Figure 4C. 
Figure 4.
 
Test–retest reliability. The left column shows correlations between first and second (test and retest) measurements obtained using the DiCOT (A) and the DLT (C). The solid diagonal line indicates equality. Each point indicates a data point from one participant in one lens/filter condition (84 in total for the DiCOT, 82 in the DLT due to cases where a measurement result could not be obtained). The right column shows Bland–Altman plots giving the test–retest difference as a function of the mean of the two measures. The Bland–Altman plot for the DiCOT is given in B, corresponding to the test-retest plot in A. The Bland–Altman plot for the DLT is given in D, corresponding to the test-retest plot in C. The three black lines show the bias and the upper and lower LoA. The shaded region around each line gives the 95% CI. The results for each filter or lens condition are shown individually in Figure A1, and the statistics are reported in Table 1.
Figure 4.
 
Test–retest reliability. The left column shows correlations between first and second (test and retest) measurements obtained using the DiCOT (A) and the DLT (C). The solid diagonal line indicates equality. Each point indicates a data point from one participant in one lens/filter condition (84 in total for the DiCOT, 82 in the DLT due to cases where a measurement result could not be obtained). The right column shows Bland–Altman plots giving the test–retest difference as a function of the mean of the two measures. The Bland–Altman plot for the DiCOT is given in B, corresponding to the test-retest plot in A. The Bland–Altman plot for the DLT is given in D, corresponding to the test-retest plot in C. The three black lines show the bias and the upper and lower LoA. The shaded region around each line gives the 95% CI. The results for each filter or lens condition are shown individually in Figure A1, and the statistics are reported in Table 1.
The right column of Figure 4 shows the Bland–Altman plots for each method. We see that the limits of agreement (LoA) are narrower (around half the width; see bottom row of Table 1) for the DiCOT than they are for the DLT. This is unsurprising, as the range of values obtained is also narrower for this measurement. Visual inspection reveals no evident systematic relationship between the degree of imbalance and the difference between test and retest measurements. 
Table 1.
 
Test–retest reliability statistics shown in Figure 4 and Figure A1. The agreement form of the ICC is reported, as well as the bias and LoA from the Bland–Altman plots.
Table 1.
 
Test–retest reliability statistics shown in Figure 4 and Figure A1. The agreement form of the ICC is reported, as well as the bias and LoA from the Bland–Altman plots.
A further analysis was performed on the data from the individual filter and lens conditions. The scatterplots and Bland–Altman plots are provided in Figure A1, and the statistics are reported in Table 1. In most cases, the test–retest reliability was comparable between the different conditions. The exception was for testing without a filter or lens. For the DiCOT, in particular, the ICC agreement score was lower in this condition (0.78); however, it was still in the “excellent” range (Cicchetti, 1994). The reason for the poorer performance of the DiCOT in this case is likely due to the smaller range of imbalances found in our study participant population (in the absence of filter or lens manipulations designed to introduce imbalances). 
Comparison between interocular suppression assessment methods
Interocular suppression ratios obtained with the DLT are compared against those measured with the DiCOT in Figure 5. As in Figure 4, each participant and lens/filter condition are plotted as a separate point. The top and bottom rows give the data from the test and retest measurements, respectively. The consistency ICC between the two measurements was poor (0.32) in both cases. Looking for any linear relationship in the scatterplots (left column), we find best-fitting lines with slopes of 0.18 on the log–log axes for both the test and retest measurements. This accounts for 27% to 29% of the variance. Because the slope of this relationship was not unity, we observe a systematic relationship between the DLT result and the difference between the DLT and DiCOT results in the Bland–Altman plots (right column of Figure 5). 
Figure 5.
 
Comparison between DLT and DiCOT. The left column shows scatterplots comparing the measurement made with one test against that made with the other. The top row (A) compares the test (first) measurements, and the bottom (C) compares the second (retest) measurements. The diagonal green line is the best-fitting linear relationship (parameters given in green text). The black line shows equality. The right column shows Bland–Altman plots derived from the scatterplots (B for the first measurements, D for the second measurements). The three black lines show the bias and the upper and lower LoA. The shaded region around each line gives the 95% CI. The diagonal green line is the best-fitting linear relationship (parameters given in green text). The results for each filter or lens condition are shown individually in Figure A2, with statistics reported in part in Table 2 and in full in Table A1.
Figure 5.
 
Comparison between DLT and DiCOT. The left column shows scatterplots comparing the measurement made with one test against that made with the other. The top row (A) compares the test (first) measurements, and the bottom (C) compares the second (retest) measurements. The diagonal green line is the best-fitting linear relationship (parameters given in green text). The black line shows equality. The right column shows Bland–Altman plots derived from the scatterplots (B for the first measurements, D for the second measurements). The three black lines show the bias and the upper and lower LoA. The shaded region around each line gives the 95% CI. The diagonal green line is the best-fitting linear relationship (parameters given in green text). The results for each filter or lens condition are shown individually in Figure A2, with statistics reported in part in Table 2 and in full in Table A1.
We performed a separate analysis of the relationship between the DiCOT and DLT results obtained with the different filter and lens manipulations. The results are is plotted in Figure A2, and the full statistics are reported in Table A1. The consistency ICC was poorest for the measurements without a filter or lens, likely due to the small baseline variation in the imbalance in our population relative to the variability in the measurements. The consistency ICC for the +3-D lens condition was also lower than for the other filter and lens conditions, as would be expected based on the differences in the magnitudes of the effects seen in Figure 3. These magnitudes would affect the slope of the linear relationship between the DLT and the DiCOT, to which the consistency ICC metric is sensitive (as it assumes that the effects of the different measurements on the two tests will have equal magnitude and so the slope will be unity). The slopes from the scatterplots are summarized in Table 2. Without a filter or lens, there was no significant relationship between the DiCOT and the DLT. The Bangerter and ND filter conditions had steeper slopes (0.31–0.39), and the +3-D lens condition had a shallower slope (0.10). 
Table 2.
 
Consistency ICC calculated for the individual lens/filter conditions (as shown in Figure 5), in each case combining the OS and OD data. As there was no significant linear relationship for the no filter/lens condition, the data were not included for the individual analysis with the adjusted DiCOT.
Table 2.
 
Consistency ICC calculated for the individual lens/filter conditions (as shown in Figure 5), in each case combining the OS and OD data. As there was no significant linear relationship for the no filter/lens condition, the data were not included for the individual analysis with the adjusted DiCOT.
To improve the consistency between the DLT and DiCOT, we used the best-fitting linear slope and constant parameters to transform the DiCOT results such that they should have their best correlation with the DLT. The relationships with the adjusted DiCOT results are shown in Figure 6. As expected, this slightly improved the ICCs to 0.43 to 0.45 (“fair” by Cicchetti, 1994). The right column shows the Bland–Altman plots calculated on the basis of the adjusted DiCOT values. The LoA here are very broad (±32 dB). The variety of differences between the two measurements may account for the discrepancy between the results obtained with them. We performed an additional, separate analysis for each of the filter and lens conditions. These were transformed based on the linear relationships for each condition reported in Table A1. As there was no significant relationship for the no filter/lens condition, that condition was excluded from this analysis. 
Figure 6.
 
Comparison between DLT and adjusted DiCOT measurements. Panel A shows the relationship between the two measurements for the first (test) measurements, with the Bland–Altman plot shown in B. Panel C shows the relationship for the second (retest) measurements with the Bland–Altman plot shown in D. Similar to Figure 5; however, in this case the DiCOT values are adjusted according to the best-fitting linear relationships shown in green in Figure 5A (for the test measurements) and Figure 5C (for the retest measurements). The results for each filter or lens condition are shown individually in Figure A3, with statistics reported in-part in Table 2 and in-full in Table A2.
Figure 6.
 
Comparison between DLT and adjusted DiCOT measurements. Panel A shows the relationship between the two measurements for the first (test) measurements, with the Bland–Altman plot shown in B. Panel C shows the relationship for the second (retest) measurements with the Bland–Altman plot shown in D. Similar to Figure 5; however, in this case the DiCOT values are adjusted according to the best-fitting linear relationships shown in green in Figure 5A (for the test measurements) and Figure 5C (for the retest measurements). The results for each filter or lens condition are shown individually in Figure A3, with statistics reported in-part in Table 2 and in-full in Table A2.
The mean consistency ICCs for each filter or lens condition are provided in Table 2. In each case, the consistency between the adjusted DiCOT and DLT values was better (0.56–0.63) for the individual conditions than for the pooled data (0.44). So, overall, we do see consistency between the two tests within an individual lens or filter condition, but the relative scaling of the impact of the different manipulations varies between the two methods. Specifically, the Bangerter filter and ND filter seem to have a similar relationship, but the +3-D lens has a relatively small effect measured in the DiCOT compared with the DLT (as shown in Figure 3). This may be expected, as the viewing distance for the DLT was greater and so the impact of the blur from the lens would also be greater. We also see much larger Bland–Altman LoA in this condition compared with the Bangerter or ND filter conditions. 
Discussion
This study introduces a new quick method for measuring the relative strength of the contribution of the two eyes to binocular vision in the lab or clinic. With an anaglyph presentation, it can be delivered on a variety of digital devices. It involves a simple task of ranking the visibility of dichoptically presented stimuli, which after only 10 to 20 trials provides a quantitative estimate of the magnitude and direction of any imbalance along with the associated measurement error. It has excellent test–retest reliability both in the combined analysis of all of our conditions (Figure 4A) and in the analysis of the results from individual filter or lens conditions (Figure A1). 
We chose to compare the results of the DLT (Kwon et al., 2015) with those obtained from the DiCOT. We did not expect that the two methods would measure an identical interocular suppression process. The DLT measures overlay suppression—namely, suppression between corresponding regions of the visual field in both eyes which are presented with rivalrous stimuli. Our DiCOT, on the other hand, measures dichoptic differences in perceived contrast, which may be mediated by suppression arising from surrounding regions of the visual field from the other eye. We expected the DLT result to mostly reflect feedforward and local intercortical interactions, whereas the DiCOT was expected to depend more on feedback interactions from higher cortical areas (Angelucci & Shushruth, 2013). However, what the tests have in common is that they both are suitable to be used for clinical measurements. They are appropriate for children as well as for adult patients, and they both provide the error associated with the measurement. 
When penalizing lenses and filters were used, both tests showed changes in the binocular imbalance (Figure 3). The DLT, which we propose is based on effects of overlay suppression and rivalry dominance (where that rivalry state itself may affect the behavior of the system; see Klink et al., 2010), gave more extreme values than the DiCOT. Our DiCOT measure is designed to reflect both the feed-forward contrast gain of the inputs of the two eyes (without local competition), as well as any effects of dichoptic surround suppression from the elements seen in the other eye. Both tests exhibited excellent test–retest reliability. Although the pattern of results found with the two tests was similar, the results from one accounted for around a third of the variance in the results from the other (Figure 5). We argue that they target (partially) distinct aspects of binocular vision and so are differentially affected by our different lens and filter condition manipulations. This is reflected by the different regression slopes reported in Table 2. For example, to the extent that the DLT is a rivalry-based task, there is an expectation that a blurred input in one eye (from our +3-D lens) would greatly reduce the dominance of that blurred image (Arnold, Grove, & Wallis, 2007). On the other hand, we would also expect a (less dramatic) reduction in the perceived contrast in the DiCOT if the image were blurred in one eye (May & Georgeson, 2007). This exaggerated effect of blurring on the DLT is reflected visually in our results (Figure 3) and by the relatively shallow regression slope (Table 2). Determining the best instructions and training to separate blur from contrast judgments in the DiCOT is a topic for future research. If we adjusted the data from each filter or lens condition individually, we did find they each showed a comparable consistency between the DiCOT and the DLT (consistency ICC scores of 0.56–0.63; see Table 2). This is still lower than the test–retest reliability, indicating a residual difference between the two measurements. The possibility that the DiCOT and the DLT may target distinct aspects of binocular imbalance makes the combined use of both tests interesting. One such study has been performed comparing the two measurement tools in amblyopia (Reynaud et al., 2024). 
Although both the DLT and the DiCOT should reflect imbalances in the gain of the input from each eye, we found (as mentioned above) a wider range in the DLT results. This may be due to the rivalrous stimulus design. Exaggeration of the imbalance for measurements made with rivalrous stimuli would also be expected in amblyopia (Meier, Tarczy-Hornoch, Boynton, & Fine, 2023). The other difference of interest to us was the ways in which the two tasks would be affected by interocular suppression. Spatially, suppression can be induced by both stimuli occupying a corresponding retinal locus (overlay suppression) and those in neighboring regions of the visual field (surround suppression). Overlay suppression is thought to mostly involve local responses from neurons responding within the stimulus area and exhibits orientation tuning. Surround suppression involves more remote suppressive interactions. These involve both intracortical horizontal connections and likely feedback contributions from higher visual areas (Angelucci & Shushruth, 2013). Very remote surround interactions do not exhibit orientational tuning (Shushruth et al., 2013). Interocular suppression in individuals with normal healthy vision that have induced interocular imbalances or in amblyopic individuals who have intrinsic interocular imbalances as a result of a disruption of normal visual development involves contributions of both overlay and surround suppression (Huang, Zhou, Lu, Feng, & Zhou, 2012). This imbalance in amblyopia does not affect behavior on contrast matching tasks when stimuli are presented monocularly (Hess & Bradley, 1980), suggesting that there is no suprathreshold impact of interocular suppression on perceived contrast under monocular stimulation. Under dichoptic viewing, however, contrast matching is impacted (Reynaud & Hess, 2016). The specificity of this phenomenon to simultaneous dichoptic presentation underlies our argument that imbalances measured with the DiCOT can be considered to reflect interocular suppression. However, for the many purposes for which this tool may be used, the distinction between imbalances in gain and those in suppression may be irrelevant. 
Limitations and future work
A number of possible improvements to the DiCOT measurement have been recognized throughout the study process. From the beginning, we anticipated that the multiple comparisons made by the participants should allow, in principle, for the participants’ responses to be validated for internal consistency. This will be implemented in a future iteration. In the Monte Carlo simulations that generated our look-up tables, we used a linear response to contrast. This means that, in the simulated rankings, equal differences in linear contrast are equally discriminable. To better correspond to the human visual system, the look-up tables would be improved if they instead operated on a logarithmic contrast scale. This would make the discriminable differences follow the Weber–Fechner law, where the discriminable increment in some property (such as contrast) is proportional to the baseline amount (Wixted, 2019). The effect of such a change will be to rebalance the relative importance of transpositions made in the ranking of contrasts in the lower or higher contrast range. A linear simulation is “better” at ranking the higher contrast stimuli accurately than a simulation following the Weber–Fechner law would be. This means that deviations from the expected ranking order made with those higher contrasts are less probable and so will have a greater impact on the predicted imbalance in the current form of the DiCOT. The effect of changing this rule will be explored in a future study with a new version of the DiCOT. 
There is also a need to investigate how technical aspects of the stimulus presentation (e.g., alternatives to the anaglyph mode of dichoptic presentation) and design (e.g., spacing between the targets, broadband vs. spatially bandpass targets) affect the measurements. The current study compared the DiCOT presented via a red/green anaglyph on an iPad against the DLT presented on a 3D display system using active shutter glasses. A comparison was made between the two measurements on the different technologies as there was not a version of the DLT available for color anaglyphs, nor do we have a version of the DiCOT for an active 3D system. So, effects due to the equipment may have impacted any differences we found. A comprehensive analysis of these factors will be pursued in future studies. The participants free-viewed the stimuli without eye-tracking, so there was no opportunity to analyze the potential impacts of eye movements on their behavior. We also did not instruct the participants to prioritize the speed of their response, so we do not know whether there was a relationship between the imbalance and the speed with which the task could be performed. These aspects may be of scientific interest in future investigations. 
Acknowledgments
The authors thank MiYoung Kwon and Peter Bex for sharing the code of the DLT. 
Novartis Pharma AG (Basel) supported the study with funds under a research agreement entered into on March 31, 2020. 
Commercial relationships: A.S. Baldwin (P, PCT/CA2024/050874); M.-C. Lorenzini, none; A.W.-Y. Fan, none; R.F. Hess (P, PCT/CA2024/050874); A. Reynaud, none. 
Corresponding author: Alex S. Baldwin. 
Email: alexander.baldwin@mcgill.ca. 
Address: McGill Vision Research, Department of Ophthalmology and Vision Sciences, McGill University; McGill University Health Center, Montreal, QC H3G 1A4, Canada. 
References
Alais, D. (2012). Binocular rivalry: Competition and inhibition in visual perception. Wiley Interdisciplinary Reviews: Cognitive Science, 3, 87–103, https://doi.org/10.1002/wcs.151. [PubMed]
Angelucci, A., & Shushruth, S. (2013). Beyond the classical receptive field: Surround modulation in primary visual cortex. In: Werner, J. S. & Chalupa, L. M. (Eds.), The new visual neurosciences, (pp. 425–444). Cambridge, MA: MIT Press.
Arnold, D. H., Grove, P. M., & Wallis, T. S. A. (2007). Staying focused: A functional account of perceptual suppression during binocular rivalry. Journal of Vision, 7(7):7, 1–8, https://doi.org/10.1167/7.7.7.
Ashraf, M., Mantiuk, R., Martinovic, J., & Wuerger, S. (2022). Suprathreshold contrast matching between different luminance levels. In: 30th Color and Imaging Conference Final Program and Proceedings: Color Science and Engineering Systems, Technologies, and Applications (pp. 219–224). Springfield, VA: Society for Imaging Science and Technology.
Babu, R. J., Clavagnier, S. R., Bobier, W., Thompson, B., & Hess, R. F. (2013). The regional extent of suppression: Strabismics versus nonstrabismics. Investigative Ophthalmology & Visual Science, 54, 6585, https://doi.org/10.1167/iovs.12-11314. [PubMed]
Bagolini, B. (1957). Presentazione di una sbarra di filtri a densita scalare assorbenti i raggi luminosi. Bollettino d'oculistica, 36, 638–651. [PubMed]
Baker, D. H., & Meese, T. S. (2007). Binocular contrast interactions: Dichoptic masking is not a single process. Vision Research, 47, 3096–3107, https://doi.org/10.1016/j.visres.2007.08.013. [PubMed]
Baker, D. H., Meese, T. S., & Hess, R. F. (2008). Contrast masking in strabismic amblyopia: Attenuation, noise, interocular suppression and binocular summation. Vision Research, 48, 1625–1640, https://doi.org/10.1016/j.visres.2008.04.017. [PubMed]
Baldwin, A. S., & Hess, R. F. (2018). The mechanism of short-term monocular deprivation is not simple: Separate effects on parallel and cross-oriented dichoptic masking. Scientific Reports, 8, 1–8, https://doi.org/10.1038/s41598-018-24584-9. [PubMed]
Baldwin, A. S., Lorenzini, M.-C., Fan, A.W.-Y., Hess, R. F., & Reynaud, A. (2024). The Dichoptic Contrast Ordering Test: A method for measuring the depth of binocular imbalance, medRxiv, https://doi.org/10.1101/2024.08.05.24311503.
Birch, E. E., Morale, S. E., Jost, R. M., De La Cruz, A., Kelly, K. R., Wang, Y.-Z., & Bex, P. J. (2016). Assessing suppression in amblyopic children with a dichoptic eye chart. Investigative Ophthalmology & Visual Science, 57, 5649, https://doi.org/10.1167/iovs.16-19986. [PubMed]
Black, J. M., Thompson, B., Maehara, G., & Hess, R. F. (2011). A compact clinical instrument for quantifying suppression. Optometry and Vision Science, 88, E334–E343, https://doi.org/10.1097/OPX.0b013e318205a162.
Blake, R., & Camisa, J. (1979). On the inhibitory nature of binocular rivalry suppression. Journal of Experimental Psychology: Human Perception and Performance, 5, 315–323. [PubMed]
Bland, J. M., & Altman, D. G. (1999). Measuring agreement in method comparison studies. Statistical Methods in Medical Research, 8, 135–160, https://doi.org/10.1177/096228029900800204. [PubMed]
Bonett, D. G. (2002). Sample size requirements for estimating intraclass correlations with desired precision. Statistics in Medicine, 21, 1331–1335, https://doi.org/10.1002/sim.1108. [PubMed]
Bossi, M., Tailor, V. K., Anderson, E. J., Bex, P. J., Greenwood, J. A., Dahlmann-Noor, A., & Dakin, S. C. (2017). Binocular Therapy for Childhood Amblyopia Improves Vision Without Breaking Interocular Suppression. Investigative Ophthalmology & Visual Science, 58, 3031, https://doi.org/10.1167/iovs.16-20913. [PubMed]
Cicchetti, D. V. (1994). Guidelines, criteria, and rules of thumb for evaluating normed and standardized assessment instruments in psychology. Psychological Assessment, 6, 284–290, https://doi.org/10.1037/1040-3590.6.4.284.
Datta, D. (2024). blandr: Bland–Altman method comparison package for R. Retrieved from https://zenodo.org/records/11583535.
Ding, J., & Sperling, G. (2006). A gain-control theory of binocular combination. Proceedings of the National Academy of Sciences, USA, 103, 1141–1146, https://10.1073/pnas.0509629103.
Gamer, M., Lemon, J., Fellows, I., & Singh, P. (2022). Package ‘irr’: Various coefficients of interrater reliability and agreement. Retrieved from https://cran.r-project.org/web/packages/irr/irr.pdf.
García-Pérez, M. A. (2010). Denoising forced-choice detection data. British Journal of Mathematical and Statistical Psychology, 63, 75–100, https://doi.org/10.1348/000711009X424057.
Harrad, R. A., & Hess, R. F. (1992). Binocular integration of contrast information in amblyopia. Vision Research, 32, 2135–2150, https://doi.org/10.1016/0042-6989(92)90075-T. [PubMed]
Hess, R. F. (1991). The site and nature of suppression in squint amblyopia. Vision Research, 31, 111–117, https://doi.org/10.1016/0042-6989(91)90078-J. [PubMed]
Hess, R. F., & Bradley, A. (1980). Contrast perception above threshold is only minimally impaired in human amblyopia. Nature, 287, 463–464, https://doi.org/10.1038/287463a0. [PubMed]
Hess, R. F., Thompson, B., & Baker, D. H. (2014). Binocular vision in amblyopia: Structure, suppression and plasticity. Ophthalmic and Physiological Optics, 34, 146–162, https://doi.org/10.1111/opo.12123.
Huang, C. B., Zhou, J., Lu, Z. L., Feng, L., & Zhou, Y. (2009). Binocular combination in anisometropic amblyopia. Journal of Vision, 9(3):17, 1–16, https://doi.org/10.1167/9.3.17. [PubMed]
Huang, P.-C., Baker, D. H., & Hess, R. F. (2012). Interocular suppression in normal and amblyopic vision: Spatio-temporal properties. Journal of Vision, 12(11):29, 1–12, https://doi.org/10.1167/12.11.29.
Klink, P. C., Brascamp, J. W., Blake, R., & van Wezel, R. J. A. (2010). Experience-driven plasticity in binocular vision. Current Biology, 20, 1464–1469, https://doi.org/10.1016/j.cub.2010.06.057.
Kwon, M., Wiecek, E., Dakin, S. C., & Bex, P. J. (2015). Spatial-frequency dependent binocular imbalance in amblyopia. Scientific Reports, 5, 17181, https://doi.org/10.1038/srep17181. [PubMed]
Li, J., Thompson, B., Ding, Z., Chan, L. Y. L., Chen, X., Yu, M., … Hess, R. F. (2012). Does partial occlusion promote normal binocular function? Investigative Ophthalmology & Visual Science, 53, 6818, https://doi.org/10.1167/iovs.12-10390. [PubMed]
Li, J., Thompson, B., Lam, C. S. Y., Deng, D., Chan, L. Y. L., Maehara, G., … Hess, R. F. (2011). The role of suppression in amblyopia. Investigative Ophthalmology & Visual Science, 52, 4169, https://doi.org/10.1167/iovs.11-7233. [PubMed]
Maehara, G., Thompson, B., Mansouri, B., Farivar, R., & Hess, R. F. (2011). The perceptual consequences of interocular suppression in amblyopia. Investigative Ophthalmology & Visual Science, 52, 9011, https://doi.org/10.1167/iovs.11-7748. [PubMed]
Martín, S., Portela, J. A., Ding, J., Ibarrondo, O., & Levi, D. M. (2020). Evaluation of a virtual reality implementation of a binocular imbalance test. PLoS One, 15, e0238047, https://doi.org/10.1371/journal.pone.0238047. [PubMed]
May, K. A., & Georgeson, M. A. (2007). Blurred edges look faint, and faint edges look sharp: The effect of a gradient threshold in a multi-scale edge coding model. Vision Research, 47, 1705–1720, https://doi.org/10.1016/j.visres.2007.02.012. [PubMed]
Meese, T. S., Georgeson, M. A., & Baker, D. H. (2006). Binocular contrast vision at and above threshold. Journal of Vision, 6(11):7, 1224–1243, https://doi.org/10.1167/6.11.7. [PubMed]
Meier, K., Tarczy-Hornoch, K., Boynton, G. M., & Fine, I. (2023). Characterizing amblyopic perception under non-rivalrous viewing conditions. Scientific Reports, 13, 7993, https://doi.org/10.1038/s41598-023-31301-8. [PubMed]
Min, S. H., Gong, L., Baldwin, A. S., Reynaud, A., He, Z., Zhou, J., … Hess, R. F. (2021). Some psychophysical tasks measure ocular dominance plasticity more reliably than others. Journal of Vision, 21(8):20, 1–23, https://doi.org/10.1167/jov.21.8.20.
R Core Team. (2023). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing.
Reynaud, A., & Hess, R. F. (2016). Is suppression just normal dichoptic masking? Suprathreshold considerations. Investigative Ophthalmology & Visual Science, 57, 5107, https://doi.org/10.1167/iovs.16-19682. [PubMed]
Reynaud, A., Lorenzini, M.-C., Koenekoop, R. K., Hess, R. F., & Baldwin, A. S. (2024). Amblyopic binocular imbalance quantified by the dichoptic contrast ordering test and dichoptic letters test. Manuscript submitted for publication.
Sauvan, L., Stolowy, N., Denis, D., Matonti, F., Chavane, F., Hess, R. F., … Reynaud, A. (2019). Contribution of short-time occlusion of the amblyopic eye to a passive dichoptic video treatment for amblyopia beyond the critical period. Neural Plasticity, 2019, 6208414, https://doi.org/10.1155/2019/6208414. [PubMed]
Shushruth, S., Nurminen, L., Bijanzadeh, M., Ichida, J. M., Vanni, S., & Angelucci, A. (2013). Different orientation tuning of near- and far-surround suppression in macaque primary visual cortex mirrors their tuning in human perception. Journal of Neuroscience, 33, 106–119, https://doi.org/10.1523/JNEUROSCI.2518-12.2013.
Spiegel, D. P., Baldwin, A. S., & Hess, R. F. (2016). The relationship between fusion, suppression, and diplopia in normal and amblyopic vision. Investigative Ophthalmology & Visual Science, 57, 5810, https://doi.org/10.1167/iovs.16-20438. [PubMed]
Tsuchiya, N., Koch, C., Gilroy, L. A., & Blake, R. (2006). Depth of interocular suppression associated with continuous flash suppression, flash suppression, and binocular rivalry. Journal of Vision, 6(10):6, 1068–1078, https://doi.org/10.1167/6.10.6. [PubMed]
Wang, X., Baldwin, A. S., & Hess, R. F. (2021). Balanced binocular inputs support superior stereopsis. Investigative Ophthalmology & Visual Science, 62, 10, https://doi.org/10.1167/iovs.62.12.10. [PubMed]
Wang, Y., He, Z., Liang, Y., Chen, Y., Gong, L., Mao, Y., … Hess, R. F. (2019). The binocular balance at high spatial frequencies as revealed by the binocular orientation combination task. Frontiers in Human Neuroscience, 13, 106, https://doi.org/10.3389/fnhum.2019.00106. [PubMed]
Watson, A. B., & Pelli, D. G. (1983). Quest: A Bayesian adaptive psychometric method. Perception & Psychophysics, 33, 113–120, https://doi.org/10.3758/BF03202828. [PubMed]
Webber, A. L., Mandall, T. R., Molloy, D. T., Lister, L. J., & Birch, E. E. (2020). Worth 4 Dot app for determining size and depth of suppression. Translational Vision Science & Technology, 9, 9, https://doi.org/10.1167/tvst.9.4.9. [PubMed]
Wixted, J. (2019). The forgotten history of signal detection theory. Journal of Experimental Psychology: Learning, Memory, and Cognition, 46, 201–233, https://doi.org/10.1037/xlm0000732. [PubMed]
Wolfe, J. M. (1986). Briefly presented stimuli can disrupt constant suppression and binocular rivalry suppression. Perception, 15, 413–417, https://doi.org/10.1068/p150413. [PubMed]
Zhou, J., Huang, P.-C., & Hess, R. F. (2013). Interocular suppression in amblyopia for global orientation processing. Journal of Vision, 13(5):19, 1–14, https://doi.org/10.1167/13.5.19. [PubMed]
Zhou, J., Reynaud, A., Yao, Z., Liu, R., Feng, L., Zhou, Y., … Hess, R. F. (2018). Amblyopic suppression: Passive attenuation, enhanced dichoptic masking by the fellow eye or reduced dichoptic masking by the amblyopic eye? Investigative Ophthalmology & Visual Science, 59, 4190, https://doi.org/10.1167/iovs.18-24206. [PubMed]
Appendix A: Table of suppression measurement methods
  
Table A1.
 
Results of a review of existing methods for measuring interocular suppression.
Table A1.
 
Results of a review of existing methods for measuring interocular suppression.
Appendix B: Algorithm details
The goal of the measurement is to estimate the underlying interocular contrast ratio that defines the imbalance in the perceived contrast between the eyes. This is found as the most likely value of the interocular ratio parameter (r) based on the history of ranking responses given to the different stimulus conditions:  
\begin{eqnarray}{\rm{log\ L(}}{r_i}) = \mathop \sum \limits_j {\rm{log\ L}}({r_i}|{\rm{conditio}}{{\rm{n}}_j},{\rm{respons}}{{\rm{e}}_j})\qquad \end{eqnarray}
(A1)
where log L(ri) is the overall log-likelihood of the ith parameter value, calculated as the sum of the log-likelihoods based on the responses given to the conditions presented over each j of the trials. These likelihood values are obtained from pre-generated look-up tables. After converting from log-likelihood back to likelihood, we normalize to give L′:  
\begin{eqnarray} {\bf{L}}^{\prime}(r_{i}) = {\bf{L}}(r_{i}) \left/ \mathop \sum\nolimits_i {\bf{L}}(r_{i})\right.\qquad \end{eqnarray}
(A2)
where the sum of L′ over each i of the possible ratios is 1. This L′(ri) is our running estimate of the likelihood of each of our discretely-sampled parameter values ri. The algorithm attempts to estimate the most likely of these parameter values. This is done by minimizing the expected posterior entropy. The entropy given all previous trials is defined as  
\begin{eqnarray} {\rm{H}}({\boldsymbol{r}}) = - \mathop \sum \limits_i {{\bf L^{\prime}}}({r_i})\ \times {\rm{\ log\ }}{{\bf L^{\prime}}}({r_i})\ \qquad \end{eqnarray}
(A3)
 
To obtain the expected posterior entropy, we must consider adding one more trial and account for the various possible responses that the participant is more or less probable to give based on the current best estimate of their interocular ratio. To find the expected entropy resulting from the j + 1 trial, we determine the normalized likelihood function that would be the outcome for each possible ranking response in that trial. We use Bayes’ theorem, simply  
\begin{eqnarray} {{\bf L^{\prime}}}\left( {r{\rm{|rank}}} \right) = {{{\rm{P}}({\rm{rank}}|r)\ \times {{\bf L^{\prime}}}\left( r \right)}} \big/ {{{\rm{P}}\left( {{\rm{rank}}} \right)}}\qquad \end{eqnarray}
(A4)
where L′(r) is the prior likelihood of the ratio value, and P(rank) is the marginal probability of the ranking outcome. This probability, P(rank), of giving a certain (kth) ranking response has to take into account the relative likelihoods of the different interocular ratio parameter options:  
\begin{eqnarray} {\rm{P(ran}}{{\rm{k}}_k}|{\rm{conditio}}{{\rm{n}}_{j + 1}}) = \mathop \sum \limits_i {{\bf L^{\prime}}}({r_i}) \times {\rm{\ P(ran}}{{\rm{k}}_k}|{r_i},{\rm{conditio}}{{\rm{n}}_{j + 1}})\qquad \end{eqnarray}
(A5)
where the probability of a certain ranking being given to a particular condition under a particular interocular ratio parameter, P(rankk|ri, conditionj+1), can be obtained directly from the Monte Carlo–generated look-up tables. 
Substituting the appropriate values into Equation A4, we have  
\begin{eqnarray} {\bf{L}}^{\prime}({r_i}|{\rm{condition}}_{j + 1},{\rm{rank}}_{k}) = {\rm{P}}({\rm{rank}}_{k}|r_{i},{\rm{condition}}_{j + 1}) \times \left. {\bf{L}}^{\prime}(r_{i})\right/ {\rm{P}}({\rm{rank}}_{k}|{\rm{condition}}_{j + 1}) \qquad \end{eqnarray}
(A6)
 
The conditional entropy of the likelihood function from that single trial outcome (a specific ranking response given to the presented condition) is then calculated as  
\begin{eqnarray} {\bf{H}}_{j + 1}({\boldsymbol{r}}|{\rm{condition}}_{j + 1},{\rm{rank}}_{k}) = - \mathop\sum\limits_i {\bf{L}}^{\prime}(r_{i}|{\rm{condition}}_{j + 1},{\rm{rank}}_{k}) \times {\rm{log}}\ {\bf{L}}^{\prime}(r_{i}|{\rm{condition}}_{j + 1},{\rm{ran}}{\rm{k}}_{k})\qquad \end{eqnarray}
(A7)
 
The expected entropy resulting from the j + 1 trial given a certain condition is the weighted average of the entropies calculated from the likelihood functions for each k of the different possible ranking responses. They are weighted by the probability of each ranking response being given  
\begin{eqnarray} {{\bf H}}_{j + 1}({\boldsymbol{r}}|{\rm{conditio}}{{\rm{n}}_{j + 1}}) = \mathop \sum \limits_k {{{\bf H}}_{j + 1}}({\boldsymbol{r}}|{\rm{conditio}}{{\rm{n}}_{j + 1}},{\rm{ran}}{{\rm{k}}_k}) \times {\rm{P(ran}}{{\rm{k}}_k}|{\rm{conditio}}{{\rm{n}}_{j + 1}}) \qquad \end{eqnarray}
(A8)
 
The conditions for the next trial (conditionj+1) are therefore chosen to minimize the value of Equation A8. As we consider only a relatively small set of possible conditions, we are able to calculate this value for all of the candidates and select the one that produces the smallest value. 
Appendix C: Supplementary analysis
In addition to the combined analysis given in Figure 4, we also analyzed the test–retest data for each individual filter or lens condition. These data are plotted in Figure A1, and the statistics for these individual conditions are reported in Table 1
Figure A1.
 
Test–retest reliability. The data of Figure 4 are replotted here, separated into the different filter or lens conditions (rows). The first two columns show the DiCOT measurements. The remaining two columns show data from the DLT. In each pair, the left column shows correlations between test and retest measurements. The solid diagonal line indicates equality. Each point indicates a data point from one participant in one lens/filter condition. The right column shows Bland–Altman plots giving the test–retest difference as a function of the mean of the two measures. The three black lines show the bias and the upper and lower LoA. The shaded region around each line gives the 95% CI.
Figure A1.
 
Test–retest reliability. The data of Figure 4 are replotted here, separated into the different filter or lens conditions (rows). The first two columns show the DiCOT measurements. The remaining two columns show data from the DLT. In each pair, the left column shows correlations between test and retest measurements. The solid diagonal line indicates equality. Each point indicates a data point from one participant in one lens/filter condition. The right column shows Bland–Altman plots giving the test–retest difference as a function of the mean of the two measures. The three black lines show the bias and the upper and lower LoA. The shaded region around each line gives the 95% CI.
When comparing the two measurement tools (DLT vs. DiCOT), we also analyzed the data from the individual conditions separately. These data are presented in Figure A2. The regression slopes from these plots are summarized in Table 2, and the complete analysis is reported in Table A2. A significant linear relationship was found between the DLT and DiCOT results for all conditions except no filter/lens due to the relatively small variation in binocular imbalance in our population. As in Figure 5, because the slope of the relationship in the scatterplots is not unity, we find a strong relationship between the DLT result value and the DLT–DiCOT error in the Bland–Altman analysis (where the slope of that relationship is equal to one minus the slope from the scatterplot). 
Figure A2.
 
Comparison between DLT and DiCOT. The data of Figure 5 are replotted here, separated into the different filter or lens conditions (rows). The first two columns show comparisons of the test (first) measurements. The remaining two columns show comparisons of the retest (second) measurements. In each pair, the left column shows scatterplots comparing the measurement made with one test against that made with the other. The diagonal green line is the best-fitting linear relationship (parameters given in green text). The black line shows equality. The right column shows Bland–Altman plots derived from the scatterplots. The three black lines show the bias and the upper and lower LoA. The shaded region around each line gives the 95% CI. The diagonal green line is the best-fitting linear relationship (parameters given in green text).
Figure A2.
 
Comparison between DLT and DiCOT. The data of Figure 5 are replotted here, separated into the different filter or lens conditions (rows). The first two columns show comparisons of the test (first) measurements. The remaining two columns show comparisons of the retest (second) measurements. In each pair, the left column shows scatterplots comparing the measurement made with one test against that made with the other. The diagonal green line is the best-fitting linear relationship (parameters given in green text). The black line shows equality. The right column shows Bland–Altman plots derived from the scatterplots. The three black lines show the bias and the upper and lower LoA. The shaded region around each line gives the 95% CI. The diagonal green line is the best-fitting linear relationship (parameters given in green text).
Table A2.
 
Comparison between DLT and DiCOT, with statistics from Figure A2, giving the consistency form of the ICC, as well as the linear regression analysis, the bias and LoA from the Bland–Altman plots, and the linear regression analysis from the Bland–Altman plots.
Table A2.
 
Comparison between DLT and DiCOT, with statistics from Figure A2, giving the consistency form of the ICC, as well as the linear regression analysis, the bias and LoA from the Bland–Altman plots, and the linear regression analysis from the Bland–Altman plots.
In Figure 6, we transformed the DiCOT results according to the overall linear relationship found with the DLT data. Here, we report an additional analysis where this transformation was performed separately for each filter or lens condition (note that the no filter/lens condition is excluded as there was no significant relationship to base the transformation on). This analysis is shown in Figure A3, and the statistics are reported in Table A3. The transformation accounts for both the slope and bias of the relationships reported in the scatterplots in Figure A2. Therefore, after the transformation, the scatterplot relationship is, by definition, y = x. There is no bias on the Bland–Altman plot and the slope of the regression in that plot is zero. The consistency ICC score is improved by this adjustment, although there remains a wide range of LoA on the Bland–Altman plot. 
Figure A3.
 
Comparison between DLT and adjusted DiCOT measurements. Similar to Figure A2, but with DiCOT values adjusted according to the linear relationships reported in Figure A2.
Figure A3.
 
Comparison between DLT and adjusted DiCOT measurements. Similar to Figure A2, but with DiCOT values adjusted according to the linear relationships reported in Figure A2.
Table A3.
 
Comparison between DLT and adjusted DiCOT measurements. Statistics are from Figure A3, giving the consistency ICC, as well as the bias and the LoA from the Bland–Altman plots.
Table A3.
 
Comparison between DLT and adjusted DiCOT measurements. Statistics are from Figure A3, giving the consistency ICC, as well as the bias and the LoA from the Bland–Altman plots.
Figure 1.
 
Stimulus display for a single DiCOT trial. (A) Screenshot of a trial with an arrangement of left eye stimuli (in red) and right eye stimuli (in cyan), with each set having a high, medium, and low contrast stimulus determined by the algorithm to provide information on the binocular imbalance of the participant. Depending on the degree of imbalance, in some cases the lower contrast stimuli may not be visible to the participant. (B) Illustration of the look-up tables (generated by Monte Carlo simulation, one per stimulus condition). Each column is a probability distribution giving the probability of a particular ranking response being made for that stimulus condition by an individual with that interocular ratio. Each row is a likelihood function, where the ranking made in response to a stimulus condition indicates the likelihood of the participant's visual system being balanced by some interocular ratio parameter (from –18 to +18 dB).
Figure 1.
 
Stimulus display for a single DiCOT trial. (A) Screenshot of a trial with an arrangement of left eye stimuli (in red) and right eye stimuli (in cyan), with each set having a high, medium, and low contrast stimulus determined by the algorithm to provide information on the binocular imbalance of the participant. Depending on the degree of imbalance, in some cases the lower contrast stimuli may not be visible to the participant. (B) Illustration of the look-up tables (generated by Monte Carlo simulation, one per stimulus condition). Each column is a probability distribution giving the probability of a particular ranking response being made for that stimulus condition by an individual with that interocular ratio. Each row is a likelihood function, where the ranking made in response to a stimulus condition indicates the likelihood of the participant's visual system being balanced by some interocular ratio parameter (from –18 to +18 dB).
Figure 2.
 
Post-response analysis performed in the DiCOT. (A) The ranking given by the participant selects a row in the look-up table for that trial. (B) The likelihoods from this row are converted to log values before being added to the cumulative history of the log likelihoods from all previous trials. The curves in this illustration show relative log likelihoods of the different parameter values based on the cumulative history up to either trial (i – 1), in gray, or up to trial i, in dark blue. This example demonstrates the possible impact of adding trial i, which is to sharpen the estimate of the best parameter value, as well as to vertically translate the function downward.
Figure 2.
 
Post-response analysis performed in the DiCOT. (A) The ranking given by the participant selects a row in the look-up table for that trial. (B) The likelihoods from this row are converted to log values before being added to the cumulative history of the log likelihoods from all previous trials. The curves in this illustration show relative log likelihoods of the different parameter values based on the cumulative history up to either trial (i – 1), in gray, or up to trial i, in dark blue. This example demonstrates the possible impact of adding trial i, which is to sharpen the estimate of the best parameter value, as well as to vertically translate the function downward.
Figure 3.
 
Binocular imbalance without lens/filter and under three lens/filter conditions. The bar graphs represent the mean interocular suppression ratio parameter at baseline and in the three conditions where the filter or lens is placed over either the left (OS) or right eye (OD). (A) Results from the DiCOT. (B) Results from the DLT. Error bars represent the bootstrapped 95% CI of the mean. Individual participant data are shown for each bar with the cross symbols.
Figure 3.
 
Binocular imbalance without lens/filter and under three lens/filter conditions. The bar graphs represent the mean interocular suppression ratio parameter at baseline and in the three conditions where the filter or lens is placed over either the left (OS) or right eye (OD). (A) Results from the DiCOT. (B) Results from the DLT. Error bars represent the bootstrapped 95% CI of the mean. Individual participant data are shown for each bar with the cross symbols.
Figure 4.
 
Test–retest reliability. The left column shows correlations between first and second (test and retest) measurements obtained using the DiCOT (A) and the DLT (C). The solid diagonal line indicates equality. Each point indicates a data point from one participant in one lens/filter condition (84 in total for the DiCOT, 82 in the DLT due to cases where a measurement result could not be obtained). The right column shows Bland–Altman plots giving the test–retest difference as a function of the mean of the two measures. The Bland–Altman plot for the DiCOT is given in B, corresponding to the test-retest plot in A. The Bland–Altman plot for the DLT is given in D, corresponding to the test-retest plot in C. The three black lines show the bias and the upper and lower LoA. The shaded region around each line gives the 95% CI. The results for each filter or lens condition are shown individually in Figure A1, and the statistics are reported in Table 1.
Figure 4.
 
Test–retest reliability. The left column shows correlations between first and second (test and retest) measurements obtained using the DiCOT (A) and the DLT (C). The solid diagonal line indicates equality. Each point indicates a data point from one participant in one lens/filter condition (84 in total for the DiCOT, 82 in the DLT due to cases where a measurement result could not be obtained). The right column shows Bland–Altman plots giving the test–retest difference as a function of the mean of the two measures. The Bland–Altman plot for the DiCOT is given in B, corresponding to the test-retest plot in A. The Bland–Altman plot for the DLT is given in D, corresponding to the test-retest plot in C. The three black lines show the bias and the upper and lower LoA. The shaded region around each line gives the 95% CI. The results for each filter or lens condition are shown individually in Figure A1, and the statistics are reported in Table 1.
Figure 5.
 
Comparison between DLT and DiCOT. The left column shows scatterplots comparing the measurement made with one test against that made with the other. The top row (A) compares the test (first) measurements, and the bottom (C) compares the second (retest) measurements. The diagonal green line is the best-fitting linear relationship (parameters given in green text). The black line shows equality. The right column shows Bland–Altman plots derived from the scatterplots (B for the first measurements, D for the second measurements). The three black lines show the bias and the upper and lower LoA. The shaded region around each line gives the 95% CI. The diagonal green line is the best-fitting linear relationship (parameters given in green text). The results for each filter or lens condition are shown individually in Figure A2, with statistics reported in part in Table 2 and in full in Table A1.
Figure 5.
 
Comparison between DLT and DiCOT. The left column shows scatterplots comparing the measurement made with one test against that made with the other. The top row (A) compares the test (first) measurements, and the bottom (C) compares the second (retest) measurements. The diagonal green line is the best-fitting linear relationship (parameters given in green text). The black line shows equality. The right column shows Bland–Altman plots derived from the scatterplots (B for the first measurements, D for the second measurements). The three black lines show the bias and the upper and lower LoA. The shaded region around each line gives the 95% CI. The diagonal green line is the best-fitting linear relationship (parameters given in green text). The results for each filter or lens condition are shown individually in Figure A2, with statistics reported in part in Table 2 and in full in Table A1.
Figure 6.
 
Comparison between DLT and adjusted DiCOT measurements. Panel A shows the relationship between the two measurements for the first (test) measurements, with the Bland–Altman plot shown in B. Panel C shows the relationship for the second (retest) measurements with the Bland–Altman plot shown in D. Similar to Figure 5; however, in this case the DiCOT values are adjusted according to the best-fitting linear relationships shown in green in Figure 5A (for the test measurements) and Figure 5C (for the retest measurements). The results for each filter or lens condition are shown individually in Figure A3, with statistics reported in-part in Table 2 and in-full in Table A2.
Figure 6.
 
Comparison between DLT and adjusted DiCOT measurements. Panel A shows the relationship between the two measurements for the first (test) measurements, with the Bland–Altman plot shown in B. Panel C shows the relationship for the second (retest) measurements with the Bland–Altman plot shown in D. Similar to Figure 5; however, in this case the DiCOT values are adjusted according to the best-fitting linear relationships shown in green in Figure 5A (for the test measurements) and Figure 5C (for the retest measurements). The results for each filter or lens condition are shown individually in Figure A3, with statistics reported in-part in Table 2 and in-full in Table A2.
Figure A1.
 
Test–retest reliability. The data of Figure 4 are replotted here, separated into the different filter or lens conditions (rows). The first two columns show the DiCOT measurements. The remaining two columns show data from the DLT. In each pair, the left column shows correlations between test and retest measurements. The solid diagonal line indicates equality. Each point indicates a data point from one participant in one lens/filter condition. The right column shows Bland–Altman plots giving the test–retest difference as a function of the mean of the two measures. The three black lines show the bias and the upper and lower LoA. The shaded region around each line gives the 95% CI.
Figure A1.
 
Test–retest reliability. The data of Figure 4 are replotted here, separated into the different filter or lens conditions (rows). The first two columns show the DiCOT measurements. The remaining two columns show data from the DLT. In each pair, the left column shows correlations between test and retest measurements. The solid diagonal line indicates equality. Each point indicates a data point from one participant in one lens/filter condition. The right column shows Bland–Altman plots giving the test–retest difference as a function of the mean of the two measures. The three black lines show the bias and the upper and lower LoA. The shaded region around each line gives the 95% CI.
Figure A2.
 
Comparison between DLT and DiCOT. The data of Figure 5 are replotted here, separated into the different filter or lens conditions (rows). The first two columns show comparisons of the test (first) measurements. The remaining two columns show comparisons of the retest (second) measurements. In each pair, the left column shows scatterplots comparing the measurement made with one test against that made with the other. The diagonal green line is the best-fitting linear relationship (parameters given in green text). The black line shows equality. The right column shows Bland–Altman plots derived from the scatterplots. The three black lines show the bias and the upper and lower LoA. The shaded region around each line gives the 95% CI. The diagonal green line is the best-fitting linear relationship (parameters given in green text).
Figure A2.
 
Comparison between DLT and DiCOT. The data of Figure 5 are replotted here, separated into the different filter or lens conditions (rows). The first two columns show comparisons of the test (first) measurements. The remaining two columns show comparisons of the retest (second) measurements. In each pair, the left column shows scatterplots comparing the measurement made with one test against that made with the other. The diagonal green line is the best-fitting linear relationship (parameters given in green text). The black line shows equality. The right column shows Bland–Altman plots derived from the scatterplots. The three black lines show the bias and the upper and lower LoA. The shaded region around each line gives the 95% CI. The diagonal green line is the best-fitting linear relationship (parameters given in green text).
Figure A3.
 
Comparison between DLT and adjusted DiCOT measurements. Similar to Figure A2, but with DiCOT values adjusted according to the linear relationships reported in Figure A2.
Figure A3.
 
Comparison between DLT and adjusted DiCOT measurements. Similar to Figure A2, but with DiCOT values adjusted according to the linear relationships reported in Figure A2.
Table 1.
 
Test–retest reliability statistics shown in Figure 4 and Figure A1. The agreement form of the ICC is reported, as well as the bias and LoA from the Bland–Altman plots.
Table 1.
 
Test–retest reliability statistics shown in Figure 4 and Figure A1. The agreement form of the ICC is reported, as well as the bias and LoA from the Bland–Altman plots.
Table 2.
 
Consistency ICC calculated for the individual lens/filter conditions (as shown in Figure 5), in each case combining the OS and OD data. As there was no significant linear relationship for the no filter/lens condition, the data were not included for the individual analysis with the adjusted DiCOT.
Table 2.
 
Consistency ICC calculated for the individual lens/filter conditions (as shown in Figure 5), in each case combining the OS and OD data. As there was no significant linear relationship for the no filter/lens condition, the data were not included for the individual analysis with the adjusted DiCOT.
Table A1.
 
Results of a review of existing methods for measuring interocular suppression.
Table A1.
 
Results of a review of existing methods for measuring interocular suppression.
Table A2.
 
Comparison between DLT and DiCOT, with statistics from Figure A2, giving the consistency form of the ICC, as well as the linear regression analysis, the bias and LoA from the Bland–Altman plots, and the linear regression analysis from the Bland–Altman plots.
Table A2.
 
Comparison between DLT and DiCOT, with statistics from Figure A2, giving the consistency form of the ICC, as well as the linear regression analysis, the bias and LoA from the Bland–Altman plots, and the linear regression analysis from the Bland–Altman plots.
Table A3.
 
Comparison between DLT and adjusted DiCOT measurements. Statistics are from Figure A3, giving the consistency ICC, as well as the bias and the LoA from the Bland–Altman plots.
Table A3.
 
Comparison between DLT and adjusted DiCOT measurements. Statistics are from Figure A3, giving the consistency ICC, as well as the bias and the LoA from the Bland–Altman plots.
×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×