June 2023
Volume 23, Issue 6
Open Access
Methods  |   June 2023
Collective endpoint of visual acuity and contrast sensitivity function from hierarchical Bayesian joint modeling
Author Affiliations
  • Yukai Zhao
    Center for Neural Science, New York University, New York, NY, USA
    zhaoyukai@nyu.edu
  • Luis Andres Lesmes
    Adaptive Sensory Technology Inc., San Diego, CA, USA
    luis.lesmes@adaptivesensorytech.com
  • Michael Dorr
    Adaptive Sensory Technology Inc., San Diego, CA, USA
    michael.dorr@adaptivesensorytech.com
  • Zhong-Lin Lu
    Division of Arts and Sciences, NYU Shanghai, Shanghai, China
    Center for Neural Science and Department of Psychology, New York University, New York, NY, USA
    NYU-ECNU Institute of Brain and Cognitive Neuroscience, Shanghai, China
    zhonglin@nyu.edu
Journal of Vision June 2023, Vol.23, 13. doi:https://doi.org/10.1167/jov.23.6.13
  • Views
  • PDF
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Yukai Zhao, Luis Andres Lesmes, Michael Dorr, Zhong-Lin Lu; Collective endpoint of visual acuity and contrast sensitivity function from hierarchical Bayesian joint modeling. Journal of Vision 2023;23(6):13. https://doi.org/10.1167/jov.23.6.13.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

Clinical trials typically analyze multiple endpoints for signals of efficacy. To improve signal detection for treatment effects using the high-dimensional data collected in trials, we developed a hierarchical Bayesian joint model (HBJM) to compute a five-dimensional collective endpoint (CE5D) of contrast sensitivity function (CSF) and visual acuity (VA). The HBJM analyzes row-by-row CSF and VA data across multiple conditions, and describes visual functions across a hierarchy of population, individuals, and tests. It generates joint posterior distributions of CE5D that combines CSF (peak gain, peak frequency, and bandwidth) and VA (threshold and range) parameters. The HBJM was applied to an existing dataset of 14 eyes, each tested with the quantitative VA and quantitative CSF procedures in four Bangerter foil conditions. The HBJM recovered strong correlations among CE5D components at all levels. With 15 qVA and 25 qCSF rows, it reduced the variance of the estimated components by 72% on average. Combining signals from VA and CSF and reducing noises, CE5D exhibited significantly higher sensitivity and accuracy in discriminating performance differences between foil conditions at both the group and test levels than the original tests. The HBJM extracts valuable information about covariance of CSF and VA parameters, improves precision of the estimated parameters, and increases the statistical power in detecting vision changes. By combining signals and reducing noise from multiple tests for detecting vision changes, the HBJM framework exhibits potential to increase statistical power for combining multi-modality data in ophthalmic trials.

Introduction
As research studies that test new methods of prevention, detection, treatment, and care in healthcare, clinical trials often consist of five phases, early phase I and phases I–IV, with each phase designed to test a different aspect of the new drug, device, or method (Rosenberger & Haines, 2002; Thall, 2008; Torres-Saavedra & Winter, 2022; Umscheid, Margolis, & Grossman, 2011). Although early phase I trials test the initial safety and potency of the new treatment, phase I trials aim to determine how effective the new drug, treatment, or device is and how the human body processes it, phase II trials investigate if the new treatment leads to prolonged life or better health outcomes, phase III trials compare the new treatment to existing standard treatment, and phase IV trials evaluate the new treatment after approval from the Food and Drug Administration. Although clinical trials often involve hundreds or even thousands of subjects, the sample size in most early phase I and phase I trials is about 10 to 20 (Cook, Hansen, Siu, & Abdul, 2015; Rosenberger & Haines, 2002; Storer, 1989; Umscheid et al., 2011). 
Endpoints are measured in clinical trials to evaluate the efficacy of a new treatment. An endpoint is an event or outcome that can be measured objectively to determine whether the treatment being studied is beneficial. In ophthalmic trials, the endpoint could be a clinical event (e.g., retinal hemorrhage, stroke), patient symptom (e.g., pain), visual function (e.g., visual acuity, contrast sensitivity), retinal structure (e.g., optical coherence tomography, fundus photography), or an equivalent surrogate (McLeod et al., 2019; US Department of Health and Human Services et al., 1998; US Department of Health and Human Services et al., 2022). To obtain multi-faceted assessment of treatment outcomes, most clinical trials collect multiple endpoints and analyze them independently (but see composite endpoint; Kramer et al., 2006; McLeod et al., 2019; US Department of Health and Human Services et al., 1998; US Department of Health and Human Services et al., 2022). 
In this study, we attempted to address an important issue in early-stage retina trials with multiple functional vision endpoints, in which statistical power for detecting vision gains based on each endpoint is limited by the variance caused by small sample size, noisy measurements, and variable treatment response (Abou-Hanna, Andrews, Khan, Musch, & Jayasundera, 2021; Allingham, Mettu, & Cousins, 2022; Cukras et al., 2018; da Cruz et al., 2018; de Guimaraes, Georgiou, Bainbridge, & Michaelides, 2021; Reichel et al., 2022). We developed a hierarchical Bayesian joint model (HBJM) to exploit the relationship between two endpoints, visual acuity (VA) and contrast sensitivity function (CSF), and computed a five-dimensional collective endpoint (CE5D) that consists of two parameters of the visual acuity behavioral function (VABF) (Lesmes & Dorr, 2019) and three parameters of contrast sensitivity function (Watson & Ahumada, 2005). CE5D is considered as a “collective endpoint” because it combines information from two endpoint assessments. We demonstrated that the high dimensional collective endpoint significantly improved the statistical power in a study with a small sample size (N = 14 eyes). 
Visual acuity and contrast sensitivity function tests are designed to assess different aspects of vision. While VA focuses on the spatial resolution of vision that is important for resolving details in tasks such as reading (Whittaker & Lovie-Kitchin, 1993), CSF measures the ability to detect subtle light changes against the background that is important for tasks such as mobility (Lord, 2006) and driving (Owsley & McGwin, 2010). It's also well known that many visual diseases affect CSF and VA differentially, with reports of CSF deficits but normal or near-normal VA in early stages of many clinical conditions (Elliott & Situ, 1998; Hawkins, Szlyk, Ardickas, Alexander, & Wilensky, 2003; Hou et al., 2010; Jia, Zhou, Lu, Lesmes, & Huang, 2015; Lesmes et al., 2016; Midena, Angeli, Blarzino, Valenti, & Segato, 1997; Onal, Yenice, Cakir, & Temel, 2008; Ou, Lesmes, Christie, Denlar, & Csaky, 2021; Ramulu, Dave, & Friedman, 2015; Shandiz et al., 2011; Stellmann, Young, Pottgen, Dorr, & Heesen, 2015; Vingopoulos et al., 2021; Wai et al., 2021; Wilensky & Hawkins, 2001; Zimmern, Campbell, & Wilkinson, 1979). On the other hand, VA and CS both reflect properties of the same visual system for an individual, and there are many reports of significant correlations between VA and CSF in normal older adults and clinical populations (Alexander, Derlacki, & Fishman, 1995; Bellmann, Unnebrink, Rubin, Miller, & Holz, 2003; Brown & Lovie-Kitchin, 1989; Cormack, Tovee, & Ballard, 2000; Elliott & Situ, 1998; Haegerstrom-Portnoy Schneck, & Brabyn, 1999; Kiser, Mladenovich, Eshraghi, Bourdeau, & Dagnelie, 2005; Roark & Stringham, 2019; Rubin, Roche, Prasada-Rao, & Fried, 1994; West et al., 2002). Although one could ultimately construct a unified model in which VA and CSF are determined by both shared and independent parameters that are affected differentially by different diseases and in different disease stages, much more basic and clinical research is necessary. Here, we take an alternative approach based on the covariance joint modeling framework (Palestro et al., 2018; Turner, 2015; Turner et al., 2013; Turner, Van Maanen, & Forstmann, 2015; Turner, Rodriguez, Norcia, McClure, & Steyvers, 2016; Turner, Wang, & Merkle, 2017) in which VA and CSF are modeled separately but with covariances between parameters of the models. 
Most experimental designs follow a hierarchical structure (Kim, Pitt, Lu, Steyvers, & Myung, 2014; Yin, Qin, Sargent, Erlichman, & Shi, 2018). At the top, there is the study population, which can be divided into different treatment arms. Each arm consists of multiple subjects that are tested with multiple assessments, and each assessment could be repeated multiple times. While many common statistical methods assume either homogeneity (e.g., fixed effects) or complete independence across subjects or tests, hierarchical modeling is a statistical method that has been developed to quantitatively and coherently combine heterogeneous information (Kruschke, 2014; Rouder & Lu, 2005). Constructed by combining sub-models and probability distributions at multiple levels that represent relationships between information arising within and between levels, hierarchical models are often computed with Bayes’ theorem from all the observed data in a study (rather than data from a subject or a test) (Kruschke, 2014; Kruschke & Liddell, 2018). The result is an updated joint posterior distribution of all the hyperparameters and parameters (Gu et al., 2016; Kruschke, 2014; Zhao, Lesmes, Dorr, & Lu, 2021). By sharing information within and across levels via conditional dependencies, hierarchical Bayesian modeling (HBM) can reduce the variance of the estimated posterior distributions through (1) decomposition of variabilities from different sources with parameters and hyperparameters (Song, Behmanesh, Moaveni, & Papadimitriou, 2020), and (2) shrinkage of the estimated parameters at the lower levels toward the modes of the higher levels when there is not sufficient data at the lower level (Kruschke, 2014; Rouder, Sun, Speckman, Lu, & Zhou, 2003; Rouder & Lu, 2005). 
Hierarchical Bayesian models (HBMs) have been developed and widely applied in cognitive science, including signal detection (Rouder et al., 2003; Rouder & Lu, 2005), decision making (Lee, 2006; Merkle, Smithson, & Verkuilen, 2011), and functional magnetic resonance imaging (Ahn, Krawitz, Kim, Busmeyer, & Brown, 2011; Wilson, Cranmer, & Lu, 2020). HBJM has been developed in cognitive neuroscience to investigate brain-behavior relationships (Molloy et al., 2018; Molloy, Bahg, Lu, & Turner, 2019; Palestro et al., 2018). As an alternative to the two-stage correlation approach in cognitive neuroscience, where estimated parameters of a cognitive model on human behavior are simply correlated with a neural measure of interest, joint modeling enforces constraints on the parameters of the cognitive model using the neural data. It has been used to perform inference on brain-behavior relationships, improve the precision of the estimated model parameters, and make predictions about either neural or behavioral data (Cassey, Gaut, Steyvers, & Brown, 2016; Turner, 2015; Turner et al., 2013; Turner et al., 2015; Turner et al., 2016; Turner et al., 2017). Although we have previously developed HBMs to estimate parameters of the VABF and CSF models within each type of measurement and significantly improved the accuracy and precision of the estimates (Zhao, Lesmes, Dorr, & Lu, 2021; Zhao, Lesmes, Hou, & Lu, 2021), we used HBJM to model the relationships between VA and CSF in this study. Our goals were to (1) compute a multi-dimensional collective endpoint (CE5D) of CSF and VA by modeling row-by-row VA and CSF data simultaneously, and (2) exploit the statistically reciprocal relationships between VA and CSF parameters to combine signals from VA and CSF tests and reduce the variability of the estimated endpoints and therefore improve the power of statistical inference (Figure 1). 
Figure 1.
 
Illustration of a two-dimensional CE. (A) The gray iso-density contours represent two-dimensional distributions of a VA-CSF CE in two conditions. The blue and purple curves represent VA and CSF distributions computed from the BIP. The black curves represent the marginal distributions of CE components on the VA and CSF axes, and the projection of CE on the decision axis. The p values for detecting the change between the two conditions are 0.15 based on VA and CSF from BIP, 0.10 based on VA and CSF from CE, and 0.006 based on CE. (B) Signals from BIP analyses of VA (blue) and CSF (purple), and from CE components and CE (black). (C) Noise from BIP analyses of VA (blue) and CSF (purple), and from CE components and CE (black). (D) Signal-to-noise ratios from BIP analyses of VA (blue) and CSF (purple), and from CE components and CE (black).
Figure 1.
 
Illustration of a two-dimensional CE. (A) The gray iso-density contours represent two-dimensional distributions of a VA-CSF CE in two conditions. The blue and purple curves represent VA and CSF distributions computed from the BIP. The black curves represent the marginal distributions of CE components on the VA and CSF axes, and the projection of CE on the decision axis. The p values for detecting the change between the two conditions are 0.15 based on VA and CSF from BIP, 0.10 based on VA and CSF from CE, and 0.006 based on CE. (B) Signals from BIP analyses of VA (blue) and CSF (purple), and from CE components and CE (black). (C) Noise from BIP analyses of VA (blue) and CSF (purple), and from CE components and CE (black). (D) Signal-to-noise ratios from BIP analyses of VA (blue) and CSF (purple), and from CE components and CE (black).
We began with generative models of VA and CSF that, with appropriate parameters, can reproduce the row-by-row data from the measurements. We used the VABF with two parameters, VA threshold and VA range, as the generative model of observer performance in VA assessment (Figure 2A) (Lesmes & Dorr, 2019), and a log parabola with three parameters, peak gain (PG), peak spatial frequency (PF) and bandwidth at half-height (BH) (Figure 2C) (Watson & Ahumada, 2005), as the generative model of observer performance in CSF assessment. Both models have previously been used to model and reproduce row-by-row data in their respective measurements (Figures 2B, 2E) (Hou, Lesmes, Bex, Dorr, & Lu, 2015; Hou et al., 2016; Lesmes, Lu, Baek, & Albright, 2010; Zhao, Lesmes, Dorr, Bex, & Lu, 2021; Zhao, Lesmes, Dorr, & Lu, 2021; Zhao, Lesmes, Hou, et al., 2021). We defined CE5D as a five-dimensional vector consisting of VA threshold, VA range, PG, PF, and BH. 
Figure 2.
 
(A) The VABF with two parameters: threshold \(\theta _{ij,Threshold}^{VA}\;\)and range \(\theta _{ij,Range}^{VA}\). (B) Simulated row-by-row data generated by the VABF model in a VA assessment. (C) The log parabola model of the CSF with three parameters: peak gain \(\theta _{ij,PG}^{CSF}\), peak spatial frequency \(\theta _{ij,PF}^{CSF}\), and bandwidth \(\theta _{ij,BH}^{CSF}\). (D) Psychometric functions at different spatial frequencies in a CSF test. (E) Simulated row-by-row data generated by the CSF model in a CSF assessment.
Figure 2.
 
(A) The VABF with two parameters: threshold \(\theta _{ij,Threshold}^{VA}\;\)and range \(\theta _{ij,Range}^{VA}\). (B) Simulated row-by-row data generated by the VABF model in a VA assessment. (C) The log parabola model of the CSF with three parameters: peak gain \(\theta _{ij,PG}^{CSF}\), peak spatial frequency \(\theta _{ij,PF}^{CSF}\), and bandwidth \(\theta _{ij,BH}^{CSF}\). (D) Psychometric functions at different spatial frequencies in a CSF test. (E) Simulated row-by-row data generated by the CSF model in a CSF assessment.
Next, we developed an HBJM with population, individual, and test levels to model the row-by-row data from both VA and CSF assessments on all individuals in an experiment within a single structure and compute the CE parameter and hyperparameter distributions at all three levels, taking into account of the hierarchical experimental design, covariance of CE components, and conditional dependency across different levels (Kruschke, 2014; Zhao, Lesmes, Dorr, & Lu, 2021; Zhao, Lesmes, Hou, et al., 2021). The new development enables us to compute the joint distributions of the five-dimensional CE5D from the row-by-row data of human observers in all the tests and make statistical inferences at the population, individual, and test levels. 
We applied the HBJM to an existing experimental dataset (Zhao, Lesmes, Dorr, Bex, et al., 2021; Zhao, Lesmes, Dorr, & Lu, 2021) with 14 eyes, each tested with the quantitative VA (qVA) (Lesmes & Dorr, 2019; Zhao, Lesmes, Dorr, & Lu, 2021) and quantitative CSF (qCSF) (Hou et al., 2015; Lesmes et al., 2010) procedures in four Bangerter foil conditions, and computed CE5D distribution for each eye in each condition. We quantified the uncertainty, sensitivity, and discriminability of the estimated CE5D, and compared them with metrics from Bayesian inference procedure (BIP) analyses of the tests, VA1D and AULCSF1D
Methods
Data
The dataset consisted of 14 eyes of seven subjects with normal or corrected-to-normal vision, each tested with qVA and qCSF in four conditions: no foil (F0) and three levels of Bangerter foils (Ryser Ophtalmologie, St. Gallen, Switzerland) with nominal acuities of 20/25 (F1), 20/30 (F2), and 20/100 (F3). The qVA data were included in previous publications (Zhao, Lesmes, Dorr, Bex, et al., 2021; Zhao, Lesmes, Dorr, & Lu, 2021). The qCSF data have not been published. The test order of the four conditions was randomized across subjects. Each qVA test consisted of 45 rows (M = 45), each with three equal-size high-contrast optotypes (Figure 3A) randomly sampled without replacement from the 10 Sloan letters (C, D, H, K, N, O, R, S, V, and Z). The size of the optotypes was determined by qVA (Lesmes & Dorr, 2019; Zhao, Lesmes, Dorr, Bex, et al., 2021). Each qCSF test consisted of 50 rows (K = 50), each with three equal-size bandpass-filtered optotypes (Figure 3B) randomly sampled with replacement from the ten Sloan letters (C, D, H, K, N, O, R, S, V, and Z). The sizes (and therefore center spatial frequencies) and contrasts of the optotypes were determined by qCSF (Hou et al., 2015; Lesmes et al., 2010). In both tests, the stimuli stayed on the screen until the subject verbally reported the identities of the letters presented on the screen, which were entered into the computer by the experimenter. 
Figure 3.
 
Illustrations of the test stimuli in one row used in the (A) qVA and (B) qCSF.
Figure 3.
 
Illustrations of the test stimuli in one row used in the (A) qVA and (B) qCSF.
Apparatus
Both qVA and qCSF were implemented in MATLAB (MathWorks Corp., Natick, MA, USA). A 24-inch Dell P2415Q liquid-crystal display monitor at a 3840 × 2160 pixel resolution and a 55-inch Samsung UN55FH6030 monitor at a 1920 × 1080 pixel resolution were used for the qVA and qCSF tests, respectively. Subjects viewed the displays monocularly at four meters. The untested eye was covered with an opaque patch. A Dell computer with Intel Xeon W-2145 @ 3.70GHz CPU (8 cores and 16 threads) and 64GB installed memory (RAM) was used to conduct all analyses with JAGS (Plummer, 2003) in R (R Core Team, 2003). 
Hierarchical Bayesian joint modeling of VA and CSF assessments
In this section, we first describe the generative models of VA and CSF, and the corresponding BIP to infer the parameters of the generative models for an individual in a single test. We then introduce the three-level HBJM to account for the row-by-row data from both VA and CSF assessments on all individuals within a single structure and compute the joint distributions of the collective endpoint (CE5D), comprised of parameters of the two generative models, across all tests and individuals and at the population level. Finally, we describe the computational and statistical inference procedures. In the current development, we treat each unique combination of eye and Bangerter foil condition as an individual. 
Generative model of visual acuity and Bayesian inference procedure
With parameters \(\theta _{ij}^{VA} = ( {\theta _{ij,Threshold}^{VA},\theta _{ij,Range}^{VA}} )\), where \(\theta _{ij,Threshold}^{VA}\) is VA threshold corresponding to the = 2 performance level, and \(\theta _{ij,Range}^{VA}\) is VA range defined as the difference between VA thresholds at = 1 and = 4 performance levels, the generative model of visual acuity (Figure 4A) specifies the probability of obtaining rijm (=0, 1, 2, or 3) correct responses from individual i tested with optotype size osijm in row m of the jth test (Figure 2A) (Lesmes & Dorr, 2019; Zhao, Lesmes, Dorr, Bex, et al., 2021):  
\begin{eqnarray}p\left( {r_{ijm}^{VA}{\rm{|}}\theta _{ij}^{VA},o{s_{ijm}}} \right) = {\rm{G}}\left( {o{s_{ijm}},\theta _{ij}^{VA}} \right),\quad\end{eqnarray}
(1)
where \({\rm{G}}( {o{s_{ijm}},\theta _{ij}^{VA}} )\) is the probability of the number of correct identifications in a row (Supplementary Material A). The probability of observing the responses of individual i in all M rows in the jth test is the product of \(p( {r_{ijm}^{VA}{\rm{|}}\theta _{ij}^{VA},o{s_{ijm}}} )\) in all rows:  
\begin{eqnarray} && p\left( {r_{ij,1:M}^{VA}{\rm{|}}\theta _{ij}^{VA},o{s_{ij,1:M}}} \right)\nonumber\\ && \quad = \mathop \prod \limits_{m = 1}^M p\left( {r_{ijm}^{VA}{\rm{|}}\theta _{ij}^{VA},o{s_{ijm}}} \right).\quad\end{eqnarray}
(2)
 
Figure 4.
 
(A) The generative model of visual acuity specifies the VABF of individual i in test j with two parameters: \(\theta _{ij}^{VA} = ( {\theta _{ij,Threshold}^{VA},\theta _{ij,Range}^{VA}} )\), where \(\theta _{ij,Threshold}^{VA}\) and \(\theta _{ij,Range}^{VA}\) are the VA threshold and VA range. (B) The generative model of CSF specifies the CSF of individual i in test j with three parameters\(:\;\theta _{ij}^{CSF} = ( {\theta _{ij,PG}^{CSF},\theta _{ij,PF}^{CSF},\theta _{ij,BH}^{CSF}} )\), where \(\theta _{ij,PG}^{CSF}\;\)is the peak sensitivity, \(\theta _{ij,PF}^{CSF}\) is the peak spatial frequency (cycles/degree), and \(\theta _{ij,BH}^{CSF}\) is the bandwidth (octaves) at half of the peak sensitivity. (C) A three-level HBJM of the VABF and CSF across multiple individuals and tests. At the population level, μCE and Σ are the mean and covariance hyperparameters of the population. At the individual level \(\rho _i^{{\rm{CE}}}\), ϕCSF, and ϕVA are the mean and covariance hyperparameters of individual i. At the test level, \(\theta _{ij}^{CE}\) are the parameters of individual i in test j.
Figure 4.
 
(A) The generative model of visual acuity specifies the VABF of individual i in test j with two parameters: \(\theta _{ij}^{VA} = ( {\theta _{ij,Threshold}^{VA},\theta _{ij,Range}^{VA}} )\), where \(\theta _{ij,Threshold}^{VA}\) and \(\theta _{ij,Range}^{VA}\) are the VA threshold and VA range. (B) The generative model of CSF specifies the CSF of individual i in test j with three parameters\(:\;\theta _{ij}^{CSF} = ( {\theta _{ij,PG}^{CSF},\theta _{ij,PF}^{CSF},\theta _{ij,BH}^{CSF}} )\), where \(\theta _{ij,PG}^{CSF}\;\)is the peak sensitivity, \(\theta _{ij,PF}^{CSF}\) is the peak spatial frequency (cycles/degree), and \(\theta _{ij,BH}^{CSF}\) is the bandwidth (octaves) at half of the peak sensitivity. (C) A three-level HBJM of the VABF and CSF across multiple individuals and tests. At the population level, μCE and Σ are the mean and covariance hyperparameters of the population. At the individual level \(\rho _i^{{\rm{CE}}}\), ϕCSF, and ϕVA are the mean and covariance hyperparameters of individual i. At the test level, \(\theta _{ij}^{CE}\) are the parameters of individual i in test j.
In other words, with known \(\theta _{ij}^{VA}\), the VA generative model allows us to compute the probability of the number of correct optotype identifications in each row as well as the probability of obtaining the entire dataset in a single test. It can be used to generate the row-by-row responses in VA tests (Figure 2B). 
On the flipside, we can infer the posterior distribution of \(\theta _{ij}^{VA}\) from the row-by-row data \(Y_{ij}^{VA} = \{ {( {r_{ij,1:M}^{VA},o{s_{ij,1:M}}} )} \}\;\)of individual i in the jth test with the BIP (Lesmes et al., 2010; Zhao, Lesmes, Dorr, Bex, et al., 2021; Zhao, Lesmes, Dorr, & Lu, 2021):  
\begin{eqnarray} && p\left( {\theta _{ij}^{VA}{\rm{|}}Y_{ij}^{VA}} \right)\nonumber\\ && \quad = \frac{{\mathop \prod \nolimits_{m = 1}^M p\left( {r_{ijm}^{VA}{\rm{|}}\theta _{ij}^{VA},o{s_{ijm}}} \right){p_0}\left( {\theta _{ij}^{VA}} \right)}}{{\smallint \mathop \prod \nolimits_{m = 1}^M p\left( {r_{ijm}^{VA}{\rm{|}}\theta _{ij}^{VA},o{s_{ijm}}} \right){p_0}\left( {\theta _{ij}^{VA}} \right)d\theta _{ij}^{VA}}},\quad\end{eqnarray}
(3)
where \({p_0}( {\theta _{ij}^{VA}} )\) is the prior probability distribution of \(\theta _{ij}^{VA}\), and the denominator is the integral across all possible values of \(\theta _{ij}^{VA}\)
Generative model of contrast sensitivity function and Bayesian inference procedure
The generative model of the CSF (Figure 4B) specifies contrast sensitivity \(S( {{f_{ijk}},\theta _{ij}^{CSF}} )\) at spatial frequency fijk for individual i in row k of the jth test using a log parabola function with three parameters \(\theta _{ij}^{CSF} = ( {\theta _{ij,PG}^{CSF},\theta _{ij,PF}^{CSF},\theta _{ij,BH}^{CSF}} )\) (Figure 2C) (Lesmes et al., 2010; Rohaly & Owsley, 1993; Watson & Ahumada, 2005; Zhao, Lesmes, Hou, et al., 2021):  
\begin{eqnarray}&&{\rm{lo}}{{\rm{g}}_{10}}\left( {S\left( {{f_{ijk}},\theta _{ij}^{CSF}} \right)} \right) = \;{\rm{lo}}{{\rm{g}}_{10}}\left( {\theta _{ij,PG}^{CSF}} \right)\nonumber\\ && \quad - \frac{4}{{{\rm{lo}}{{\rm{g}}_{10}}\left( 2 \right)}}{\left( {\frac{{{\rm{lo}}{{\rm{g}}_{10}}\left( {{f_{ijk}}} \right) - {\rm{lo}}{{\rm{g}}_{10}}\left( {\theta _{ij,PF}^{CSF}} \right)}}{{\theta _{ij,BH}^{CSF}}}} \right)^2},\quad\end{eqnarray}
(4)
where \(\theta _{ij,PG}^{CSF}\;\)is the peak gain, \(\theta _{ij,PF}^{CSF}\) is the peak spatial frequency (cycles/degree), and \(\theta _{ij,BH}^{CSF}\) is the bandwidth (octaves) at half of the peak sensitivity. The probability of obtaining a correct response is described with a psychometric function (Figure 2D) (Hou et al., 2015):  
\begin{eqnarray} && p\left( {r_{ijkl}^{CSF} = 1{\rm{|}}\theta _{ij}^{CSF},{f_{ijk}},{c_{ijkl}}\;} \right) = g + \left( {1 - g - \frac{\lambda }{2}} \right)\nonumber\\ && \quad {\rm{\Phi }}\left( {\frac{{{\rm{lo}}{{\rm{g}}_{10}}\left( {{c_{ijkl}}} \right) + {\rm{lo}}{{\rm{g}}_{10}}\left( {S\left( {{f_{ijk}},\theta _{ij}^{CSF}} \right)} \right)}}{\sigma }} \right),\end{eqnarray}
(5)
where \(r_{ijkl}^{CSF}\)and cijkl are the response to and contrast of the lth (=1, 2, 3) optotype in row k with a total of L optotypes; fijk is the spatial frequency for all three optotypes in the row; g is the guessing rate, λ, set to 0.04, is the lapse rate (Hou et al., 2015; Lesmes et al., 2010; Wichmann & Hill, 2001), Φ is the standard cumulative Gaussian function, and σ determines the steepness of the psychometric function. The probability of an incorrect response is:  
\begin{eqnarray}&& p\left( {r_{ijkl}^{CSF} = 0{\rm{|}}\theta _{ij}^{CSF},{f_{ijk}},{c_{ijkl}}} \right)\nonumber\\ && \quad = 1 - p\left( {r_{ijkl}^{CSF} = 1{\rm{|}}\theta _{ij}^{CSF},{f_{ijk}},{c_{ijkl}}\;} \right).\quad\end{eqnarray}
(6)
 
The likelihood function, defined by Equations 5 and 6, quantifies the probability of a correct or incorrect response given the stimulus and CSF parameters in a row. The probability of observing the particular responses of individual i in all K rows in the jth test is the product of \(p( {r_{ijkl}^{CSF}{\rm{|}}\theta _{ij}^{CSF},{f_{ijk}},{c_{ijkl}}} )\) in all rows:  
\begin{eqnarray} && p\left( {r_{ij,1:K,1:L}^{CSF}{\rm{|}}\theta _{ij}^{CSF},{f_{ij,1:K}},{c_{ij,1:K,1:L}}} \right)\nonumber \\ && \quad = \mathop \prod \limits_{k = 1}^K \mathop \prod \limits_{l = 1}^L p\left( {r_{ijkl}^{CSF}{\rm{|}}\theta _{ij}^{CSF},{f_{ijk}},{c_{ijkl}}} \right).\quad\end{eqnarray}
(7)
 
In other words, with known \(\theta _{ij}^{CSF}\), the CSF generative model allows us to compute the probability of a correct or incorrect response to each optotype in every row as well as the probability of obtaining the entire dataset in a single test. It can be used to generate the row-by-row responses in CSF tests (Figure 2E). 
On the flipside, we can infer the posterior distributions of \(\theta _{ij}^{CSF}\) from the row-by-row data \(Y_{ij}^{CSF} = \{ {( {r_{ij,1:K,1:L}^{CSF},{f_{ij,1:K}},{c_{ij,1:K,1:L}}} )} \}\;\)of individual i in the jth test with the BIP (Hou et al., 2015; Hou et al., 2016; Lesmes et al., 2010):  
\begin{eqnarray} && p\left( {\theta _{ij}^{CSF}{\rm{|}}Y_{ij}^{CSF}} \right)\nonumber\\ && \quad = \frac{{\mathop \prod \nolimits_{k = 1}^K \mathop \prod \nolimits_{l = 1}^L p\left( {r_{ijkl}^{CSF}{\rm{|}}\theta _{ij}^{CSF},{f_{ijk}},{c_{ijkl}}} \right){p_0}\left( {\theta _{ij}^{CSF}} \right)}}{{\smallint \mathop \prod \nolimits_{k = 1}^K \mathop \prod \nolimits_{l = 1}^L p\left( {r_{ijkl}^{CSF}{\rm{|}}\theta _{ij}^{CSF},{f_{ijk}},{c_{ijkl}}} \right){p_0}\left( {\theta _{ij}^{CSF}} \right)d\theta _{ij}^{CSF}}},\quad\end{eqnarray}
(8)
where \({p_0}( {\theta _{ij}^{CSF}} )\) is the prior probability distribution of \(\theta _{ij}^{CSF}\), and the denominator is the integral across all possible values of \(\theta _{ij}^{CSF}\)
HBJM
Although the BIP can be used to infer the parameters of the generative model of an individual in a single test, the prior distributions in the procedure are usually uninformative, and the posterior parameters distributions are computed independently for each assessment in each test of an individual, without considering potential conditional dependencies and covariance of the parameters across assessments, tests and individuals. Here, we developed a three-level HBJM to model the row-by-row data from multiple endpoint assessments of all individuals with a joint parameter and hyperparameter distribution of the CE5D, comprised of the parameters of the generative models of all the endpoint assessments. In the current development, the five-dimensional collective endpoint \(\theta _{ij}^{{\rm{CE}}} = ( {\theta _{ij,{\rm{VA}}}^{{\rm{CE}}},\;\theta _{ij,{\rm{CSF}}}^{{\rm{CE}}}} )\;\)for individual i in the jth test consists of two-dimensional \(\theta _{ij,{\rm{VA}}}^{{\rm{CE}}} = ( {\theta _{ij,Threshold}^{{\rm{CE}}},\theta _{ij,Range}^{{\rm{CE}}}} )\) and three-dimensional \(\theta _{ij,CSF}^{CE} = ( {\theta _{ij,PG}^{CE},\theta _{ij,PF}^{CE},\theta _{ij,BH}^{CE}} )\) components. The CE5D hyperparameters at the population and individual levels are five-dimensional random variables ηCE and \(\tau _i^{{\rm{CE}}}.\) In the HBJM, the distribution at a higher level sets the prior for the means of the distributions at a lower level, i.e., the distribution of the population hyperparameter ηCE sets the prior for \(\rho _i^{{\rm{CE}}}\), the mean of the distribution of the hyperparameter \(\tau _i^{{\rm{CE}}}\) of individual i, which in turn sets the prior for the mean of the distribution of parameter \(\theta _{ij}^{{\rm{CE}}}\) of individual i in the jth test. 
The probability distribution pCE) of the CE5D hyperparameter \({\eta ^{{\rm{CE}}}} = (\eta _{VA}^{{\rm{CE}}},\) \(\eta _{CSF}^{{\rm{CE}}})\) with \(\eta _{VA}^{{\rm{CE}}} = ( {\eta _{Threshold}^{{\rm{CE}}},\eta _{Rang{\rm{e}}}^{{\rm{CE}}}} )\) and \(\eta _{CSF}^{{\rm{CE}}} = ( {\eta _{PG}^{{\rm{CE}}},\eta _{PF}^{{\rm{CE}}},\eta _{BH}^{{\rm{CE}}}} )\) at the population level is modeled as a mixture of five-dimensional Gaussian distributions \(\mathcal{N}( {{\eta ^{{\rm{CE}}}},{\mu ^{{\rm{CE}}}},\Sigma } )\):  
\begin{eqnarray}p\left( {{\eta ^{{\rm{CE}}}}} \right) = \mathcal{N}\left( {{\eta ^{{\rm{CE}}}},{\mu ^{{\rm{CE}}}},\Sigma } \right)p\left( {{\mu ^{{\rm{CE}}}}} \right)p\left( \Sigma \right),\quad\end{eqnarray}
(9)
where μCE is the mean of ηCE, with distribution pCE), and Σ is the covariance of ηCE, with distribution p(Σ). 
The probability distribution \(p( {\tau _i^{{\rm{CE}}}{\rm{|}}{\eta ^{{\rm{CE}}}}} )\;\)of the CE5D hyperparameter \(\tau _i^{\rm CE} = (\tau _{i,VA}^{\rm CE},\) \(\tau _{i,CSF}^{\rm CE})\) of individual i is modeled as the product of two-dimensional \(\tau _{i,VA}^{\rm CE} = ( {\tau _{i,Threshold}^{\rm CE},\tau _{j,Range}^{\rm CE}} )\) and three-dimensional \(\tau _{i,CSF}^{\rm CE} = ( {\tau _{i,PG}^{\rm CE},\tau _{i,PF}^{\rm CE},\tau _{i,BH}^{\rm CE}} )\) Gaussian distributions:  
\begin{eqnarray}&& p\left( {\tau _i^{{\rm{CE}}}{\rm{|}}{\eta ^{{\rm{CE}}}}} \right) = \mathcal{N}\left( {\tau _{i,VA}^{{\rm{CE}}},\rho _{i,VA}^{{\rm{CE}}},{\phi _{VA}}} \right)p\left( {{\phi _{VA}}} \right)\nonumber\\ && \mathcal{N}\left( {\tau _{i,CSF}^{{\rm{CE}}},\rho _{i,CSF}^{{\rm{CE}}},{\phi _{CSF}}} \right)p\left( {{\phi _{CSF}}} \right)p\left( {\rho _i^{{\rm{CE}}}|{\eta ^{{\rm{CE}}}}} \right),\quad\end{eqnarray}
(10)
where \(\rho _i^{{\rm{CE}}} = ( {\rho _{i,{\rm{VA}}}^{{\rm{CE}}},\;\rho _{i,{\rm{CSF}}}^{{\rm{CE}}}} )\) is the mean of the \(\tau _i^{{\rm{CE}}}\;\)distribution and conditioned on ηCE, ϕVA and  ϕCSF are 2  ×  2 and 3  ×  3 covariances of \(\tau _{i,{\rm{VA}}}^{{\rm{CE}}}\;\)and \(\tau _{i,{\rm{CSF}}}^{{\rm{CE}}}\), with distributions pVA) and pCSF). 
The probability distribution \(p(\theta _{ij}^{{\rm{CE}}}|\tau _i^{{\rm{CE}}})\) of the CE5D parameter \(\theta _{ij}^{{\rm{CE}}}\) for individual i in the jth test is the product of two distributions conditioned on \(\tau _i^{{\rm{CE}}}\):  
\begin{eqnarray} p(\theta _{ij}^{{\rm{CE}}}|\tau _i^{{\rm{CE}}}) = p(\theta _{ij,VA}^{{\rm{CE}}}|\tau _{i,VA}^{{\rm{CE}}})p(\theta _{ij,CSF}^{{\rm{CE}}}|\tau _{i,CSF}^{{\rm{CE}}}).\quad\end{eqnarray}
(11)
 
The probability of obtaining the entire dataset, including all the row-by-row data from both VA and CSF assessments of all the individuals, is computed by probability multiplication:  
\begin{eqnarray}\begin{array}{@{}l@{}} p\left( {r_{1:I,1:J,1:M}^{VA},r_{1:I,1:J,1:K,1:L}^{CSF},{\rm{|}}X,o{s_{1:I,1:J,1:M}},{f_{1:I,1:J,1:K}},{c_{1:I,1:J,1:K,1:L}}} \right)\\ = \mathop \prod \limits_{i = 1}^I \mathop \prod \limits_{j = 1}^J p\left( {r_{ij,1:M}^{VA}{\rm{|}}\theta _{ij,VA}^{{\rm{CE}}},o{s_{ij,1:M}}} \right)p\left( {r_{ij,1:K,1:L}^{CSF}{\rm{|}}\theta _{ij,CSF}^{{\rm{CE}}},{f_{ij,1:K}},{c_{ij,1:K,1:L}}} \right)\\ p\left( {\theta _{ij}^{{\rm{CE}}}{\rm{|}}\tau _i^{{\rm{CE}}}} \right)p\left( {\tau _i^{{\rm{CE}}}{\rm{|}}{\eta ^{{\rm{CE}}}}} \right)p\left( {{\eta ^{{\rm{CE}}}}} \right) = \mathop \prod \limits_{i = 1}^I \mathop \prod \limits_{j = 1}^J p\left( {r_{ij,1:M}^{VA}{\rm{|}}\theta _{ij,VA}^{{\rm{CE}}},o{s_{ij,1:M}}} \right)\\ p\left( {\theta _{ij,VA}^{{\rm{CE}}}{\rm{|}}\tau _{i,VA}^{{\rm{CE}}}} \right)p\left( {r_{ij,1:K,1:L}^{CSF}{\rm{|}}\theta _{ij,CSF}^{{\rm{CE}},},{f_{ij,1:K}},{c_{ij,1:K,1:L}}} \right)p\left( {\theta _{ij,CSF}^{{\rm{CE}}}{\rm{|}}\tau _{i,CSF}^{{\rm{CE}}}} \right)\\ \mathcal{N}\left( {\tau _{i,VA}^{{\rm{CE}}},\rho _{i,VA}^{{\rm{CE}}},{\phi _{VA}}} \right)\mathcal{N}\left( {\tau _{i,CSF}^{{\rm{CE}}},\;\rho _{i,CSF}^{{\rm{CE}}},{\phi _{CSF}}} \right)p\left( {{\phi _{VA}}} \right) p\left( {{\phi _{CSF}}} \right)\\ p\left( {\rho _i^{{\rm{CE}}}|{\eta ^{{\rm{CE}}}}} \right)\mathcal{N}\left( {{\eta ^{{\rm{CE}}}},{\mu ^{{\rm{CE}}}},\Sigma } \right)p\left( {{\mu ^{{\rm{CE}}}}} \right)p\left( \sum \right) \end{array}\quad\end{eqnarray}
(12)
where \(X = ( {\theta _{1:I,1:J}^{{\rm{CE}}},\rho _{1:I}^{{\rm{CE}}},{\phi _{VA}},{\phi _{CSF}},{\mu ^{{\rm{CE}}}},\Sigma } )\) are all the parameters and hyperparameters in the HBJM, I is the total number of individuals, and J is the total number of tests on each individual. 
We start with prior distributions of μCE, Σ, ϕCSF, and ϕVA. 
\begin{eqnarray}{p_0}\left( {{\mu ^{{\rm{CE}}}}} \right) = \mathcal{U}\left( {\mu _{0,min}^{{\rm{CE}}},\mu _{0,max}^{{\rm{CE}}}} \right),\qquad\end{eqnarray}
(13a)
 
\begin{eqnarray}{p_0}\left( {{\Sigma ^{ - 1}}} \right) = \mathcal{W}\left( {\Sigma _{BIP}^{ - 1}/{\rm{\nu }},{\rm{\nu }}} \right),\qquad\end{eqnarray}
(13b)
 
\begin{eqnarray}{p_0}\left( {\phi _{VA}^{ - 1}} \right) = \mathcal{W}\left( {\phi _{qVA}^{ - 1}/{{\rm{\nu }}_{VA}},{{\rm{\nu }}_{VA}}} \right),\qquad\end{eqnarray}
(13c)
 
\begin{eqnarray}{p_0}\left( {\phi _{CSF}^{ - 1}} \right) = \mathcal{W}\left( {\phi _{qCSF}^{ - 1}/{{\rm{\nu }}_{CSF}},{{\rm{\nu }}_{CSF}}} \right),\qquad\end{eqnarray}
(13d)
where \(\mathcal{U}\) is a five-dimensional uniform distribution with lower bound \(\mu _{0,min}^{{\rm{CE}}} = ( {1.05,{\rm{\;}}0.1,{\rm{\;}}1,{\rm{\;}} - 0.5,{\rm{\;}}0.1} )\) and upper bound \(\mu _{0,max}^{{\rm{CE}}} = ( {1050,\;20,\;9,\;1.3,\;1.5} )\); precision matrices Σ−1,\(\;\phi _{VA}^{ - 1}\), and \(\phi _{CSF}^{ - 1}\) are the inverse of covariances Σ, ϕVA, and ϕCSF; \(\mathcal{W}( {\frac{{\rm{Y}}}{{{{\rm{\nu }}_{\rm{Y}}}}},{{\rm{\nu }}_{\rm{Y}}}} )\) denotes a Wishart distribution with mean precision matrix Y and degrees of freedom νY, with ν = 5, νVA = 2 , and νCSF = 3;\(\;\phi _{qVA}^{ - 1}\) and \(\phi _{qCSF}^{ - 1}\) are the inverse of the average covariance matrices ϕqVA and ϕqCSF computed from the BIP posterior distributions across all qVA and qCSF tests; and Σ−1 is the inverse of the covariance matrix Σ computed from the estimated \(\theta _{ij}^{VA}\) and \(\theta _{ij}^{CSF}\) across all qVA and qCSF tests. 
The joint posterior distribution of all the parameters and hyperparameters in the HBJM is computed by Bayes’ rule:  
\begin{eqnarray}\begin{array}{@{}l@{}} p\left( {X{\rm{|}}Y_{1:I,1:J,1:M}^{VA},Y_{1:I,1:J,1:K,1:L}^{CSF}} \right)\\ = \frac{{\mathop \prod \nolimits_{i = 1}^I \mathop \prod \nolimits_{j = 1}^J p\left( {r_{ij,1:M}^{VA}{\rm{|}}\theta _{ij,VA}^{{\rm{CE}}}o{s_{ij,1:M}}} \right)p\left( {\theta _{ij}^{{\rm{CE}}}{\rm{|}}\tau _i^{{\rm{CE}}}} \right)p\left( {r_{ij,1:K,1:L}^{CSF}{\rm{|}}\theta _{ij,CSF}^{{\rm{CE}}},{f_{ij,1:K}},{c_{ij,1:K,1:L}}} \right)}}{{\smallint \mathop \prod \nolimits_{i = 1}^I \mathop \prod \nolimits_{j = 1}^J p\left( {r_{ij,1:M}^{VA}{\rm{|}}\theta _{ij,VA}^{{\rm{CE}}},o{s_{ij,1:M}}} \right)p\left( {\theta _{ij}^{{\rm{CE}}}{\rm{|}}\tau _i^{{\rm{CE}}}} \right)p\left( {r_{ij,1:K,1:L}^{CSF}{\rm{|}}\theta _{ij,CSF}^{{\rm{CE}}},{f_{ij,1:K}},{c_{ij,1:K,1:L}}} \right)}}\\ \frac{{\mathcal{N}\left( {\tau _{i,VA}^{{\rm{CE}}},\rho _{i,VA}^{{\rm{CE}},},{\phi _{VA}}} \right)\mathcal{N}\left( {\tau _{i,CSF}^{{\rm{CE}}},\rho _{i,CSF}^{{\rm{CE}}},{\phi _{CSF}}} \right){p_0}\left( {{\phi _{VA}}} \right){p_0}\left( {{\phi _{CSF}}} \right)p\left( {\rho _i^{{\rm{CE}}}|{\eta ^{{\rm{CE}}}}} \right)\mathcal{N}\left( {{\eta ^{{\rm{CE}}}},{\mu ^{{\rm{CE}}}},\Sigma } \right){p_0}\left( {{\mu ^{{\rm{CE}}}}} \right){p_0}\left( \Sigma \right)}}{{\mathcal{N}\left( {\tau _{i,VA}^{{\rm{CE}}},\rho _{i,VA}^{{\rm{CE}}},{\phi _{VA}}} \right)\mathcal{N}\left( {\tau _{i,CSF}^{{\rm{CE}}},\rho _{i,CSF}^{{\rm{CE}}},{\phi _{CSF}}} \right){p_0}\left( {{\phi _{VA}}} \right){p_0}\left( {{\phi _{CSF}}} \right)p\left( {\rho _i^{{\rm{CE}}}|{\eta ^{{\rm{CE}}}}} \right)\mathcal{N}\left( {{\eta ^{{\rm{CE}}}},{\mu ^{{\rm{CE}}}},\Sigma } \right){p_0}\left( {{\mu ^{{\rm{CE}}}}} \right){p_0}\left( \Sigma \right)dX}}, \end{array}\quad\end{eqnarray}
(14)
where the denominator is the integral of the probability of obtaining the entire dataset across all possible values of X and is a constant for a given dataset and HBJM. 
Computing the joint posterior distribution
We used R (R Core Team, 2003) function run.jags in JAGS (Plummer, 2003) to compute representative samples of the joint posterior distribution of \(\theta _{i1}^{{\rm{CE}}}\;\)(5 parameters × 56 individuals=280 parameters), \(\rho _i^{{\rm{CE}}}\) (5 parameters × 56 subjects = 280 parameters), ϕCSF and ϕVA (9 parameters), μCE(5 parameters), and Σ (15 parameters) in three Markov Chain Monte Carlo (MCMC) chains. The MCMC efficiently samples the joint posterior distribution in a random walk (Kruschke, 2014). It started at a position randomly selected in the 589-dimensional parameter space. In each step, one of the 589 parameters was randomly selected. The values of all the other 588 parameters at the current position were fixed and the one-dimensional conditional probability distribution of the selected parameter was evaluated, from which a new value of the selected parameter was generated. The process reiterated until a pre-defined convergence criterion was reached so that the probability of visiting a position in the random walk approximated the joint posterior distribution of all the 589 parameters in Equation 14
Because the arbitrary starting position influenced the initial part of the random walk, steps in the adaptation and burn-in (optimization of the sampling algorithm by JAGS) phases were discarded and not included in the analysis. The exact number of steps discarded depends on each model and data. Convergence of each parameter was evaluated with Gelman and Rubin's diagnostic rule (Gelman & Rubin, 1992) based on the ratio of between- and within-MCMC variances along each dimension, that is, the variance of the samples across MCMC chains divided by the variance of the samples within each MCMC chain. The HBJM was considered “converged” when the Gelman and Rubin's statistics for all the parameters were smaller than 1.05. In this study, 500,000 and 20,000 steps were used for adaptation and burn-in in each MCMC chain based on pilot studies. The random walk continued until 2,000,000 additional samples were generated in each MCMC chain. 20,000 of the 2,000,000 samples (1 of every 100 sample) were stored (thinning ratio = 100). We fit the HBJM to three test lengths: short (the first 5 qVA and first 15 qCSF rows; M = 5, K = 15), medium (the first 15 qVA and first 25 qCSF rows; M = 15, K = 25), and long (all qVA and qCSF rows; M = 45, K = 50), separately. 
Similarly, we used R (R Core Team, 2003) function run.jags in JAGS (Plummer, 2003) to compute representative samples of the posterior distribution of \(\theta _{i1}^{VA}\) and \(\theta _{i1}^{CSF}\) for each qVA and qCSF test independently using their respective BIP. Five thousand and 5000 steps were used for adaptation and burn-in in each MCMC chain based on pilot studies. The random walk continued until 10,000 were stored in each MCMC chain with thinning ratio = 150, 50, and 50 for short, medium, and long test lengths, respectively. 
Statistical analysis
We evaluated the goodness of fit of the BIP and HBJM to the entire dataset using Bayesian predictive information criterion (BPIC) (Ando, 2007; Ando, 2011), which quantifies the likelihood of obtaining the observed data given the joint probability distribution of all the parameters of a model and penalizes model complexity. Smaller BPIC value indicates better fit. 
We then compared the uncertainties of the estimated \(\theta _{i1}^{VA},\theta _{i1}^{CSF}\), and \(\theta _{i1}^{{\rm{CE}}}\). The uncertainty was quantified as the variance of the marginal probability distribution of each parameter (Clayton & Hills, 1993; Edwards et al., 1963). Hotelling's T-squared test (Anderson, 2003) was used to compare the variances of \(\theta _{i1}^{{\rm{BIP}}} = (\theta _{i1}^{VA},\theta _{i1}^{CSF})\) and \(\theta _{i1}^{{\rm{CE}}}\) with R function HotellingsT2 (Nordhausen, Sirkia, Oja, & Tyler, 2018; R Core Team, 2003). 
Finally, we unblinded the data and computed sensitivity at 95% specificity, accuracy, and p value between the six Bangerter foil condition pairs (F0-F1, F0-F2, F0-F3, F1-F2, F1-F3, F2-F3) for (1) each eye based on distributions of \(\theta _{i1}^{VA}\), \(\theta _{i1}^{CSF}\), and \(\theta _{i1}^{{\rm{CE}}}\), and (2) group-distributions constructed by random sampling from distributions of \(\theta _{i1}^{VA}\), \(\theta _{i1}^{CSF}\), and \(\theta _{i1}^{{\rm{CE}}}\)
At the test level for individual eyes, the distributions from the same eye in each condition pair were used for analysis based on VA threshold from the BIP (VA1D), area under the log CSF from the BIP (AULCSF1D), and the collective endpoint (CE5D) from the HBJM. AULCSF1D was constructed by computing the AULCSF between 1.5 and 18 cycles per degree. 
The group level null and difference distributions of VA1D, AULCSF1D, and CE5D between a condition pair, p(x|Gnull) and p(x|Gdiff), where x = VA1D, AULCSF1D, or CE5D, were constructed from the corresponding individual test-level distributions in the two conditions: (1) For each eye, a pair of random samples were drawn from the test-level distributions in the two conditions. The difference between the two samples was computed. (2) Step (1) was repeated for all 14 eyes. (3) The mean difference across the 14 eyes was added as a new sample in p(x|Gdiff). (4) By repeating (1) ∼(3), p(x|Gdiff) was constructed. And (5) the baseline distribution p(x|Gnull) was constructed by shifting the mean of p(x|Gdiff) to the origin. 
Similarly, for the power analysis, the group level null and difference distributions of VA1D, AULCSF1D, and CE5D for sample sizes N = 1∼14 between each condition pair, p(x|Gnull) and p(x|Gdiff), where x = VA1D, AULCSF1D, or CE5D, were constructed from the corresponding individual test-level distributions in the two conditions: (1) N eyes were randomly selected with replacement from the 14 eyes. (2) For each selected eye, a pair of random samples were drawn from the test-level distributions in the two conditions. The difference between the two samples was computed. (3) Step (2) was repeated for the N randomly selected eyes. (4) The mean difference across the N eyes was added as a new sample in p(x|Gdiff). (5) By repeating (1) ∼(4) 10,000 times, p(x|Gdiff) was constructed. (6) the baseline distribution p(x|Gnull) was constructed by shifting the mean of p(x|Gdiff) to zero (the origin). 
Probability density function (pdf) was obtained by fitting two-, three- and five- dimensional Gaussian mixture models (GMMs) to the MCMC samples of \(\theta _{i1}^{VA}\), \(\theta _{i1}^{CSF}\), and \(\theta _{i1}^{{\rm{CE}}}\), respectively, and the samples in the corresponding difference distributions. MATLAB function fitgmdist was used to fit GMMs with one to six Gaussian components. The Bayes information criterion (BIC) was computed for each GMM fit. The best fitting GMM was determined from the elbow of the BIC versus number of Gaussian component function. 
Given a baseline distribution p(x|B) and a treatment distribution p(x|T) (Figure 5A), specificity is the probability of correctly identifying a random sample xs,B from p(x|B), and sensitivity is the probability of correctly identifying a random sample xs,T from p(x|T) (Green & Swets, 1966; Swets & Pickett, 1982; US Department of Health and Human Services et al., 2007). We used the likelihood ratio \(\frac{{pdf( {x{\rm{|}}B} )}}{{pdf( {x{\rm{|}}T} )}}\) to define the criterion and computed sensitivity and specificity over a wide range of criteria to construct the receiver operating characteristic curves (ROC): (1-specificity) versus sensitivity. Accuracy is quantified by the area under ROC (AUROC), which is a criterion-free measure in signal detection theory (Figure 5B). (Green & Swets, 1966; Swets & Pickett, 1982; US Department of Health and Human Services et al., 2007) For a given pair of p(x|B) and p(x|T), the p value is defined as the probability of a random sample xs,B from p(x|B) being incorrectly identified as from p(x|T) with \(\frac{{pdf( {x{\rm{|}}B} )}}{{pdf( {x{\rm{|}}T} )}} \le 1\) (Figure 5A):  
\begin{eqnarray}P = \mathop \smallint \nolimits_{pdf({x_{s,B}}|B) \le pdf({x_{s,B}}|T)}^{} p\left( {{x_{s,B}}{\rm{|}}B} \right)d{x_{s,B}}.\quad\end{eqnarray}
(15)
 
Figure 5.
 
(A) Scatter plots of xs,B (red dots) from the base-line distribution p(x|B) and xs,T (blue dots) from the treatment distribution p(x|T). Black dashed line represents a criterion of \(\frac{{pdf( {x{\rm{|}}B} )}}{{pdf( {x{\rm{|}}T} )}} = 1\) and the iso-density contours illustrate the best fitting Gaussian mixture models to the two distributions. (B) The bold black curve is the ROC curve. The grey area under the ROC is the AUROC, a criterion-free estimate of the test accuracy. Black dashed line represents chance performance.
Figure 5.
 
(A) Scatter plots of xs,B (red dots) from the base-line distribution p(x|B) and xs,T (blue dots) from the treatment distribution p(x|T). Black dashed line represents a criterion of \(\frac{{pdf( {x{\rm{|}}B} )}}{{pdf( {x{\rm{|}}T} )}} = 1\) and the iso-density contours illustrate the best fitting Gaussian mixture models to the two distributions. (B) The bold black curve is the ROC curve. The grey area under the ROC is the AUROC, a criterion-free estimate of the test accuracy. Black dashed line represents chance performance.
In this article, p(x|B) and p(x|T) are test-level distributions in two conditions for individual eyes and p(x|Gnull) and p(x|Gdiff) at the group level. We also computed the p values (Equation 15) between \(\theta _{i1}^{VA}\) and the VA component of \(\theta _{i1}^{{\rm{CE}}}\), and between \(\theta _{i1}^{CSF}\) and the CSF parameter components of \(\theta _{i1}^{{\rm{CE}}}\) to quantify the agreement between BIP and HBJM estimates in each test, with BIP as the baseline. 
Results
We report the detailed results based on the medium test length in the main text. The results from the short and long test lengths were largely consistent and can be found in Supplementary Materials B and C
Goodness-of-fit
The HBJM (BPICHBJM = 4591) fit the data better than the BIP (BPICBIP = 4617). 
Posterior distributions from the HBJM and the BIP
Figure 6 illustrates the two-dimensional marginal posterior distributions of ηCE at the population level. We found large correlations among ηCE components (Table 1). Specifically, \(\eta _{Threshold}^{{\rm{CE}}}\) was positively correlated with \(\eta _{Range}^{{\rm{CE}}}\) (r = 0.516), but negatively correlated with \(\eta _{PG}^{{\rm{CE}}}\) (r = −0.486), \(\eta _{PF}^{{\rm{CE}}}\) (r = −0.583), and \(\eta _{BH}^{{\rm{CE}}}\) (r = −0.288); \(\eta _{range}^{{\rm{CE}}}\) was negatively correlated with \(\eta _{PF}^{{\rm{CE}}}\) (r = −0.494); \(\eta _{BH}^{{\rm{CE}}}\) was positively correlated with \(\eta _{PG}^{{\rm{CE}}}\) (r = 0.577), but was negatively correlated with \(\eta _{PF}^{{\rm{CE\;}}}\)(r = −0.536). The positive correlation between \(\eta _{Threshold}^{{\rm{CE}}}\) and \(\eta _{Range}^{{\rm{CE}}}\) is consistent with our previous observation that worse VA thresholds were associated with wider ranges (shallower slope of VABF) across individuals (Zhao, Lesmes, Dorr, Bex, et al., 2021; Zhao, Lesmes, Dorr, & Lu, 2021). The negative correlations between \(\eta _{VA}^{{\rm{CE}}}\) and \(\eta _{CSF}^{{\rm{CE}}}\) components were expected because increased VA threshold is associated with worse vision, and increased CSF is associated with better vision. The correlations among \(\eta _{CSF}^{{\rm{CE}}}\) components indicated that individuals with wider CSF bandwidths tended to have higher peak gains but lower peak frequencies. 
Figure 6.
 
Two-dimensional marginal posterior distributions of ηCE.
Figure 6.
 
Two-dimensional marginal posterior distributions of ηCE.
Table 1.
 
Correlations of ηCEcomponents at the population level.
Table 1.
 
Correlations of ηCEcomponents at the population level.
Figure 7 illustrates two-dimensional marginal posterior distributions of \(\tau _i^{{\rm{CE}}}\) for one individual. Table 2 shows the correlations of \(\tau _i^{{\rm{CE}}}\) components, averaged across all individuals. \(\tau _{i,Threshold}^{{\rm{CE}}}\) and \(\tau _{i,Range}^{{\rm{CE}}}\) did not exhibit large correlation, consistent with our previous publication (Zhao, Lesmes, Dorr, & Lu, 2021), because they quantified two independent properties of the VABF, namely the position and shape of the psychometric function at the individual level (Figure 2A). We found large correlations among \(\tau _{i,CSF}^{{\rm{CE}}}\;\)components: \(\tau _{i,PF}^{{\rm{CE}}}\) was negatively correlated with \(\tau _{i,PG}^{{\rm{CE}}}\;\)(r = −0.757 ± 0.028) and \(\tau _{i,BH}^{{\rm{CE}}}\) (r = −0.900 ± 0.010), whereas \(\tau _{i,PG}^{{\rm{CE}}}\) was positively correlated with \(\tau _{i,BH}^{{\rm{CE}}}\;\)(r = 0.551 ± 0.047). The correlations among \(\tau _{i,CSF}^{{\rm{CE}}}\) components resulted from the properties of the generative model of CSF: The three CSF model parameters jointly determine contrast sensitivities at all the spatial frequencies and different combinations of parameter values may result in similar probabilities of correct responses in some experimental conditions. On the other hand, correlations between \(\tau _{i,VA}^{{\rm{CE}}}\) and \(\tau _{i,CSF\;}^{{\rm{CE}}}\)were small because VA and CSF assessments were conducted separately. 
Figure 7.
 
Two-dimensional marginal posterior distributions of \(\tau _i^{{\rm{CE}}}\) for one individual from the HBJM.
Figure 7.
 
Two-dimensional marginal posterior distributions of \(\tau _i^{{\rm{CE}}}\) for one individual from the HBJM.
Table 2.
 
Correlations of \(\tau _i^{{\rm{CE}}}\) components, averaged across all individuals and with SD in parentheses.
Table 2.
 
Correlations of \(\tau _i^{{\rm{CE}}}\) components, averaged across all individuals and with SD in parentheses.
The two-dimensional marginal posterior distributions of \(\theta _{i1}^{{\rm{CE}}}\) at the test level for the same individual are illustrated in Figure 8. The posterior distributions of \(\theta _{i1}^{VA}\) and \(\theta _{i1}^{CSF}\) from the BIP are illustrated in Figure 9. The corresponding correlations, averaged across all tests from the two analyses, are listed in Tables 3 and 4. We found large correlations among \(\theta _{i1,{\rm{CSF}}}^{{\rm{CE}}}\) components and among \(\theta _{i1}^{CSF}\) components. \(\theta _{i1,PF}^{{\rm{CE}}}\) was negatively correlated with \(\theta _{i1,PG\;}^{{\rm{CE}}}\)(r = −0.730 ± 0.210) and \(\theta _{i1,BH}^{{\rm{CE}}}\) (r = −0.930 ± 0.024), whereas \(\theta _{i1,PG}^{{\rm{CE}}}\) was positively correlated with \(\theta _{i1,BH\;}^{{\rm{CE}}}\)(r = 0.520 ± 0.271). \(\theta _{i1,PF}^{CSF}\) was negatively correlated with \(\theta _{i1,PG}^{CSF}\) (r = −0.762 ± 0.271) and \(\theta _{i1,BH}^{CSF}\) (r = −0.931 ± 0.026), whereas \(\theta _{i1,PG}^{CSF}\) was positively correlated with \(\theta _{i1,BH}^{CSF}\) (r = 0.562 ± 0.355). These patterns are consistent with those at the individual level and our previous publications (Zhao, Lesmes, Dorr, Bex, et al., 2021; Zhao, Lesmes, Dorr, & Lu, 2021; Zhao, Lesmes, Hou, et al., 2021). 
Figure 8.
 
Two-dimensional marginal posterior distributions of \(\theta _{i1}^{{\rm{CE}}}\) at the test level for the same individual in Figure 7 from the HBJM.
Figure 8.
 
Two-dimensional marginal posterior distributions of \(\theta _{i1}^{{\rm{CE}}}\) at the test level for the same individual in Figure 7 from the HBJM.
Figure 9.
 
Two-dimensional marginal posterior distributions of \(\theta _{i1}^{VA}\) and \(\theta _{i1}^{CSF}\) for the same individual in Figure 7 from the BIP.
Figure 9.
 
Two-dimensional marginal posterior distributions of \(\theta _{i1}^{VA}\) and \(\theta _{i1}^{CSF}\) for the same individual in Figure 7 from the BIP.
Table 3.
 
Correlations of \(\theta _{i1}^{{\rm{CE}}}\) components, averaged across all tests and with SD in parentheses.
Table 3.
 
Correlations of \(\theta _{i1}^{{\rm{CE}}}\) components, averaged across all tests and with SD in parentheses.
Table 4.
 
Correlations of \(\theta _{i1}^{CSF}\) and\(\;\theta _{i1}^{VA}\) components, averaged across all tests and with SD in parentheses.
Table 4.
 
Correlations of \(\theta _{i1}^{CSF}\) and\(\;\theta _{i1}^{VA}\) components, averaged across all tests and with SD in parentheses.
Mean and uncertainty of test parameters
The BIP and HBJM estimates of the parameters at the test level showed excellent agreement. The p values for the difference between the posterior distributions of \(\theta _{i1}^{VA}\) and \(\theta _{i1}^{CSF}\) and those of the corresponding \(\theta _{i1}^{{\rm{CE}}}\) components were 0.20 to 0.41 and 0.20 to 0.45. The average variance of the estimated \(\theta _{i1}^{{\rm{CE}}}\) components (mean = 0.0091 log10 units; range 0.00026∼0.059 log10 units) was 71.5% less than that of the corresponding \(\theta _{i1}^{{\rm{BIP}}}\) components (mean = 0.032 log10 units; range 0.00026∼0.30 log10 units) (t2 (5, 51) = 46, p < 0.001), consistent with the well-known variance shrinkage effect of HBMs (Kruschke, 2014). Figure 10 shows histograms of the variance of \(\theta _{i1}^{{\rm{BIP}}}\) and \(\theta _{i1}^{{\rm{CE}}}\) components. 
Figure 10.
 
Histograms of the variance of \(\theta _{i1}^{{\rm{BIP}}}\) and \(\theta _{i1}^{{\rm{CE}}}\) components.
Figure 10.
 
Histograms of the variance of \(\theta _{i1}^{{\rm{BIP}}}\) and \(\theta _{i1}^{{\rm{CE}}}\) components.
Signal-to-noise analysis
The mean VA (Figures 11A–C) and AULCSF (Figures 11D–F) changes between pairs of foil conditions were virtually equal across BIP and CE analyses in all test durations. The recovery of strong covariances by CE analysis reduced the variance of CE estimates relative to BIP estimates (Figures 11G–L), especially for the short testing duration, consistent with our previous HBM results in each single modality of CSF and VA (Zhao, Lesmes, Dorr, & Lu, 2021; Zhao, Lesmes, Hou, et al., 2021). In addition, CE5D combines signals from VA and CSF. The increased signal and reduced noise of CE5D improved signal detection. At the test level for individual eyes, median p values only for CE5D from the five-dimensional HBJM analysis were all significant for short (p < 0.02), medium (p < 0.002), and long testing (p < 0.0004) (Figure 12). The advantage provided by the HBJM was stronger when less information was available with the short and medium test lengths relative to those with the long test length. 
Figure 11.
 
(A–C) Average signal between F1 and F2, F1 and F3, and F2 and F3 for VA1D and the VA component of CE5D across 14 eyes. (D–F) Average signal between F1 and F2, F1 and F3, and F2 and F3 for AULCSF1D and AULCSF based on CE5D. (G–I) Average variance of the difference distributions between F1 and F2, F1 and F3, and F2 and F3 for VA1D and the VA component of CE5D. (J–L) Average variance of the difference distributions between F1 and F2, F1 and F3, and F2 and F3 for AULCSF1D and AULCSF based on CE5D. (1dB = 0.1 log10 units).
Figure 11.
 
(A–C) Average signal between F1 and F2, F1 and F3, and F2 and F3 for VA1D and the VA component of CE5D across 14 eyes. (D–F) Average signal between F1 and F2, F1 and F3, and F2 and F3 for AULCSF1D and AULCSF based on CE5D. (G–I) Average variance of the difference distributions between F1 and F2, F1 and F3, and F2 and F3 for VA1D and the VA component of CE5D. (J–L) Average variance of the difference distributions between F1 and F2, F1 and F3, and F2 and F3 for AULCSF1D and AULCSF based on CE5D. (1dB = 0.1 log10 units).
Figure 12.
 
Medium p values between F1 and F2 (A), F1 and F3 (B,), and F2 and F3 (C) for VA1D , AULCSF1D, and CE5D in the 14 eyes.
Figure 12.
 
Medium p values between F1 and F2 (A), F1 and F3 (B,), and F2 and F3 (C) for VA1D , AULCSF1D, and CE5D in the 14 eyes.
Sensitivity at 95% specificity, accuracy, and p value
Since the differences between condition pairs F0-F1, F0-F2, and F0-F3 were large, sensitivity at 95% specificity and accuracy were all close to 100% and the p values were close to 0 for VA1D, AULCSF1D, and CE5D at both the individual and group levels. CE5D exhibited advantage over VA1D and AULCSF1D between condition pairs with smaller differences (F1-F2, F2-F3, F1-F3), and the advantage was more profound when less information was collected with shorter test lengths or smaller numbers of eyes (see results for the short and long test lengths in Supplementary Materials B and C). 
Test level
Tables 5 and 6 show the average sensitivity at 95% specificity and accuracy across all eyes between the six condition pairs at the test level, with standard error (SE) in parentheses. CE5D exhibited significantly higher sensitivity at 95% specificity than VA1D between F1 and F2 (t(13) = 2.9, p = 0.006), between F1 and F3 (t(13) = 1.9, p = 0.04), and between F2 and F3 (t(13) = 3.5, p = 0.002), and AULCSF1D between F1 and F2 (t(13) = 2.7, p = 0.009), and between F2 and F3 (t(13) = 2.3, p = 0.02). CE5D exhibited significantly higher accuracy than VA1D between F1 and F2 (t(13) = 2.6, p = 0.01), and between F2 and F3 (t(13) = 3.2, p = 0.004), and AULCSF1D between F1 and F2 (t(13) = 2.2, p = 0.02), and between F2 and F3 (t(13) = 2.0, p = 0.03). Table 7 shows that larger number of eyes reached p < 0.01 with CE5D relative to VA1D and AULCSF1D between F1 and F2, between F1 and F3, and between F2 and F3, indicating highest discriminative power for CE5D
Table 5.
 
Average (SE) sensitivity (%) at 95% specificity of VA1D, AULCSF1D, and CE5D across all eyes between six condition pairs.
Table 5.
 
Average (SE) sensitivity (%) at 95% specificity of VA1D, AULCSF1D, and CE5D across all eyes between six condition pairs.
Table 6.
 
Average (SE) accuracy (%) of VA1D, AULCSF1D, and CE5D across all eyes between six condition pairs.
Table 6.
 
Average (SE) accuracy (%) of VA1D, AULCSF1D, and CE5D across all eyes between six condition pairs.
Table 7.
 
Number of eyes required to reach significant differences (p < 0.01) based on VA1D, AULCSF1D, and CE5D.
Table 7.
 
Number of eyes required to reach significant differences (p < 0.01) based on VA1D, AULCSF1D, and CE5D.
Group level
At the group level, sensitivity at 95% specificity and accuracy between all condition pairs based on VA1D, AULCSF1D, and CE5D were all 100% and the p values were all <0.001, except for those of VA1D between F2 and F3 with sensitivity and p values of 99% and 0.03. 
Power analysis
CE5D exhibited much more statistical power than VA1D and AULCSF1D (Figure 13). Only five, two, and five eyes were needed for CE5D to reached 95% sensitivity at 95% specificity between F1 and F2, F1 and F3, and F2 and F3. Many more eyes were needed for VA1D and AULCSF1D to reach the same level of specificity. Similar results were obtained in terms of accuracy and p values. 
Figure 13.
 
(A–C) Sensitivity at 95% specificity between F1-F2, F1-F3, and F2-F3 condition pairs as functions of sample size for VA1D, AULCSF1D, and CE5D. (D–F) Accuracy between F1-F2, F1-F3, and F2-F3 condition pairs. (G–I) p value between F1-F2, F1-F3, and F2-F3 condition pairs.
Figure 13.
 
(A–C) Sensitivity at 95% specificity between F1-F2, F1-F3, and F2-F3 condition pairs as functions of sample size for VA1D, AULCSF1D, and CE5D. (D–F) Accuracy between F1-F2, F1-F3, and F2-F3 condition pairs. (G–I) p value between F1-F2, F1-F3, and F2-F3 condition pairs.
Discussion
In this article, we developed a three-level HBJM framework to compute a collective endpoint (CE5D) from multiple assessment modalities by modeling data from two test modalities. Using information across all individuals and test modalities, the framework explicitly quantifies the statistical relationships between multiple test modalities and generates a multidimensional collective endpoint that can improve statistical power. We applied the framework to compute the five-dimensional CE5D, consisting of VA threshold, VA range, peak gain, peak spatial frequency, and bandwidth, from trial-by-trial qVA and qCSF data. The HBJM revealed strong correlations among CE5D components at the population, individual and test levels. Compared with the one-dimensional endpoints from BIP analysis (VA1D and AULCSF1D), the five-dimensional CE5D from the HBJM generated the highest sensitivity at 95% specificity, the highest accuracy, and the most significant p values at both the test and group levels. The higher sensitivity and accuracy and more significant p values of CE5D resulted mainly from the HBJM, which optimally combines information from test modalities, individuals, and tests to mutually constrain the estimates to reduce the uncertainties of the parameters and hyperparameters of the VA and CSF models (Figures 11 and 12). 
Because of the strong correlations between the VA and CSF components of CE5D at the population level, the HBJM can be used to generate informative priors for qCSF based on qVA assessment or priors for qVA based on qCSF assessment by treating the to-be-observed data as missing data in the model. We previously developed the hierarchical adaptive design optimization (Gu et al., 2016; Kim et al., 2014) algorithm to simultaneously categorize each new patient into existing groups and provide the most informative prior for them based on their group membership for qCSF. A combination of hierarchical adaptive design with the collective endpoint analysis developed in this article may further improve the efficiency of adaptive tests in multiple assessment modalities. 
Although it was applied to a dataset with only two assessment modalities in this study, the HBJM can be extended to compute CE from more than two assessment modalities. A generative model of each endpoint is required in the HBJM framework. In the current study, generative models of two functional vision endpoints (VA and CSF) enabled our development of CE5D of VA and CSF. Generative models of other endpoints including clinical events (Yin et al., 2018) and structural endpoints can be developed and incorporated into the HBJM framework. They can be either statistical models that capture the probability of certain aspects of a human condition or mechanistic models based on human biology or physiology. For example, optical coherence tomography (OCT) (Huang et al., 1991) uses light to generate cross-section retina images. Anatomical changes (e.g., thickness of a retina layer) identified with OCT have been related to the severities of eye diseases (Vujosevic et al., 2022). A generative model can be constructed by specifying the probability of an eye disease and its severity as a function of the anatomical features of the retina, such as the thickness of retinal layers in different locations. The HBJM can be extended to compute CE from OCT, VA, and CSF and further facilitate disease staging and treatment monitoring and improve statistical evaluation in clinical trials. 
In this article, we constructed a three-level HBJM without considering any hierarchy related to treatment conditions because our goal was to compute the CE for each individual from blinded data at one point of assessment. The HBJM can be extended to compute CE from blinded data at different time points. In this case, the CE includes the parameters of all the assessment modalities at multiple time points to capture treatment effects over time and can be used to improve statistical evaluation of treatment efficacy. The HBJM can also be extended to compute CE from unblinded data in clinical trials with multiple endpoints at multiple time points. In this case, the HBJM takes into account of the treatment arms. Further research is necessary to investigate the utilities of these two forms of HBJM in clinical trials. 
In conclusion, the HBJM extracts and optimally combines valuable information about covariance of CS and VA parameters and improves precision of the estimated parameters. By combining signals and reducing noise from multiple tests for detecting vision changes, this framework exhibits potential to improve statistical power for combining multimodality data in ophthalmic trials. 
Acknowledgments
Supported by the National Eye Institute (EY017491 and EY032125). 
Commercial relationships: L.A.L and M.D. have intellectual and equity interests, and hold employment in Adaptive Sensory Technology, Inc. (San Diego, CA). L.A.L and M.D. hold employment in AST. Z.-L.L. has intellectual and equity interests in Adaptive Sensory Technology, Inc. (San Diego, CA) and Jiangsu Juehua Medical Technology, LTD (Jiangsu, China). Y.Z. has no competing interest. 
Corresponding author: Zhong-Lin Lu. 
Email: zhonglin@nyu.edu. 
Address: Center for Neural Science, New York University, 4 Washington Place, New York, NY 10003, USA. 
References
Abou-Hanna, J. J., Andrews, C. A., Khan, N. W., Musch, D. C., & Jayasundera, K. T. (2021). Calculation of test-retest variability in phase I/IIa clinical trials for inherited retinal degenerations. Ophthalmic Genetics, 42(3), 283–290, https://doi.org/10.1080/13816810.2021.1897848. [CrossRef] [PubMed]
Ahn, W.-Y., Krawitz, A., Kim, W., Busmeyer, J. R., & Brown, J. W. (2011). A model-based fMRI analysis with hierarchical Bayesian parameter estimation. Journal of Neuroscience, Psychology, and Economics, 4(2), 95–110. [CrossRef] [PubMed]
Alexander, K. R., Derlacki, D. J., & Fishman, G. A. (1995). Visual acuity vs letter contrast sensitivity in retinitis pigmentosa. Vision Research, 35(10), 1495–1499, https://doi.org/10.1016/0042-6989(95)98729-S. [CrossRef] [PubMed]
Allingham, M. J., Mettu, P. S., & Cousins, S. W. (2022). Phase 1 clinical trial of elamipretide in intermediate age-related macular degeneration and high-risk drusen: Reclaim high-risk drusen study. Ophthalmology Science, 2(1), https://doi.org/10.1016/j.xops.2021.100095.
Anderson, T. W. (2003). An introduction to multivariate analysis. Hoboken, N.J.: Wiley-Interscience.
Ando, T. (2007). Bayesian predictive information criterion for the evaluation of hierarchical Bayesian and empirical bayes models. Biometrika, 94(2), 443–458, https://doi.org/10.1093/biomet/asm017. [CrossRef]
Ando, T. (2011). Predictive Bayesian model selection. American Journal of Mathematical and Management Sciences, 31(1–2), 13–38, https://doi.org/10.1080/01966324.2011.10737798.
Bellmann, C., Unnebrink, K., Rubin, G. S., Miller, D., & Holz, F. G. (2003). Visual acuity and contrast sensitivity in patients with neovascular age-related macular degeneration. Graefe's Archive for Clinical and Experimental Ophthalmology, 241(12), 968–974, https://doi.org/10.1007/s00417-003-0689-6.
Brown, B., & Lovie-Kitchin, J. E. (1989). High and low contrast acuity and clinical contrast sensitivity tested in a normal population. Optometry and Vision Science, 66(7), 467–473, https://doi.org/10.1097/00006324-198907000-00010.
Cassey, P. J., Gaut, G., Steyvers, M., & Brown, S. D. (2016). A generative joint model for spike trains and saccades during perceptual decision-making. Psychonomic Bulletin & Review, 23(6), 1757–1778, https://doi.org/10.3758/s13423-016-1056-z. [PubMed]
Clayton, D., & Hills, M. (1993). Statistical models in epidemiology. Oxford, UK: Oxford University Press.
Cook, N., Hansen, A. R., Siu, L. L., & Razak, Abdul, A. R. (2015). Early phase clinical trials to identify optimal dosing and safety. Molecular Oncology, 9(5), 997–1007, https://doi.org/10.1016/j.molonc.2014.07.025. [PubMed]
Cormack, F. K., Tovee, M., & Ballard, C. (2000). Contrast sensitivity and visual acuity in patients with Alzheimer's disease. International Journal of Geriatric Psychiatry, 15(7), 614–620, https://doi.org/10.1002/1099-1166(200007)15:7<614::AID-GPS153>3.0.CO;2-0. [PubMed]
Cukras, C., Wiley, H. E., Jeffrey, B. G., Sen, H. N., Turriff, A., Zeng, Y., &hellip; Sieving, P. A. (2018). Retinal AAV8-RS1 gene therapy for X-linked retinoschisis: Initial findings from a Phase I/IIa trial by intravitreal delivery. Molecular Therapy: The Journal of the American Society of Gene Therapy, 26(9), 2282–2294, https://doi.org/10.1016/j.ymthe.2018.05.025. [PubMed]
da Cruz, L., Fynes, K., Georgiadis, O., Kerby, J., Luo, Y. H., Ahmado, A., &hellip; Coffey, P. J. (2018). Phase 1 clinical study of an embryonic stem cell-derived retinal pigment epithelium patch in age-related macular degeneration. Nature Biotechnology, 36(4), 328–337, https://doi.org/10.1038/nbt.4114. [PubMed]
Edwards, W., Lindman, H., & Savage, L. J. (1963). Bayesian statistical inference for psychological research. Psychological Review, 70(3), 193–242, https://doi.org/10.1037/h0044139.
Elliott, D. B., & Situ, P. (1998). Visual acuity versus letter contrast sensitivity in early cataract. Vision Research, 38(13), 2047–2052, https://doi.org/10.1016/S0042-6989(97)00382-9. [PubMed]
Gelman, A., & Rubin, D. B. (1992). Inference from iterative simulation using multiple sequences. Statistical Science, 7(4), 457–472.
Green, D. M., & Swets, J. A. (1966). Signal detection theory and psychophysics (pp. xi, 455). Hoboken, NJ: John Wiley.
Gu, H., Kim, W., Hou, F., Lesmes, L. A., Pitt, M. A., Lu, Z.-L., & et al. (2016). A hierarchical Bayesian approach to adaptive vision testing: A case study with the contrast sensitivity function. Journal of Vision, 16(6), 15, https://doi.org/10.1167/16.6.15. [PubMed]
de Guimaraes, T. A. C., Georgiou, M., Bainbridge, J. W. B., & Michaelides, M. (2021). Gene therapy for neovascular age-related macular degeneration: Rationale, clinical trials and future directions. British Journal of Ophthalmology, 105(2), 151–157, https://doi.org/10.1136/bjophthalmol-2020-316195.
Haegerstrom-Portnoy, G., Schneck, M. E., & Brabyn, J. A. (1999). Seeing into old age: Vision function beyond acuity. Optometry and Vision Science: Official Publication of the American Academy of Optometry, 76(3), 141–158, https://doi.org/10.1097/00006324-199903000-00014. [PubMed]
Hawkins, A. S., Szlyk, J. P., Ardickas, Z., Alexander, K. R., & Wilensky, J. T. (2003). Comparison of contrast sensitivity, visual acuity, and humphrey visual field testing in patients with glaucoma. Journal of Glaucoma, 12(2), 134. [PubMed]
Hou, F., Huang, C.-B., Lesmes, L., Feng, L.-X., Tao, L., Zhou, Y.-F., & et al. (2010). qCSF in clinical application: Efficient characterization and classification of contrast sensitivity functions in amblyopia. Investigative Ophthalmology & Visual Science, 51(10), 5365–5377, https://doi.org/10.1167/iovs.10-5468. [PubMed]
Hou, F., Lesmes, L. A., Kim, W., Gu, H., Pitt, M. A., Myung, J. I., & et al. (2016). Evaluating the performance of the quick CSF method in detecting contrast sensitivity function changes. Journal of Vision, 16(6), 18, https://doi.org/10.1167/16.6.18. [PubMed]
Hou, F., Lesmes, L., Bex, P., Dorr, M., & Lu, Z.-L. (2015). Using 10AFC to further improve the efficiency of the quick CSF method. Journal of Vision, 15(9), 2, https://doi.org/10.1167/15.9.2. [PubMed]
Huang, D., Swanson, E., Lin, C., Schuman, J., Stinson, W., & Fujimoto, J. (1991). Optical coherence tomography. Science, 254(5035), 1178–1181, https://doi.org/10.1126/science.1957169. [PubMed]
Jia, W., Zhou, J., Lu, Z.-L., Lesmes, L. A., & Huang, C.-B. (2015). Discriminating anisometropic amblyopia from myopia based on interocular inhibition. Vision Research, 114, 135–141, https://doi.org/10.1016/j.visres.2015.02.003. [PubMed]
Kim, W., Pitt, M. A., Lu, Z.-L., Steyvers, M., & Myung, J. I. (2014). A hierarchical adaptive approach to optimal experimental design. Neural Computation, 26(11), 2465–2492, https://doi.org/10.1162/NECO_a_00654. [PubMed]
Kiser, A. K., Mladenovich, D., Eshraghi, F., Bourdeau, D., & Dagnelie, G. (2005). Reliability and consistency of visual acuity and contrast sensitivity measures in advanced eye disease. Optometry and Vision Science, 82(11), 946–954, https://doi.org/10.1097/01.opx.0000187863.12609.7b.
Kramer, B. S., Wilentz, J., Alexander, D., Burklow, J., Friedman, L. M., Hodes, R., &hellip; Straus, S. E. (2006). Getting it right: Being smarter about clinical trials. PLOS Medicine, 3(6), e144, https://doi.org/10.1371/journal.pmed.0030144. [PubMed]
Kruschke, J. K. (2014). Doing Bayesian data analysis: A tutorial with R, JAGS, and Stan. (2nd ed.). Cambridge, MA: Academic Press.
Kruschke, J. K., & Liddell, T. M. (2018). Bayesian data analysis for newcomers. Psychonomic Bulletin & Review, 25(1), 155–177, https://doi.org/10.3758/s13423-017-1272-1. [PubMed]
Lee, M. D. (2006). A hierarchical Bayesian model of human decision-making on an optimal stopping problem. Cognitive Science, 30(3), 1–26, https://doi.org/10.1207/s15516709cog0000_69. [PubMed]
Lesmes, L. A., Bex, P., Lu, Z.-L., Bittner, A. K., Ramulu, P. Y., Stellmann, J.-P., & et al. (2016). A survey of contrast sensitivity in visual neuropathology. Investigative Ophthalmology & Visual Science, 57(12), 5161–5161. [PubMed]
Lesmes, L. A., & Dorr, M. (2019). Active learning for visual acuity testing. Proceedings of the 2nd International Conference on Applications of Intelligent Systems, 1–6, https://doi.org/10.1145/3309772.3309798.
Lesmes, L. A., Lu, Z.-L., Baek, J., & Albright, T. D. (2010). Bayesian adaptive estimation of the contrast sensitivity function: The quick CSF method. Journal of Vision, 10(3), 17.1–21, https://doi.org/10.1167/10.3.17. [PubMed]
Lord, S. R. (2006). Visual risk factors for falls in older people. Age and Ageing, 35(Suppl 2), ii42–ii45, https://doi.org/10.1093/ageing/afl085. [PubMed]
McLeod, C., Norman, R., Litton, E., Saville, B. R., Webb, S., & Snelling, T. L. (2019). Choosing primary endpoints for clinical trials of health care interventions. Contemporary Clinical Trials Communications, 16, 100486, https://doi.org/10.1016/j.conctc.2019.100486. [PubMed]
Merkle, E. C., Smithson, M., & Verkuilen, J. (2011). Hierarchical models of simple mechanisms underlying confidence in decision making. Journal of Mathematical Psychology, 55(1), 57–67, https://doi.org/10.1016/j.jmp.2010.08.011.
Midena, E., Angeli, C. D., Blarzino, M. C., Valenti, M., & Segato, T. (1997). Macular function impairment in eyes with early age-related macular degeneration. Investigative Ophthalmology & Visual Science, 38(2), 469–477. [PubMed]
Molloy, M. F., Bahg, G., Li, X., Steyvers, M., Lu, Z.-L., & Turner, B. M. (2018). Hierarchical Bayesian analyses for modeling BOLD time series data. Computational Brain & Behavior, 1(2), 184–213, https://doi.org/10.1007/s42113-018-0013-5.
Molloy, M. F., Bahg, G., Lu, Z.-L., & Turner, B. M. (2019). Individual differences in the neural dynamics of response inhibition. Journal of Cognitive Neuroscience, 31(12), 1976–1996, https://doi.org/10.1162/jocn_a_01458. [PubMed]
Nordhausen, K., Sirkia, S., Oja, H., & Tyler, D. E. (2018). Tools for Multivariate Nonparametrics.Package ‘ICSNP’ in CRAN repository, https://cran.r-project.org/package=ICSNP.
Onal, S., Yenice, O., Cakir, S., & Temel, A. (2008). FACT contrast sensitivity as a diagnostic tool in glaucoma: FACT contrast sensitivity in glaucoma. International Ophthalmology, 28(6), 407–412, https://doi.org/10.1007/s10792-007-9169-z. [PubMed]
Ou, W. C., Lesmes, L. A., Christie, A. H., Denlar, R. A., & Csaky, K. G. (2021). Normal- and low-luminance automated quantitative contrast sensitivity assessment in eyes with age-related macular degeneration. American Journal of Ophthalmology, 226, 148–155, https://doi.org/10.1016/j.ajo.2021.01.017. [PubMed]
Owsley, C., & McGwin, G. (2010). Vision and driving. Vision Research, 50(23), 2348–2361, https://doi.org/10.1016/j.visres.2010.05.021. [PubMed]
Palestro, J. J., Bahg, G., Sederberg, P. B., Lu, Z.-L., Steyvers, M., & Turner, B. M. (2018). A tutorial on joint models of neural and behavioral measures of cognition. Journal of Mathematical Psychology, 84, 20–48, https://doi.org/10.1016/j.jmp.2018.03.003.
Plummer, M. (2003). JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling. Proceedings of the 3rd International Workshop on Distributed Statistical Computing. 124(125.10), 1–10.
R Core Team. (2003). R: A language and environment for statistical computing. R Foundation for Statistical Computing, https://www.R-project.org/.
Ramulu, P. Y., Dave, P., & Friedman, D. S. (2015). Precision of contrast sensitivity testing in glaucoma. Investigative Ophthalmology & Visual Science, 56(7), 2225–2225.
Reichel, F. F., Michalakis, S., Wilhelm, B., Zobor, D., Muehlfriedel, R., Kohl, S., & Fischer, D. (2022). Three-year results of phase I retinal gene therapy trial for CNGA3-mutated achromatopsia: Results of a non randomised controlled trial. The British Journal of Ophthalmology, 106(11), 1567–1572, https://doi.org/10.1136/bjophthalmol-2021-319067. [PubMed]
Roark, M. W., & Stringham, J. M. (2019). Visual performance in the “Real World”: Contrast sensitivity, visual acuity, and effects of macular carotenoids. Molecular Nutrition & Food Research, 63(15), 1801053, https://doi.org/10.1002/mnfr.201801053.
Rohaly, A. M., & Owsley, C. (1993). Modeling the contrast-sensitivity functions of older adults. Journal of the Optical Society of America. A, Optics and Image Science, 10(7), 1591–1599, https://doi.org/10.1364/josaa.10.001591. [PubMed]
Rosenberger, W. F., & Haines, L. M. (2002). Competing designs for phase I clinical trials: A review. Statistics in Medicine, 21(18), 2757–2770, https://doi.org/10.1002/sim.1229. [PubMed]
Rouder, J. N., & Lu, J. (2005). An introduction to Bayesian hierarchical models with an application in the theory of signal detection. Psychonomic Bulletin & Review, 12(4), 573–604, https://doi.org/10.3758/bf03196750. [PubMed]
Rouder, J. N., Sun, D. C., Speckman, P. L., Lu, J., & Zhou, D. (2003). A hierarchical Bayesian statistical framework for response time distributions. Psychometrika, 68(4), 589–606, https://doi.org/10.1007/BF02295614.
Rubin, G. S., Roche, K. B., Prasada-Rao, P., & Fried, L. P. (1994). Visual impairment and disability in older adults. Optometry and Vision Science, 71(12), 750.
Shandiz, J. H., Derakhshan, A., Daneshyar, A., Azimi, A., Moghaddam, H. O., Yekta, A. A., & Esmaily, H. (2011). Effect of cataract type and severity on visual acuity and contrast sensitivity. Journal of Ophthalmic & Vision Research, 6(1), 26–31. [PubMed]
Song, M., Behmanesh, I., Moaveni, B., & Papadimitriou, C. (2020). Accounting for modeling errors and inherent structural variability through a hierarchical Bayesian model updating approach: An overview. Sensors, 20(14), 3874, https://doi.org/10.3390/s20143874.
Stellmann, J. P., Young, K. L., Pottgen, J., Dorr, M., & Heesen, C. (2015). Introducing a new method to assess vision: Computer-adaptive contrast-sensitivity testing predicts visual functioning better than charts in multiple sclerosis patients. Multiple Sclerosis Journal - Experimental, Translational and Clinical, 1, 2055217315596184, https://doi.org/10.1177/2055217315596184. [PubMed]
Storer, B. E. (1989). Design and analysis of phase I clinical trials. Biometrics, 45(3), 925–937, https://doi.org/10.2307/2531693. [PubMed]
Swets, J. A., & Pickett, R. M. (1982). Evaluation of diagnostic systems: Methods from signal detection theory. New York: Academic Press.
Thall, P. F. (2008). A review of phase 2–3 clinical trial designs. Lifetime Data Analysis, 14(1), 37–53, https://doi.org/10.1007/s10985-007-9049-x. [PubMed]
Torres-Saavedra, P. A., & Winter, K. A. (2022). An overview of phase 2 clinical trial designs. International Journal of Radiation Oncology*Biology*Physics, 112(1), 22–29, https://doi.org/10.1016/j.ijrobp.2021.07.1700. [PubMed]
Turner, B. M. (2015). Constraining cognitive abstractions through Bayesian modeling. In An introduction to model-based cognitive neuroscience (pp. 199–220). Berlin: Springer.
Turner, B. M., Forstmann, B. U., Wagenmakers, E.-J., Brown, S. D., Sederberg, P. B., & Steyvers, M. (2013). A Bayesian framework for simultaneously modeling neural and behavioral data. NeuroImage, 72, 193–206, https://doi.org/10.1016/j.neuroimage.2013.01.048. [PubMed]
Turner, B. M., Rodriguez, C. A., Norcia, T. M., McClure, S. M., & Steyvers, M. (2016). Why more is better: Simultaneous modeling of EEG, fMRI, and behavioral data. Neuroimage, 128, 96–115, https://doi.org/10.1016/j.neuroimage.2015.12.030. [PubMed]
Turner, B. M., Van Maanen, L., & Forstmann, B. U. (2015). Informing cognitive abstractions through neuroimaging: The neural drift diffusion model. Psychological Review, 122(2), 312. [PubMed]
Turner, B. M., Wang, T., & Merkle, E. C. (2017). Factor analysis linking functions for simultaneously modeling neural and behavioral data. NeuroImage, 153, 28–48. [PubMed]
Umscheid, C. A., Margolis, D. J., & Grossman, C. E. (2011). Key concepts of clinical trials: A narrative review. Postgraduate Medicine, 123(5), 194–204, https://doi.org/10.3810/pgm.2011.09.2475. [PubMed]
US Department of Health and Human Services, Food and Drug Administration (FDA), & Center for Devices and Radiological Health. (2007). Statistical Guidance on Reporting Results from Studies Evaluating Diagnostic Tests—Guidance for Industry and FDA Staff (p. Appendices 7.1). Diagnostic Devices Branch Division of Biostatistics Office of Surveillance and Biometrics.
US Department of Health and Human Services, Food and Drug Administration (FDA), Center for Drug Evaluation and Research, & Center for Biologics Evaluation and Research (CBER). (1998). Guidance for Industry E9 Statistical Principles for Clinical Trials.
US Department of Health and Human Services, Food and Drug Administration (FDA), Center for Drug Evaluation and Research, & Center for Biologics Evaluation and Research (CBER). (2022). Multiple Endpoints in Clinical Trials Guidance for Industry. US Department of Health and Human Services, Food and Drug Administration (FDA), Center for Drug Evaluation and Research (CDER) and Center for Biologics Evaluation and Research (CBER).
Vingopoulos, F., Wai, K. M., Katz, R., Vavvas, D. G., Kim, L. A., & Miller, J. B. (2021). Measuring the contrast sensitivity function in non-neovascular and neovascular age-related macular degeneration: The quantitative contrast sensitivity function test. Journal of Clinical Medicine, 10(13), Article 13, https://doi.org/10.3390/jcm10132768.
Vujosevic, S., Parra, M. M., Hartnett, M. E., O'Toole, L., Nuzzi, A., Limoli, C., &hellip; Nucci, P. (2023). Optical coherence tomography as retinal imaging biomarker of neuroinflammation/neurodegeneration in systemic disorders in adults and children. Eye, 37(2), 203–219, https://doi.org/10.1038/s41433-022-02056-9.
Wai, K. M., Vingopoulos, F., Garg, I., Kasetty, M., Silverman, R. F., Katz, R., & Miller, J. B. (2022). Contrast sensitivity function in patients with macular disease and good visual acuity. British Journal of Ophthalmology, 106(6), 839–844, https://doi.org/10.1136/bjophthalmol-2020-318494.
Watson, A. B., & Ahumada, A. J. (2005). A standard model for foveal detection of spatial contrast. Journal of Vision, 5(9), 717–740, https://doi.org/10.1167/5.9.6. [PubMed]
West, S. K., Rubin, G. S., Broman, A. T., Muñoz, B., Bandeen-Roche, K., & Turano, K. (2002). How does visual impairment affect performance on tasks of everyday life? The see project. Salisbury eye evaluation. Archives of Ophthalmology (Chicago, Ill.: 1960), 120(6), 774–780, https://doi.org/10.1001/archopht.120.6.774. [PubMed]
Whittaker, S. G., & Lovie-Kitchin, J. (1993). Visual requirements for reading. Optometry and Vision Science: Official Publication of the American Academy of Optometry, 70(1), 54–65, https://doi.org/10.1097/00006324-199301000-00010. [PubMed]
Wichmann, F. A., & Hill, N. J. (2001). The psychometric function: I. Fitting, sampling, and goodness of fit. Perception & Psychophysics, 63(8), 1293–1313, https://doi.org/10.3758/BF03194544. [PubMed]
Wilensky, J. T., & Hawkins, A. (2001). Comparison of contrast sensitivity, visual acuity, and Humphrey visual field testing in patients with glaucoma. Transactions of the American Ophthalmological Society, 99, 213–218. [PubMed]
Wilson, J. D., Cranmer, S., & Lu, Z.-L. (2020). A hierarchical latent space network model for population studies of functional connectivity. Computational Brain & Behavior, 3, 384–399, https://doi.org/10.1007/s42113-020-00080-0.
Yin, J., Qin, R., Sargent, D. J., Erlichman, C., & Shi, Q. (2018). A hierarchical Bayesian design for randomized phase II clinical trials with multiple groups. Journal of Biopharmaceutical Statistics, 28(3), 451–462, https://doi.org/10.1080/10543406.2017.1321007. [PubMed]
Zhao, Y., Lesmes, L. A., Dorr, M., Bex, P. J., & Lu, Z.-L. (2021). Psychophysical validation of a novel active learning approach for measuring the visual acuity behavioral function. Translational Vision Science & Technology, 10(1), 1–1, https://doi.org/10.1167/tvst.10.1.1. [PubMed]
Zhao, Y., Lesmes, L. A., Dorr, M., & Lu, Z.-L. (2021). Quantifying uncertainty of the estimated visual acuity behavioral function with hierarchical Bayesian modeling. Translational Vision Science & Technology, 10(12), 18, https://doi.org/10.1167/tvst.10.12.18. [PubMed]
Zhao, Y., Lesmes, L. A., Hou, F., & Lu, Z.-L. (2021). Hierarchical Bayesian modeling of contrast sensitivity functions in a within-subject design. Journal of Vision, 21(12), 9, https://doi.org/10.1167/jov.21.12.9. [PubMed]
Zimmern, R. L., Campbell, F. W., & Wilkinson, I. M. (1979). Subtle disturbances of vision after optic neuritis elicited by studying contrast sensitivity. Journal of Neurology, Neurosurgery & Psychiatry, 42(5), 407–412, https://doi.org/10.1136/jnnp.42.5.407.
Figure 1.
 
Illustration of a two-dimensional CE. (A) The gray iso-density contours represent two-dimensional distributions of a VA-CSF CE in two conditions. The blue and purple curves represent VA and CSF distributions computed from the BIP. The black curves represent the marginal distributions of CE components on the VA and CSF axes, and the projection of CE on the decision axis. The p values for detecting the change between the two conditions are 0.15 based on VA and CSF from BIP, 0.10 based on VA and CSF from CE, and 0.006 based on CE. (B) Signals from BIP analyses of VA (blue) and CSF (purple), and from CE components and CE (black). (C) Noise from BIP analyses of VA (blue) and CSF (purple), and from CE components and CE (black). (D) Signal-to-noise ratios from BIP analyses of VA (blue) and CSF (purple), and from CE components and CE (black).
Figure 1.
 
Illustration of a two-dimensional CE. (A) The gray iso-density contours represent two-dimensional distributions of a VA-CSF CE in two conditions. The blue and purple curves represent VA and CSF distributions computed from the BIP. The black curves represent the marginal distributions of CE components on the VA and CSF axes, and the projection of CE on the decision axis. The p values for detecting the change between the two conditions are 0.15 based on VA and CSF from BIP, 0.10 based on VA and CSF from CE, and 0.006 based on CE. (B) Signals from BIP analyses of VA (blue) and CSF (purple), and from CE components and CE (black). (C) Noise from BIP analyses of VA (blue) and CSF (purple), and from CE components and CE (black). (D) Signal-to-noise ratios from BIP analyses of VA (blue) and CSF (purple), and from CE components and CE (black).
Figure 2.
 
(A) The VABF with two parameters: threshold \(\theta _{ij,Threshold}^{VA}\;\)and range \(\theta _{ij,Range}^{VA}\). (B) Simulated row-by-row data generated by the VABF model in a VA assessment. (C) The log parabola model of the CSF with three parameters: peak gain \(\theta _{ij,PG}^{CSF}\), peak spatial frequency \(\theta _{ij,PF}^{CSF}\), and bandwidth \(\theta _{ij,BH}^{CSF}\). (D) Psychometric functions at different spatial frequencies in a CSF test. (E) Simulated row-by-row data generated by the CSF model in a CSF assessment.
Figure 2.
 
(A) The VABF with two parameters: threshold \(\theta _{ij,Threshold}^{VA}\;\)and range \(\theta _{ij,Range}^{VA}\). (B) Simulated row-by-row data generated by the VABF model in a VA assessment. (C) The log parabola model of the CSF with three parameters: peak gain \(\theta _{ij,PG}^{CSF}\), peak spatial frequency \(\theta _{ij,PF}^{CSF}\), and bandwidth \(\theta _{ij,BH}^{CSF}\). (D) Psychometric functions at different spatial frequencies in a CSF test. (E) Simulated row-by-row data generated by the CSF model in a CSF assessment.
Figure 3.
 
Illustrations of the test stimuli in one row used in the (A) qVA and (B) qCSF.
Figure 3.
 
Illustrations of the test stimuli in one row used in the (A) qVA and (B) qCSF.
Figure 4.
 
(A) The generative model of visual acuity specifies the VABF of individual i in test j with two parameters: \(\theta _{ij}^{VA} = ( {\theta _{ij,Threshold}^{VA},\theta _{ij,Range}^{VA}} )\), where \(\theta _{ij,Threshold}^{VA}\) and \(\theta _{ij,Range}^{VA}\) are the VA threshold and VA range. (B) The generative model of CSF specifies the CSF of individual i in test j with three parameters\(:\;\theta _{ij}^{CSF} = ( {\theta _{ij,PG}^{CSF},\theta _{ij,PF}^{CSF},\theta _{ij,BH}^{CSF}} )\), where \(\theta _{ij,PG}^{CSF}\;\)is the peak sensitivity, \(\theta _{ij,PF}^{CSF}\) is the peak spatial frequency (cycles/degree), and \(\theta _{ij,BH}^{CSF}\) is the bandwidth (octaves) at half of the peak sensitivity. (C) A three-level HBJM of the VABF and CSF across multiple individuals and tests. At the population level, μCE and Σ are the mean and covariance hyperparameters of the population. At the individual level \(\rho _i^{{\rm{CE}}}\), ϕCSF, and ϕVA are the mean and covariance hyperparameters of individual i. At the test level, \(\theta _{ij}^{CE}\) are the parameters of individual i in test j.
Figure 4.
 
(A) The generative model of visual acuity specifies the VABF of individual i in test j with two parameters: \(\theta _{ij}^{VA} = ( {\theta _{ij,Threshold}^{VA},\theta _{ij,Range}^{VA}} )\), where \(\theta _{ij,Threshold}^{VA}\) and \(\theta _{ij,Range}^{VA}\) are the VA threshold and VA range. (B) The generative model of CSF specifies the CSF of individual i in test j with three parameters\(:\;\theta _{ij}^{CSF} = ( {\theta _{ij,PG}^{CSF},\theta _{ij,PF}^{CSF},\theta _{ij,BH}^{CSF}} )\), where \(\theta _{ij,PG}^{CSF}\;\)is the peak sensitivity, \(\theta _{ij,PF}^{CSF}\) is the peak spatial frequency (cycles/degree), and \(\theta _{ij,BH}^{CSF}\) is the bandwidth (octaves) at half of the peak sensitivity. (C) A three-level HBJM of the VABF and CSF across multiple individuals and tests. At the population level, μCE and Σ are the mean and covariance hyperparameters of the population. At the individual level \(\rho _i^{{\rm{CE}}}\), ϕCSF, and ϕVA are the mean and covariance hyperparameters of individual i. At the test level, \(\theta _{ij}^{CE}\) are the parameters of individual i in test j.
Figure 5.
 
(A) Scatter plots of xs,B (red dots) from the base-line distribution p(x|B) and xs,T (blue dots) from the treatment distribution p(x|T). Black dashed line represents a criterion of \(\frac{{pdf( {x{\rm{|}}B} )}}{{pdf( {x{\rm{|}}T} )}} = 1\) and the iso-density contours illustrate the best fitting Gaussian mixture models to the two distributions. (B) The bold black curve is the ROC curve. The grey area under the ROC is the AUROC, a criterion-free estimate of the test accuracy. Black dashed line represents chance performance.
Figure 5.
 
(A) Scatter plots of xs,B (red dots) from the base-line distribution p(x|B) and xs,T (blue dots) from the treatment distribution p(x|T). Black dashed line represents a criterion of \(\frac{{pdf( {x{\rm{|}}B} )}}{{pdf( {x{\rm{|}}T} )}} = 1\) and the iso-density contours illustrate the best fitting Gaussian mixture models to the two distributions. (B) The bold black curve is the ROC curve. The grey area under the ROC is the AUROC, a criterion-free estimate of the test accuracy. Black dashed line represents chance performance.
Figure 6.
 
Two-dimensional marginal posterior distributions of ηCE.
Figure 6.
 
Two-dimensional marginal posterior distributions of ηCE.
Figure 7.
 
Two-dimensional marginal posterior distributions of \(\tau _i^{{\rm{CE}}}\) for one individual from the HBJM.
Figure 7.
 
Two-dimensional marginal posterior distributions of \(\tau _i^{{\rm{CE}}}\) for one individual from the HBJM.
Figure 8.
 
Two-dimensional marginal posterior distributions of \(\theta _{i1}^{{\rm{CE}}}\) at the test level for the same individual in Figure 7 from the HBJM.
Figure 8.
 
Two-dimensional marginal posterior distributions of \(\theta _{i1}^{{\rm{CE}}}\) at the test level for the same individual in Figure 7 from the HBJM.
Figure 9.
 
Two-dimensional marginal posterior distributions of \(\theta _{i1}^{VA}\) and \(\theta _{i1}^{CSF}\) for the same individual in Figure 7 from the BIP.
Figure 9.
 
Two-dimensional marginal posterior distributions of \(\theta _{i1}^{VA}\) and \(\theta _{i1}^{CSF}\) for the same individual in Figure 7 from the BIP.
Figure 10.
 
Histograms of the variance of \(\theta _{i1}^{{\rm{BIP}}}\) and \(\theta _{i1}^{{\rm{CE}}}\) components.
Figure 10.
 
Histograms of the variance of \(\theta _{i1}^{{\rm{BIP}}}\) and \(\theta _{i1}^{{\rm{CE}}}\) components.
Figure 11.
 
(A–C) Average signal between F1 and F2, F1 and F3, and F2 and F3 for VA1D and the VA component of CE5D across 14 eyes. (D–F) Average signal between F1 and F2, F1 and F3, and F2 and F3 for AULCSF1D and AULCSF based on CE5D. (G–I) Average variance of the difference distributions between F1 and F2, F1 and F3, and F2 and F3 for VA1D and the VA component of CE5D. (J–L) Average variance of the difference distributions between F1 and F2, F1 and F3, and F2 and F3 for AULCSF1D and AULCSF based on CE5D. (1dB = 0.1 log10 units).
Figure 11.
 
(A–C) Average signal between F1 and F2, F1 and F3, and F2 and F3 for VA1D and the VA component of CE5D across 14 eyes. (D–F) Average signal between F1 and F2, F1 and F3, and F2 and F3 for AULCSF1D and AULCSF based on CE5D. (G–I) Average variance of the difference distributions between F1 and F2, F1 and F3, and F2 and F3 for VA1D and the VA component of CE5D. (J–L) Average variance of the difference distributions between F1 and F2, F1 and F3, and F2 and F3 for AULCSF1D and AULCSF based on CE5D. (1dB = 0.1 log10 units).
Figure 12.
 
Medium p values between F1 and F2 (A), F1 and F3 (B,), and F2 and F3 (C) for VA1D , AULCSF1D, and CE5D in the 14 eyes.
Figure 12.
 
Medium p values between F1 and F2 (A), F1 and F3 (B,), and F2 and F3 (C) for VA1D , AULCSF1D, and CE5D in the 14 eyes.
Figure 13.
 
(A–C) Sensitivity at 95% specificity between F1-F2, F1-F3, and F2-F3 condition pairs as functions of sample size for VA1D, AULCSF1D, and CE5D. (D–F) Accuracy between F1-F2, F1-F3, and F2-F3 condition pairs. (G–I) p value between F1-F2, F1-F3, and F2-F3 condition pairs.
Figure 13.
 
(A–C) Sensitivity at 95% specificity between F1-F2, F1-F3, and F2-F3 condition pairs as functions of sample size for VA1D, AULCSF1D, and CE5D. (D–F) Accuracy between F1-F2, F1-F3, and F2-F3 condition pairs. (G–I) p value between F1-F2, F1-F3, and F2-F3 condition pairs.
Table 1.
 
Correlations of ηCEcomponents at the population level.
Table 1.
 
Correlations of ηCEcomponents at the population level.
Table 2.
 
Correlations of \(\tau _i^{{\rm{CE}}}\) components, averaged across all individuals and with SD in parentheses.
Table 2.
 
Correlations of \(\tau _i^{{\rm{CE}}}\) components, averaged across all individuals and with SD in parentheses.
Table 3.
 
Correlations of \(\theta _{i1}^{{\rm{CE}}}\) components, averaged across all tests and with SD in parentheses.
Table 3.
 
Correlations of \(\theta _{i1}^{{\rm{CE}}}\) components, averaged across all tests and with SD in parentheses.
Table 4.
 
Correlations of \(\theta _{i1}^{CSF}\) and\(\;\theta _{i1}^{VA}\) components, averaged across all tests and with SD in parentheses.
Table 4.
 
Correlations of \(\theta _{i1}^{CSF}\) and\(\;\theta _{i1}^{VA}\) components, averaged across all tests and with SD in parentheses.
Table 5.
 
Average (SE) sensitivity (%) at 95% specificity of VA1D, AULCSF1D, and CE5D across all eyes between six condition pairs.
Table 5.
 
Average (SE) sensitivity (%) at 95% specificity of VA1D, AULCSF1D, and CE5D across all eyes between six condition pairs.
Table 6.
 
Average (SE) accuracy (%) of VA1D, AULCSF1D, and CE5D across all eyes between six condition pairs.
Table 6.
 
Average (SE) accuracy (%) of VA1D, AULCSF1D, and CE5D across all eyes between six condition pairs.
Table 7.
 
Number of eyes required to reach significant differences (p < 0.01) based on VA1D, AULCSF1D, and CE5D.
Table 7.
 
Number of eyes required to reach significant differences (p < 0.01) based on VA1D, AULCSF1D, and CE5D.
×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×