Here, we use a direct conditional moment approach to measure the joint statistics relevant for the three simple visual tasks illustrated in
Figure 1. The first task is estimation of missing image points (
Figure 1a). The symbols
r,
s,
t, and
u represent observed values along a row of image pixels and
x represents an unobserved value to be estimated. This kind of task arises when some image locations are occluded or a sensor element is missing. A different variant of the task arises when only some of the color values are absent at a location (e.g., the demosaicing task, Brainard, Williams, & Hofer,
2008; Li, Gunturk, & Zhang,
2008, and related tasks, Zhang & Brainard,
2004). The second task is estimation of a high-resolution image from a low-resolution image (
Figure 1b). Again, the symbols
r,
s,
t, and
u represent observed low spatial resolution values obtained by locally averaging and then downsampling a higher resolution image;
x and
y represent unobserved values to be estimated. This kind of task arises naturally in interpreting (decoding) retinal responses in the periphery. For example,
Figure 1b corresponds approximately to the situation around 1-degree eccentricity in the human retina where the sampling by midget (P) ganglion cells is about one-half that in the fovea (one-fourth the samples per unit area). In the computer vision literature, the goal of this task is referred to as “super resolution” (Freeman, Thouis, Jones, & Pasztor,
2002; Glasner, Bagon, & Irani,
2009; Li & Adelson,
2008). The third task is estimation of a missing color channel given the other two. The symbols
s and
t represent the observed color values (e.g.,
G and
B) at a pixel location, and
x represents the unobserved value to be estimated (e.g.,
R). This is not a natural task, but it reveals the redundancy of color in natural scenes and sets an upper limit on how well a dichromat could estimate his missing color channel without knowledge of the spatial correlations in images or the objects in the scene. Furthermore, statistics measured for this task could be of use in other tasks (e.g., demosaicing). We find that the image statistics for these tasks are complex but regular (smooth), are useful for performing the tasks, and make many testable predictions for visual coding and visual performance.