Center For Perceptual Systems
Rubin Face or Vase Illusion
Home Prospective Graduate Students Research Austin Life  UT Home Page What's Happening?
About Wilson Geisler
Bayesian Ideal Observers
Natural Image Statistics
Perceptual Grouping
Motion Perception
Pattern Vision
Space Variant Imaging
Click here
Wilson Geisler Links

Spatial Vision: Single Neurons and Perception
W. S. Geisler and D. G. Albrecht

The topic of spatial vision concerns the fundamental mechanisms within the eye and the brain that analyze and represent the distribution of light across the visual field, with the ultimate goal of understanding how these mechanisms contribute to object recognition and scene interpretation in general.

A wealth of psychophysical and physiological research supports the view that stimulus selectivity plays a fundamental role in spatial vision. Psychophysical studies have provided evidence that the human visual system is selective along a number of stimulus dimensions including orientation, size, position, wavelength (color), speed of motion, direction of motion, and binocular disparity. These studies have shown that there are mechanisms ("channels") selective to different regions along each of these stimulus dimensions. Similarly, neurophysiological and anatomical studies have demonstrated that neurons in the visual pathway are selective along a number of stimulus dimensions and that this selectivity increases from the retina to the primary visual cortex. For example, photoreceptors are selective along a few stimulus dimensions (spatial position, wavelength, temporal frequency), whereas cortical neurons are selective along many stimulus dimensions (spatial position, wavelength, temporal frequency, orientation, spatial frequency, direction of motion, disparity, etc.). Concomitant with this increase in stimulus selectivity, there is an increase in the heterogeneity; that is, there is an increase in the complexity and diversity of the cells along all of the stimulus dimensions. Thus, for example, the intensity response functions of cones are all very similar from cell to cell, whereas the contrast response functions of cortical neurons are quite different from cell to cell.

A number of different explanations have been proposed for this emergence of stimulus selectivity along the visual pathway. One hypothesis is that this progressive selectivity is part of a hierarchical process, ultimately leading to single neurons which respond uniquely to specific real world objects (Barlow, 1972; Barlow, 1995): "The Neuron Doctrine." A second hypothesis is that this selectivity reflects a low redundancy code which is well matched to the statistics of natural images (Barlow, 1961; Barlow, 1989; Field, 1987; Olshausen & Field, 1997): "Sparse Coding."

An alternative hypothesis is that this selectivity is a critical step in segregating objects from their context: "Object Segregation." Objects of interest within the natural environment are generally located within a very complex context of other objects. In order to recognize an object of interest, the parts of the object must be separated from the parts of other objects. For example, to recognize a longhorn bull behind a barbed wire fence, it is necessary to separate the image features that define the wire fence from those that define the bull. Fortunately, context is much less of a problem on a local scale; it is relatively easy to identify the orientation, position, and color of local image contours. The selectivity of visual cortical neurons permits recognition of these local image properties, thus allowing subsequent grouping mechanisms to bind together the contours that define the fence separately from those that define the bull.

These three different explanations for stimulus selectivity are not necessarily incompatible. Object segregation or sparse coding could be a first step in producing single neurons tuned to real world objects. On the other hand, the processing following object segregation or sparse coding could be highly distributed. Further, having neurons matched to the statistics of the natural environment must surely be advantageous for both sparse coding and object segregation, given the constraint of limited resources. However, the goals of sparse coding and object segregation are quite different; hence, the specific selectivities, and how they are implemented, could well be different. It is important to keep all three explanations in mind, given that one's theoretical viewpoint can substantively influence the direction of future research.

In this chapter we will rely upon a wealth of psychophysical and physiological research to develop the topic of spatial vision with two themes in mind. The first theme concerns how stimulus selectivity develops along the visual pathway. The second theme concerns how the anatomical and physiological mechanisms of stimulus selectivity contribute to visual performance, and ultimately, object recognition and scene interpretation.