AVA logo Promoting vision research and its applications
 
  [Home] [About] [Membership] [Awards] [Updates] [Contact Us]

Meetings

Meeting Abstracts

E-bulletins

Vision Links

Vision Resources

AVA meeting at Aston

THE VISUAL BRAIN - Abstracts from meeting on 17 December 1997

Directionality, asymmetry, and infancy

O Braddick (Psychology Dept, University College London, Gower Street, London WC1E 6BT, UK; e-mail: ucjtsol@ucl.ac.uk)

Directional motion sensitivity is a pervasive feature of visual systems and a key element in many of the ways we use vision. However, in human development, behavioural directional discriminations and directionally-specific VEPs emerge later in infancy than sensitivity to static orientation. Possible neurobiological reasons for this sequence will be discussed.

The postnatal emergence of these directional functions needs to be reconciled with the presence from birth of optokinetic nystagmus (OKN) which also depends on directional selectivity. OKN in the newborn has a characteristic signature of monocular asymmetry: for each eye, temporal-to-nasal stimulus motion is much more effective than nasal-to-temporal. This asymmetry has been taken as evidence for an early, purely subcortical mechanism which cannot subserve cortical uses of motion information. Against this view is evidence from unilateral cerebral damage and from VEP responses, presumably cortical, which share this asymmetric signature. New findings from our laboratory show that the VEP and OKN asymmetries are not so simply related as has been supposed. Multiple directional motion systems interact in development, but understanding this interaction remains a challenge.

No 2nd-order input to optic flow perception

P J Bex, D R Simmons¶, I Mareschal# (Dept of Psychology, University of Glasgow, Florentine House, Hillhead Street, Glasgow G12 8QB, UK; ¶ Dept of Vision, Glasgow Caledonian University, City Campus, Cowcaddens Road, Glasgow G4 0BA, UK; #McGill Vision Research Centre, 687 Pine Avenue West, H 4-14, Montreal, Quebec, Canada H3A 1A1; fax: +44 141 339 8889; e-mail: bex@psy.gla.ac.uk)

Growing psychophysical and physiological evidence supports hierarchical models of motion processing in which local velocity signals are assembled to form receptive fields specialised for optic flow components. For example, the equality of detection and identification thresholds for both radiation and rotation (Gurney and Wright, 1996 Perception 25 5) requires the existence of labelled detectors for these patterns, and radiation looks faster than translation composed of identical local speeds (Bex and Makous, 1997 Vision Research in press) implicating two-stage analysis. We inspected 4 micropatterns that formed complex global configurations: whereas 1st-order micropatterns appeared as a single moving object viewed through 4 apertures, there was no such global coherence of 2nd-order micropatterns. This observation prompted a quantitative study of the contribution of 2nd-order motion to optic flow. Detection, identification and direction discrimination thresholds were measured for 1st- and 2nd-order patterns (sinusoidal contrast modulation of static and dynamic noise) of translatory, rotatory and radial motion comprising 4 local micropatterns identical except for orientation. The perceived speed of rotation and radiation was also matched to that of translation. For 1st-order patterns the results confirmed the equality of thresholds and that radiation looked faster than translation or rotation. For 2nd-order stimuli, direction discrimination thresholds were much higher than identification or detection thresholds and all patterns appeared to move at the same speed. The results provide no evidence for labelled detectors for 2nd-order rotation and radiation and suggest that there is no 2nd-order input to the perception of optic flow.

Temporal effects of attention on motion sensitivity

J E Raymond, H L O'Donnell, S P Tipper (School of Psychology, University of Wales, Bangor LL57 2DG, UK; e-mail: pss117@bangor.ac.uk)

Attention studies show that visual selection of a target can have a long lasting effect on identification of a subsequently presented second target. Can temporal effects of selection affect sensitivity to simple features and thereby provide an account for simple successive stimulus interactions? After viewing a brief (200 ms) episode of, say, rightward motion, observers are markedly less sensitive to rightwards motion and more sensitive to leftward motion in a second episode presented a short time later. We investigated attentional modulation of this successive stimulus interaction using a dual-task priming/adaptation paradigm. We asked healthy adults to select one direction from a briefly presented dynamic random dot display containing two orthogonal and transparent directions and then to judge the direction of motion in a second dot display containing a single direction presented after a brief interval. The coherence of the second display was varied. The interval between the two episodes was also varied. We found large losses in sensitivity to previously selected directions combined with enhanced sensitivity to previously ignored directions. There was no change in sensitivity for directions opposite to either previously seen directions. These effects can not be explained by notions of motion opponency or low-level data-driven sensory adaptation. They indicate that temporal effects of attention can dramatically alter sensitivity to simple stimulus features and provide a reasonable account for successive direction contrast effects.

Modelling biological motion analysis in terms of image flow

A Johnston, P W McOwan¶ (Department of Psychology, University College London, Gower Street, London WC1E 6BT, UK; ¶ and Department of Cybernetics, University of Reading, Reading, UK; fax: +44 171 436 4276; e-mail: a.johnston@ucl.ac.uk)

Models of biological motion analysis are typically based on correspondence matches between separate frames of an image sequence, detection of spatio-temporal orientation by matched filters (motion energy analysis) or the orientation of spatio-temporal image gradients. Here we consider an alternative approach based on the computation of image flow over segments of boundaries in the image. As a two dimensional spatial image moves over a boundary segment, it develops a spatio-temporal image which contains a history of the flow over the boundary. Integrating over this spatio-temporal region gives the total flow over the segment in a given temporal interval. We can also calculate the difference in the amount of light within the region at the beginning and end of the interval. Assuming conservation of the image brightness we can calculate the speed and direction of motion from the change in the image brightness over the interval and the various flows across segments of the boundary of the region in the image. Consideration is given to how this strategy might be implemented in the human and primate visual systems. Supported by the BBSRC/EPSRC Mathematical Biology Initiative.

Spatial segregation and the perception of motion

J Harris (Department of Psychology, University of Reading, Whiteknights, Reading RG6 6AL, UK; e-mail: J.P.Harris@reading.ac.uk)

How does the visual system allocate motion to different regions of the retinal image? The role of spatial information in this process was studied by measuring the strength of the motion aftereffect (by ratings of initial strength and timing of duration), produced by a standard adaptation regime, on a test window (with central fixation point) of previously moving stationary vertical test stripes, as a function of their relationship with stationary vertical surrounding stripes. In Experiment 1, it was found that MAEs were stronger and more durable when the test stripes were offset by 90 deg than when they were perfectly aligned with the surrounding stripes. This effect of offset was stronger than the effect of surrounding the window with a border. In Experiment 2, test stripe offset was varied systematically. MAE strength and duration rose with test stripe offset up to 72 deg, the largest offset between 0 and 180 deg used in this experiment. Although the MAE, on both measures, was reduced when offset was further increased to 180 deg (white stripes aligned with black), it was still higher at that offset than for 0 deg offset (white stripes aligned with white). In contrast, in a separate condition with no prior adaptation to motion, when subjects were asked to rate how separate the test windows appeared from the surround, ratings rose with offset not only up to 72 but then further up to 180 deg. In Experiment 3, which probed the range of offsets between 72 and 180 deg, segregation ratings were again higher for 180 than for 90 deg, but peaked at 166 deg offset, whereas MAE strength peaked at 72 deg offset. In summary, both MAE strength and judged segregation vary with the relationship between the test stripes and the surround. However, the former appears to depend more on size of offset or phase shift between the test and surrounding stripes, whereas the latter appears to depend more on the length of black/white border at the edge of the test window.

The quantized degrees-of-freedom in images and visual codes

J Daugman (The Computer Laboratory, University of Cambridge, Cambridge CB2 3QG, UK; e-mail: john.daugman@CL.cam.ac.uk)

Several classical results in signal processing and information theory can be summarized by the statement that the information contained in continuous, bandlimited signals (such as images) is in fact quantized. Despite the apparent denseness of such signals, the information they contain can be completely ennumerated as a countable list of "quanta". Five familiar examples which have influenced the visual sciences are:

1. Nyquist's (1911) Sampling Theorem: A continuous, bandlimited signal can be completely represented by a discrete set of samples.

2. Hartley's (1928) principle: The number of degrees-of-freedom in a signal of duration T and bandwidth W is at most 2WT.

3. Gabor's (1946) logons: The information diagram for a signal has a quantal grain structure, whose smallest "atoms" of independent data define a non-orthogonal expansion basis using complex-valued wavelets.

4. Shannon's (1948) Channel Coding Theorem: A continuous communication channel can transmit continuous information at a maximum rate (as measured in bits/second) specified by, among other things, its signal-to-noise ratio.

5. Logan's (1977) Theorem: For bandpass signals whose bandwidth is no greater than one octave, their zero-crossings alone (a finite set of points) completely specify the waveform (excluding AM signals) up to a scale factor.

In addition to these classical coding insights, strongly nonlinear ideas such as vector quantization (VQ) exploit the fact that only a tiny fraction of possible image structures actually do occur in real images, so it is more efficient simply to nominate the types that do occur (by matching to a finite discrete codebook) than to code for all possible ones. For the visual sciences, the legacy and influence of these insights about quantized degrees-of-freedom extend to: optical quality in relation to retinal sampling; understanding 2-D receptive field profiles; limits of perceptual uptake; explanations of illusions and isomeres; and modelling the efficiency of visual codes in terms of image compression and recognition. The above family of classical results will be briefly reviewed and applied to a contemporary problem in pattern recognition and computer vision, in which high-confidence visual recognitions of personal identity are achieved in real time simply by detecting a failure of statistical independence in mathematically encoded iris patterns. The decision environments created thereby have a decidability (or detectability, d-prime) index exceeding 11.

Visual perception and the higher-order structure of psychophysical stimuli.

M G A Thomson (Vision Sciences, Aston University, Birmingham B4 7ET, UK; fax: +44 121 333 4220; e-mail: M.G.A.Thomson@aston.ac.uk)

One of the goals of visual psychophysics is to determine a representation which relates the perceptually significant structure of visual stimuli to their physical properties. A useful representation would work equally well for both classical experimental stimuli (bars, gratings, etc.) and `natural' stimuli but has so far proved elusive. For example, there has been much debate over the relative importance of low and high spatial frequencies in bar detection/perception, and the roles of Fourier amplitude, phase and orientation in natural-image perception/discrimination are still unclear. Higher-order statistical image measures have, however, met with some success in predicting psychophysical results for both classical (Klein and Tyler, 1986 Journal of the Optical Society of America A 3 868-879) and natural (Thomson and Foster, 1997 Journal of the Optical Society of America A 14 2081-2090) visual stimuli. The work reported here considered third-order statistics: this is the lowest-order global measure sensitive to both amplitude and relative phase. Computational techniques such as phase randomization, phase quantization, and phase-amplitude hybridization were used to distort a variety of classical and natural psychophysical stimuli. The visual appearance of these distorted stimuli was then related to changes in their third-order structure. The results illustrate the potential value of adopting these higher-order physical measures and will be discussed in terms of likely sensory representations.

How amblyopic vision uses phase and amplitude information in natural images

C A Parraga, G M Kennedy, D J Tolhurst (Department of Physiology, Downing Street, Cambridge, CB2 3EG, UK; e-mail: djt12@cam.ac.uk)

As many as 5% of the population may suffer from amblyopia, a condition in which vision is impaired in one eye without obvious pathologies of the eye or visual pathways and cannot be corrected by refractive means. Unfortunately, simple laboratory or clinical tests of visual performance may not always reflect the full nature of this disability. Our objective was to identify the real extent of the visual disability of amblyopia using stimuli related to everyday visual tasks. A modified two-alternative forced-choice experiment was used to measure sensitivity to changes in natural images caused by systematic perturbations of the phases or amplitudes of the coefficients in their Fourier spectra (Tadmor and Tolhurst, 1994 Vision Research 34 541-554; Thomson and Foster, 1995 Journal of Physiology 485P 25P). Experiments were performed on amblyopic subjects using one eye at a time. Subjects had to discriminate between test images made by gradually exchanging the phase spectra between two different natural images (5 subjects) or by gradually randomising the phase spectrum (7 different subjects). Both kinds of phase perturbation were much less easily detected when the amblyopic eye was used than when the good eye was used. These effects were equally pronounced when the subject has to discriminate between images with different Fourier amplitude spectra (the second 7 subjects).

Supported by Fight for Sight.

Simplifying images

L D Griffin (Vision Sciences, Aston University, Birmingham, B4 7ET, UK; fax: +44 121 333 4220; e-mail: l.d.griffin@aston.ac.uk)

Many schemes for simplifying images have been proposed; the most notable, based on progressive Gaussian blurring, may be achieved either by `viewing' the image through apertures of increasing size or by running the diffusion equation Lt=Lxx+L yy (where the subscripts indicated differentiation with respect to time t and position x, y). Other simplification schemes can be cast only in the equational form and not in the aperture form. For example, mean curvature flow (MCF) simplifies an image by moving isophotes at a speed proportional to their curvature; in equational form, Lt=Lvv, where v is in the direction tangent to the isophote. Recently, Guichard and Morel (1996 Ceremade Technical Report #9335) proved that MCF could be cast as an aperture scheme but only if the apertures were of infinitesimal size and applied iteratively to progressively simplify the image. They proved that MCF is equivalent to iterated infinitesimal (i.i.) median filtering and they noted the parallel with the well-known fact that Gaussian blurring is equivalent to i.i. mean filtering. This suggested a generalization that has now been proved: i.i. mode filtering is equivalent to the evolution scheme Lt=Lvv-2L ww (where the v is as before and w is in the gradient direction). With i.i. mode filtering, unlike i.i. mean filtering or i.i. median filtering, the image stops changing before it becomes constant-valued: instead, the endpoint is a mosaic of constant regions separated by discontinuities. This result may explain why a similar formula, Lt=Lvv-3L ww, proposed by Gabor (1965 Laboratory Investigations 14 801-807) is image-enhancing.

Automatic face representation for unfamiliar-view face recognition

D B Graham, N M Allinson (Dept of Electrical Engineering and Electronics, UMIST, Sackville Street, PO Box 88, Manchester, M60 1QD, UK; fax: +44 161 200 4673; e-mail: danny@sound.ee.ac.uk)

Experiments have shown that there are viewpoint dependencies which affect the nature of human face recognition. These dependencies support the model that humans represent faces as a set of 2D images which are interpolated between and extrapolated beyond during the recognition process. As faces belong to a specific class of objects we can use our prior knowledge of facial structure to facilitate this interpolation and extrapolation.

This paper describes an eigenspace manifold for the automatic representation and recognition of pose-varying faces. We show that the distribution of faces in this manifold allows us to determine an efficient representation which we can then use to characterise faces in the manifold. Faces are characterised by estimating their appearance at novel viewpoints using knowledge gained from other faces undergoing the same change. We show how the degree of representation affects recognition performance and contrast these results with previous results obtained using the same characterisation with a manual representation.

Finally we draw parallels between our system and reported human behaviour including some neurophysiological evidence for view-specific face representation.

Multiple orientation-selective mechanisms for line-target detection

D H Foster, S Westland¶ (Vision Sciences, Aston University, Birmingham, B4 7ET, UK; ¶ Department of Communication and Neuroscience, Keele University, Staffordshire ST5 5BG, UK; fax: +44 121 333 4220; e-mail: D.H.Foster@aston.ac.uk)

Observers can detect a line element differing in orientation from a background of uniformly oriented line elements rapidly and effortlessly. This kind of efficient target detection is thought to involve the early or preattentive stages of visual processing (Treisman, 1985 Computer Vision, Graphics, and Image Processing 31 156-177). The level of performance depends both on the angle between the target and background elements and on the orientations of the background elements themselves (Foster and Ward, 1991 Proceedings of the Royal Society of London B 243 75-81). In an extended psychophysical study, threshold values of the angle between target and background elements were obtained as a function of the orientation of the background elements. An analysis of these threshold functions suggested the involvement of distinct groups of orientation-selective mechanisms: coarse- and medium-scale mechanisms whose preferred orientations were spaced on average at angles of about 90 deg and 35-50 deg respectively in the frontoparallel plane, and which had common alignments over observers; and also fine-scale mechanisms whose preferred orientations were spaced on average at angles of 10-25 deg, but whose alignments varied randomly from observer to observer, possibly reflecting individual variations in neuronal sampling characteristics. Increasing effective display duration produced a shift in activity from coarser- to finer-scale mechanisms, a result consistent with the notion that the visual system applies a process of limited refinement in detecting the orientations of lines in scenes.

Orientational anisotropy in line-target detection with and without a gravitational reference for orientation

L M Doherty, D H Foster (Vision Sciences, Aston University, Birmingham, B4 7ET, UK; fax: +44 121 333 4220; e-mail: dohertlm@aston.ac.uk)

When observers detect a uniquely oriented line element (`target') among uniformly oriented line elements (`nontargets'), detection performance depends on element orientation (Treisman and Gormican, 1988 Psychological Review 95 15-48; Foster and Ward, 1991 Proceedings of the Royal Society of London B 243 75-81). When displays of line elements are presented briefly, the orientation increment threshold (the difference between nontarget and target orientations that is required for effective detection) is generally lower with vertical or horizontal nontargets than with tilted nontargets. Thus, the variation of orientation increment threshold with nontarget orientation is periodic with a period of about 90 deg. Does this orientational anisotropy depend on the observer's use of the direction of gravity as a reference for orientation? In an experiment to address this question, observers viewed briefly presented displays of line elements while sitting (so that gravity could be used as a cue defining the vertical in stimulus displays) and while supine (lying horizontally, face upwards, so that gravity provided no such cue). Twenty line elements of length 1 deg visual angle were presented in a circular field of diameter 20 deg visual angle. Nontarget orientations were in the range 0, 5, ..., 175 deg from the vertical. The difference between nontarget and target orientations was varied adaptively. Stimulus displays lasting 40 ms were followed by a blank interstimulus interval lasting 180 ms or less and then a random-line mask. With both sitting and supine observers, orientational anisotropy in thresholds was found. So, it appears that the orientational reference frame for early visual processing can be defined without gravitational information, by retinocentric coding or by awareness of the body axis.

Grouping by proximity or similarity?: competition between the Gestalt principles in vision.

P T Quinlan, R N Wilton¶ (Department of Psychology, University of York, Heslington, York, YO1 5DD, UK; ¶ Psychology, University of Dundee, Dundee, UK; fax: +44 1904 433181; ptq1@york.ac.uk)

This paper examines the nature of the psychological processes that underlie the Gestalt principles of grouping by proximity and grouping by similarity. Similarity was defined relative to the principles of grouping by common colour and grouping by common shape. Subjects were presented with displays comprising a row of seven coloured shapes and were asked to rate the degree to which the central target shape grouped with either the right or left flanking shapes. Across the displays the proximal and featural relationships between the target and flankers were varied.

These ratings reflected persuasive effects of grouping by proximity and common colour: there was only weak evidence for grouping by common shape. Nevertheless, both common colour and common shape were shown to over-ride grouping by proximity, under certain conditions. The data also show that to understand how the Gestalt principles operate, it appears necessary to consider processes that operate within and between groups of elements that are initially identified on the basis of proximity. Whether such groups survive further analysis depends critically on the featural content of the constituent elements.

The spatial frequency tuning of visual contour integration

S C Dakin, R F Hess (McGill Vision Research, Department of Ophthalmology, 687 Pine Avenue West, H4-14, Montreal, Quebec, H3A 1A1, Canada; e-mail: scdakin@vision.mcgill.ca)

This paper examines the mechanism subserving visual contour detection and particularly its tuning for the spatial frequency of contour components. We measured the detection of contours composed of Gabor micro-patterns within a field of randomly oriented distractor elements. Distractors were randomly assigned one of two spatial frequencies and elements lying along the contour alternated between these values. We report that the degree of tolerable spatial frequency difference between successive contour elements is inversely proportional to the orientation difference between them. Spatial frequency tuning (half-width at half-height) for straight contours is around 1.3 octaves but, for contours with a 30° difference between successive elements, drops to around 0.7 octaves. Integration of curved contours appears to operate at a narrower bandwidth. Much orientation information in natural images arises from edges, and we propose that this narrowing of tuning is related to the reduction in inter-scale "support" that accompanies increasing edge curvature.

Contributions of parvo- and magno-signals to cortical responses: selectivity limits.

J J Kulikowski, A G Robson (Visual Sciences Lab. (Mill/G.126), UMIST, P.O. Box 88, Manchester, M60 1QD, UK; fax: 0161-200 4433; e-mail: ymum21@umist.ac.uk)

In a series of studies we have established that low-contrast, low-spatial-frequency gratings selectively activate parvo/magno streams. Responses to the onset of chromatic (isoluminant) gratings are sustained and parvo-like, whereas responses to coarse achromatic (luminance modulated) gratings are transient and magno-like. In this study we further investigate the contrast-dependence of selective stimuli by comparing human visual evoked potentials (VEPs) with VEPs, field potentials, multi-unit and single unit activity (in macaque monkey).

Comparison of VEP responses to a grating presented on-off and reversed in contrast can provide an index of parvo/magno involvement: responses to low chromatic contrast are sustained, but at higher contrast transient components are generated. Isoluminant stimuli can elicit transient-type field potentials which represent residual magno activity. Conversely, the higher the achromatic contrast the greater the asymmetry between on- and off-components reflecting non-transient activity. Thus "selective" stimulation of a mechanism is revealed by its relative response amplitude with respect to the activity of another mechanism.

Temporal tuning of the reversal response is a stricter test of selectivity. Low-pass is characteristic of parvo-activity and band-pass tuning reveals transient-type magno-responses. Isoluminant red/green gratings generate low pass VEPs only up to a contrast increment of 0.1 (above this contrast magno-contribution becomes significant and temporal tuning becomes band-pass). Thus reversal presentations of higher contrasts should be checked for selectivity.

These observations are consistent with single-unit recordings in macaque striate cortex: recorded responses are sustained (low-pass) only for cells responding to low chromatic contrast, and transient (band-pass) only for cells sensitive to low achromatic contrast. Cells with higher contrast thresholds are common and have intermediate responses, consistent with recent demonstrations of dual P/M inputs to some chromatic cells (Vidyasagar, Kulikowski and Dreher, Australian Neuroscience meeting January 1998).

Posters

First and second order coding of orientation: evidence from the tilt aftereffect

R Anderson, M A Georgeson (School of Psychology, University of Birmingham, B15 2TT, UK; e-mail: R.Anderson@bham.ac.uk)

We used luminance-modulated (LM) and contrast-modulated (CM) gratings to test (a) whether a second-order tilt aftereffect (TAE) exists, and (b) whether LM adaptation can induce an aftereffect on CM gratings and vice versa—a crossover effect. Gratings were 1 c/deg with contrast set at a fixed multiple of individual subject thresholds (8x threshold for adapters, 5x and 8x for test gratings). Adapters were tilted +/-15 deg from vertical; carrier pattern was 2-D static, binary noise with 45% contrast. Initial adaptation was for 3 minutes, topped up for 2 seconds between 150 msec test presentations. The experiment was controlled by an interleaved staircase procedure that measured the TAE by nulling it, hence making a tilted test grating appear vertical. In a single-adaptation condition, subjects adapted and tested to LM or CM gratings. In two double-adaptation conditions, the LM and CM gratings were superimposed at opposite orientations ("plaid") or at the same orientation ("compound grating"). With CM adaptation and CM test we found a second-order TAE, of similar magnitude to the first-order TAE. A crossover TAE (from LM to CM or CM to LM) was also found. These findings can be interpreted in terms of a 2-stage model of orientation coding (Anderson and Georgeson, 1997 Perception 26 (supp.) 122), in which cue-specific filters (LM or CM; stage 1) are later integrated.

The effect of contrast polarity on the shift in perceived edge location in 2D blobs with blurred edges (Unpublished)

E Claridge, M Jefferies (School of Computer Science, The University of Birmingham, Birmingham, B15 2TT, UK; e-mail: E.Claridge@bham.ac.uk)

It is generally accepted that in the presence of blur the perceived edge location is not coincident with the zero crossing of the second spatial derivative, but it is shifted towards the area of lower luminance. The magnitude of this shift is a function of blur and contrast, and various models and explanations have been proposed (e.g. Georgeson and Freeman, 1997 Vision Research 37 127-142; Mather and Morgan, 1986 Vision Research 26 1007-1015). Most experiments were conducted on one-dimensional stimuli. In our work, concerned with modelling of edge localisation in skin lesions (dark on light background), we observed that the edges drawn by clinicians tended towards the brighter side, which surprised us. Subsequent controlled experiments confirmed these observations. We used 2D blobs with varying blur and contrast and both polarities. Whereas for the bright blobs the results were consistent with the results of earlier work, for the dark blobs the boundaries were displaced outwards, in the direction of the lighter background, but by smaller amounts than in the corresponding blobs of the opposite polarity. We would like to present the results of the experiments with the hope of receiving feedback which would direct our further research towards a computational model which correctly predicts the perceived edge location in blobs present in complex medical 2D images.

Linear time-varying receptive fields explain complex-cell behaviour

M A Garcia-Perez (Departamento de Metodologia, Facultad de Psicologia, Universidad Complutense, Campus de Somosaguas, 28223 Madrid, Spain; fax: +34 1 394 3189; e-mail: psmet04@sis.ucm.es)

Unlike simple cells, complex cells in mammalian visual cortex appear to be nonlinear mechanisms lacking a structured receptive field (RF), and different complex cells display a variety of mutually inconsistent behaviours. Current complex-cell models postulate nonlinear interactions among multiple simultaneous afferents, but none explains the variety of complex-cell behaviours. We have explored an alternative scheme where these afferents are sequential rather than simultaneous, with a sequence that repeats cyclically over time. A cell receiving afferents this way can be described as if its spatial RF changes shape cyclically over time. Different time-varying RFs arise when the number of afferents, their individual spatial RFs and/or their sequence varies, and we present simulation results showing that they exhibit all reported varieties of complex-cell behaviour. Our results suggest a common linear functional description for simple and complex cells. Nonlinearities are not required to explain complex-cell behaviour.

Spatial probability summation for detecting four patches of grating

T S Meese, C B Williams¶ (Vision Sciences, Aston University, Aston Triangle, Birmingham, B47ET, UK; ¶ McGill Vision Research, Dept of Ophthalmology, McGill University, 687 Pine Av. West [H4-14], Montreal, Quebec, H3A 1A1, Canada; fax: +44 121 333 4220; e-mail: t.s.meese@aston.ac.uk)

When components of a compound pattern stimulate different visual mechanisms, psychophysical performance typically improves by a small amount consistent with probability summation. In spatial vision, compound stimuli have been made from (a) components with different spatial frequencies or orientations and (b) gratings with variable spatial extent (Graham, 1989 Visual Pattern Analysers Oxford). In (a) the number of components in the compound is usually less than four, and in (b) full psychometric functions are rarely, if ever, shown, and interpretation is difficult because sensitivity is not always uniform across the stimulated region. We overcame these previous limitations in the following way. Using a 2AFC technique and the method of constant stimuli, we measured psychometric functions for detecting (i) single 3.75 deg circular patches of 1 c/deg grating centred on each of four corners of an imaginary square surrounding the fixation point and (ii) a compound in which the four grating patches were presented simultaneously. The centres of the patches were 4.24 deg from the fixation point and the sine-phase gratings were windowed by a raised cosine function to remove sharp luminance borders. Stimulus duration was 100 ms. Individual psychometric functions were derived from between 1600 and 2000 trials and the four component-functions were combined probabilistically, producing excellent predictions for the four-patch compound. These results are consistent with probability summation between four independent detectors sensitive to one of each of the four stimulus patches.

This work was supported in part by NSERC of Canada (OGPOO 46528) and MRC of Canada (mt 10818).

Recognition of faces and facial expressions in central and peripheral vision

R Nemeth, M Wright, A Green (Department of Human Sciences, Brunel University, Uxbridge, Middx, UB8 3PH, UK; e-mail: Robert.Nemeth@brunel.ac.uk)

Ten subjects performed a forced-choice identification of stationary 512x512 pixel 12-bit greyscale images of 10 famous faces (5 male and 5 female film stars) presented for 1 sec in central vision and at four eccentricities along the horizontal meridian. The magnification of the peripheral stimuli was increased in accordance with published M-scaling data for contrast sensitivity by varying the viewing distance as follows: 0 deg = 5m; 3 deg = 2.5m; 10 deg = 1m; 20 deg = 0.61m; 40 deg = 0.27m. It was found that recognition decreased strongly with eccentricity despite the scaling. Performance also decreased in progressively degraded images (256x256, 128x128, 64x64 and 32x32 binary images) but there was no significant interaction between eccentricity and resolution: thus the steep decline of performance in the periphery does not depend on the high spatial frequency or low contrast content of the face image. A further 7 subjects performed forced-choice identification of 5 emotional expressions under the same experimental conditions: absolute performance in terms of total errors made was worse in this task, but the slope of the decrease with eccentricity was similar (see also Wright and Bentley, Perception 19 270-271). It is concluded that the recognition of facial identity and facial emotion are limited by visual processes which decrease with eccentricity more steeply than contrast sensitivity.

The role of functional cerebral asymmetry in the perception of faces

N N Nikolaenko (I M Sechenov Institute of Evolutionary Physiology and Biochemistry, Russian Academy of Sciences, 194223, Torez pr.44, Saint-Petersburg, Russia; e-mail: nikolaenko@ief.spb.su)

It has been found that male subjects make systematic errors in the identification of female faces placed in the Left Visual Field (LVF) i.e. when mediated by the right hemisphere. This is surprising because males are thought to have a right-hemisphere advantage in the accuracy and speed of face identification (Rizzolatti and Buchtel, 1977 Cortex 13 300-305; Young and Bion, 1983 Cortex 19 215-225). One possibility is that the right-hemisphere advantage is specialised for own-gender faces. In this study, it was found that, in male subjects, the estimation of profile orientation, gender, and identification of personality type was significantly faster for profiles presented in the LVF than in the RVF and that these effects were stronger when male profiles were used. In females, no significant difference in face identification thresholds for the LVF and RVF was found. In the LVF, mean recognition time for profile orientation, gender, and personality type were much longer for females than for males. In addition, in males, the frequency of personality identification errors in the LVF was less than in the RVF, even with an increase in presentation time. Thus, in males, there is an advantage of the right hemisphere in mediating accurate identification and processing of profiles, particularly male profiles. In contrast, in females, the numbers of errors in the LVF and RVF were about equal. The error distribution curves for both visual fields were similar, and the number of errors changed rather little with an increase in presentation time. Such uniformity of responses to stimuli presented in the LVF and RVF suggests the equal participation of the right and left hemispheres in face identification in females. Analysis of the personality identification errors showed that systematic errors predominated in the LVF, although it is in this visual hemifield that identification is very accurate. The direction of identification errors was found to be different in males and females: females were more often mistaken in the recognition of male faces, and males in the identification of female faces.

A new slant on the tilt aftereffect: orientation aftereffects on global perception of heterogeneous line-segment textures

H L O'Donnell, J E Raymond (School of Psychology, University of Wales, Bangor, Wales, LL58 2DG, UK; e-mail: h.l.odonnell@bangor.ac.uk)

After viewing a row of obliquely oriented lines (inducer), observers commonly report that a row of vertically oriented lines appear tilted in the opposite direction to the previously viewed inducer. This is known as the tilt aftereffect (Appelle, 1972 Psychological Bulletin 78 266-278) and is thought to reflect adaptation or fatigue processes of orientation selective visual neurons. We have previously shown that orientation sensitivity can be quantified using a texture coherence paradigm similar to that used to study sensitivity to global motion in random dot kinematograms. Using heterogenously oriented line segment textures, we obtained global orientation thresholds by determining the minimum percentage of coherently oriented line segments (amidst randomly oriented "noise" lines) required for just correct global orientation judgments. We asked whether pre-exposure to 100% coherently oriented textures would cause alterations to orientation sensitivity consistent with the tilt aftereffect. Since analogous effects in motion can be produced with very brief exposure to inducers (Raymond & Isaak, 1998 Vision Research 38 579-589), we asked whether global orientation aftereffects could also be produced with brief inducers. We presented observers with a 200 msec texture patch (inducer) composed of short line segments oriented horizontally or vertically. After 200 msec, a similar but partially-coherent texture (test) was presented for 180 msec. The percentage of coherently oriented segments in the test patch was varied from trial to trial and observers reported the global orientation of both inducer and test textures. Mean orientation coherence thresholds were significantly elevated when test orientations matched that of the inducer and were reduced when test and inducer were orthogonal. These data demonstrate that viewing briefly presented, homogeneously oriented textures dramatically changes subsequent global orientation sensitivity to similar, partially coherent textures depending on the orientation relationship between them.

Blur discrimination in the chromatic and luminance domain

H C Owens, S Westland, S Wuerger (Human and Machine Perception Research Centre, Department of Communication & Neuroscience, Keele University, Keele, ST5 5BB, UK)

When a red-green sinusoidal grating is presented to observers with normal colour vision, observers usually report that the sinusoidal grating looks more like a square wave than a sine wave. In contrast, a sinusoidal luminance grating of the same spatial frequency is perceived veridically as a sine wave. This phenomenon was first noted by Helmholtz (1909), and has been reported by others (e.g. Mullen, 1982 Journal of Physiology 332 14P), but to our knowledge it has so far not been studied systematically.

We investigated a phenomenon related to chromatic sharpening, namely blur discriminability, in the various colour directions. We measured blur thresholds for square-wave gratings modulated either along the red-green, luminance, or yellow-blue colour directions. We used a 2IFC procedure to assess the amount of blur tolerated. One interval contained the square-wave stimulus and the other contained a blurred square-wave stimulus. We asked the subjects to identify the interval which contained the square-wave stimulus. Stimulus blur was varied by convolving the square-wave grating with a Gaussian mask of varying standard deviations. Blur thresholds were defined as 80% correct response. Blur thresholds for red-green and luminance stimuli were very similar (approx. 1 minute of visual angle). These results are consistent with the idea that the luminance and red-green channels have similar spatial properties. The blur thresholds for yellow-blue were much higher (between 2 and 8 minutes of visual angle) than for the other two directions. A more robust understanding of human spatiochromatic image processing may lead to more efficient image compression techniques.

No use for rose-coloured glasses: A study on the effect of colour on transient system activity with implications for dyslexia research

K Pammer, W Lovegrove¶ (The Australian National University, Canberra, ACT, Australia; ¶ The University of Wollongong, Wollongong, NSW, Australia; e-mail: Kristen.Pammer@anu.edu.au)

Results from recent psychophysical studies have been interpreted to suggest that activity in transient/magnocellular pathways is enhanced by short-wavelength blue light (e.g. Williams et al, 1991 Vision Research 31 2017-2023). Based on the premise that many dyslexic children suffer from a transient system deficit, such results have formed a theoretical basis for remedial work into dyslexia utilising coloured overlays, lenses and text. The current study used nornal readers and was designed to explore the proposal that transient system activity is increased by short-wavelength light and attenuated with longer wavelength light. The study adopted metacontrast designs using coloured masks in light and dark ambient conditions, a Turnus apparent motion task on coloured backgrounds, and two measures of visual latency: temporal order judgement and reaction time, using coloured targets. The results demonstrated that colour had little or no differential effect on either the magnitude or speed of transient system activity, other than a decrease in some instances with red light as predicted by physiological research. However, the predicted pattern of results was demonstrated when contrast rather than colour was manipulated in a final metacontrast study. The results suggest, that conclusions regarding the differential effects of wavelength on transient system activity may be premature, possibly reflecting a colour/contrast confound. However, as the current studies were conducted on normal readers, one can not eliminate the possibility that dyslexic readers per se may receive a beneficial effect of colour on reading, either as a result of some sort of visual anomaly in colour processing and/or in reducing visual discomfort.

Depth perception from disparity of contrast envelopes: sorting the facts from the artefacts

A Statham, M Georgeson (School of Psychology, University of Birmingham, B15 2TT, UK; e-mail: A.K.Statham@bham.ac.uk)

In previous work we showed that stereo depth perception for Gabor patches can exploit the disparity of the carrier (first order disparity) and the disparity of the envelope (second order) (AVA Dundee, April 1997). We also modelled the interaction and combination of these two cues (AVA Guildford, September 1997). We consider here the possibility that second-order stereo depends upon vergence movements, or is an artefact of distortion products that introduce first order information about the contrast envelope. Reducing the presentation time from 300 to 150 ms had little or no effect on depth estimates for horizontal Gabor patches, suggesting that vergence is not important. Depth estimates obtained at peak contrasts of 32%, 8% and 4% were all very similar. This rules out the compressive distortion hypothesis, because such distortion decreases as the square of contrast and so would be negligible at low contrasts.

We then investigated what cues in the contrast envelope were being used for depth perception, by presenting horizontal Gabors and filtered noise patches with correlations of +1 and -1 between the eyes. Depth perception was possible, though much more variable, for the negatively correlated patches that contained no matchable first order features. With -1 correlation there was a bias to respond "back" more often, equivalent to approx 1/4 cycle shift of disparity for the Gabors and 1/2 cycle for the filtered noise. When envelope cues are weak, unmatched features may act as half-occlusion cues (Shimojo & Nakayama, 1994 Vision Research 34 1875-1881; Anderson, 1994 Nature 367 365-368) that drive the percept back so that the response is consistent with a surface seen behind an aperture.

Representational momentum and the human face.

I M Thornton, J J Freyd¶ (Cambridge Basic Research, Nissan Research & Development, Inc., Cambridge, Massachusetts, 02142, USA; ¶ Psychology Department, University of Oregon, Eugene, Oregon, USA; fax: 001 617 374 9697; e-mail: ian@pathfinder.cbr.com)

Faces are complex objects. One dimension of this complexity that is often ignored, at least in respect to processing facial identity, is the fact that faces move. That is, when we smile, speak or gesture approval, our faces change shape. How do such non-rigid transformations influence the way we perceive and represent specific faces? Does motion simply add noise, or could a transforming face actually enhance processing in some way?

Thornton and Kourtzi (1997 IOVS 38 (Supp.) 1004) recently reported performance advantages in a sequential identity matching task when the first image of a face was a short video clip versus a single still frame. The current work explores whether such advantages might arise due to the creation of what Freyd (1987 Psychological Review 94 427-438) has termed dynamic mental representations. Evidence for these structures has come mainly from representational momentum studies in which the final position or configuration of a transforming object is misremembered as being further forward in the direction of motion or change.

Can we find evidence for similar memory distortions when video clips of facial expressions are used? That is, when shown a dynamic smiling or frowning face, will participants misremember the stopping point as being more intense than the true stopping point? Results indicate no evidence for such forward memory bias. Rather, a consistent bias back towards the starting point of the video sequence was observed, suggesting that face-specific dynamic constraints may be overriding more general representational factors (e.g. motion anticipation) during performance of this task.

Temporal processing of spatial alignment information across gaps

S Waugh (Vision Sciences, Aston University, Aston Triangle, Birmingham, B4 7ET, UK; fax: +44 121 333 4220; e-mail: S.J.Waugh@aston.ac.uk)

The effects of exposure duration on alignment thresholds for separated lines were investigated in this study. Alignment thresholds for equally visible lines (i.e. lines a constant multiple above contrast detection threshold) separated by 1.5 deg, were measured for different exposure durations (15-1000 msec). This condition was interleaved with those where the stimulus was closely followed by spatial noise masks. Alignment thresholds for equally visible stimuli presented without a postmask improved with increasing exposure duration as a power function with an exponent of approximately -0.1. However, when the stimulus was closely followed by an appropriate spatial mask, thresholds improved as a power function with an exponent close to -0.5. Experiments did not show a significant change in the spatial scale characteristics of the most effective mask across exposure duration. The improvement in alignment thresholds over time is most likely a result of a square-root temporal integration process occurring for alignment calculations made between separated targets.

Contrast- and luminance-modulated masking as a function of spatial frequency and phase.

A Willis, J M Harris (Centre for Neuroscience, University of Edinburgh, Crichton Street, Edinburgh, EH8 9LE, UK; fax +44 131 650 4579; e-mail: Ola.Willis@ed.ac.uk)

Previous studies have shown that a low-spatial-frequency contrast-modulated grating (beat), composed of two high spatial frequency sinusoids, severely masks the detection of a sinusoid at the beat frequency (Henning, Hertz & Broadbent, 1975 Vision Research 15 887-899). This finding could suggest that channels tuned for low and high spatial frequencies are not completely independent. Alternatively, the masking could arise as a result of an early nonlinearity injecting a distortion product at the beat frequency. If so, the effects of masking by a beat should be similar to those of masking by a `real' grating at the same frequency. Here, we explore the effects of masking by a beat and a `real' grating on the detection of test gratings as a function of both spatial frequency and phase.

Contrast thresholds were measured for a range of test spatial frequencies (0.25 - 4.0 c/deg) in the presence of a masking grating. The mask was either (a) a contrast modulated grating with a beat frequency of 1 c/deg and components of 8 and 9 c/deg, each with a contrast of 20% or (b) an 8 c/deg sinusoidal grating of 20% contrast plus a 1 c/deg grating of 3% contrast. The phase of the test grating was randomised at the start of each trial. Under some conditions the effects of masking by beats and `real' gratings were similar. Specifically, both the beat and the `real' grating produced robust elevations of contrast threshold about the beat frequency (see also Smallman and Harris, 1995 Perception 24 127). However, not all the data lend support to the simple distortion product hypothesis: the detection of test gratings in the presence of a real masking grating is dependent on phase, but detection is phase-independent when masked by a beat of the same frequency.

Motion in depth and nonrigid motion of subjective contours.

M J Wright (Department of Human Sciences, Brunel University, Uxbridge, UB8 3PH, UK; e-mail: Michael.Wright@brunel.ac.uk)

Using computer animation techniques it is demonstrated that a Kanisza square can be made to rotate in depth or advance and recede. The square is seen as a surface moving in front of the background dots. The subjective contours of a Kanisza square rotating in depth can appear slanted or tilted in depth even without stereoscopic cues. The effect depends upon using perspective projection. If perspective cues are removed from the rotating square, the percept becomes ambiguous, and it can be seen either as a square rotating in depth or as a flat rectangle undergoing nonrigid motion (squashing and stretching). Replacing either the inducing dots or the background with an equiluminous random texture almost completely abolishes the subjective contours when stationary but they reappear when the square is in motion.

However, a rotating Kanisza square composed entirely of second-order motion contours (accretion and deletion of texture) does not show contour completion over a textured background. It is concluded that motion and perspective can give rise to subjective surfaces slanted or tilted in depth, and to non-rigid as well as rigid surfaces. The effects are as strong or stronger than with stationary figures.

Mapping colour- and brightness contrast over the visual field

S N Yendrikhovskij, A M Shamshinova¶, L.I. Nesteryuk¶, E.N. Sokolov# (IPO, Institute for Perception Research, PO Box 513, 5600 MB Eindhoven, The Netherlands; ¶Moscow Helmholtz Institute of Eye Diseases, Russia; #Moscow State University, Russia; fax: +31 40 2431930; e-mail: yendrik@ipo.tue.nl)

If brightness contrast is set to be equal for two coloured stimuli A and B presented in the fovea, would it remain equal when these stimuli are presented in other regions of the visual field? Results of this study with an original method of equalizing brightness contrast suggest that the answer is probably `No'.

Our method is based on recording of reaction time (RT) to coloured stimuli of various luminances presented against a constant coloured background. If stimulus and background colours are the same, than a stimulus with a brightness value equal to a brightness value of the background is invisible and undetected (RT is infinity). But if the colours of the stimulus and the background are different, the equalization of the brightness contrast leads only to partial impairment of detection (RT is maximum but not infinity).

Thus, the maximum RT can be used as an indication of the minimum brightness contrast. In experiments applying this criterion to stimuli presented in various areas of a computer display we have obtained `maps' of retinal sensitivity to different combinations of colour and brightness contrast. The analysis of these `maps' demonstrates that the luminance value at which the RT reaches its maximum differs significantly between the fovea and parafoveal and peripheral regions.

The direction and size of the discrepancy depend on stimulus and background colours, stimulus size and background luminance. Results of a clinical study with 28 normal subjects and 320 patients (hereditary and acquired colour defects, amblyopia, albinism, glaucoma and Stargardt's dystrophy) have shown that these `maps' exhibit characteristic patterns for different eye diseases.

   
webdesign by ablen