![]() |
Promoting vision research and its applications |
|
||||||||
| AVA96 Reading Abstracts
Meeting Abstracts A model for controlling eye movements in multi-fixation search. During multi-fixation search the visual system and the motor system work in close co- operation. During fixation the foveal target has to be analysed, the peripheral field has to be sampled and a new eye movement has to be prepared. We present a schematic model which describes multi-fixation search in large displays. The model is based on recent experimental results. The visual field of the model contains a foveal and a peripheral part. Foveal analysis provides two kinds of information. Firstly, it decides whether the target is present at the fovea or not (WHAT). Secondly, it gives information about the time needed to analyse foveal targets (WHEN). Peripheral analysis provides potential targets for eye movements (WHERE). Peripheral and foveal analysis occur in parallel. Fixation durations are pre-programmed. Thus, the duration of a fixation does not directly depend on the time needed for the analysis that occurs during that particular fixation. Timing of saccades is based on cognitive factors such as strategy, prior experience and estimated difficulty of the foveal analysis. So, foveal search indirectly affects fixation durations, peripheral search does not. In the case of a recognised target the WHAT system sends a signal to the saccade generator to stop the production of saccades. Due to pre-programming of fixation durations the "cancel" signal may reach the saccade generator too late. These late cancellations result in return saccades. The model adequately describes relationships between performance, distributions of fixation durations and occurrences of return saccades in multi-fixation search. Making Sense of Sensors: An Exploration into the Effects of Stimuli and Sensor Design on Active Fixation Purposive visual system design currently provides solutions to useful but contrained visually- guided tasks such as determining time to contact or egomotion. Restricting the design process to the visual control of simpler behavioural subtasks such as these will arguably allow the development of a clearer understanding of the kinds of information which are pertinent to more general applied vision systems. However, despite emphasis on the importance of input information as a guide to action, little attention has been given to the design of the active sensor picking up this information, or the fact that careful consideration of its design could greatly enhance efficiency by supplying information in a more useful form for the visual system. The work described here is an example of this purposive/active approach which concentrates on the design of an actively-fixating sensor working with moving random-dot pattern (RDP) stimulus. The use of RDP allows a constrainable (but wide) range of spatial frequency detail to be used which can be manipulated by altering the dot sizes and the degree of gaussian filtering, as well as the inter-frame displacement. The sensor design variation lies in the degree of non-uniform sampling it carries out. By altering the relative size of the central, uniformly-sampling 'foveal' region a design can be created which effectively turns the sensor from a purely uniform-sampling sensor to a anthropomorphic log-polar sampling sensor, with a number of hybrid designs in between. Examining the fixating abilities of each sensor for a range of spatial-frequency-varying RDP, and using psychophysical results from the motion perception of RDP as a guide, allow conclusions to be drawn concerning the most promising sensor design to work in a given visual environment. Results suggests that a pure uniformly- sampling sensor places unneccessarily strong constraints upon the kinds of stimuli which can be reliably tracked, whilst hybrid designs perform better over a wider spatial frequency range. Does the level of attentional processing differentially affect adaptation to moving stimuli? The mechanisms of attention in early visual processing were investigated by studying attentional effects on adaptation to moving stripes. It is already known that, when subjects have to report the identity of changing digits at the fixation point, movement aftereffect duration is reduced compared with when they are not required to process the digits (1). In the present studies, subjects fixated a sequence of changing digits at the centre of a rectangular window of high contrast drifting stripes. The stripes were of one of 2 spatial frequencies: higher (2 c deg.1) or lower (0.5 deg.1). Attentional load was varied by presenting an unchanging zero as the fixation point (normal), or a sequence of digits which the subject had to report them aloud as they changed and selectively to respond to either a 4 or a 7 with a 'YES' (diverted), or a similar sequence without the need to report or respond (passive attention). The duration and velocity (measured by matching) of MAEs were measured as indices of the effects of attention on adaptation. The effects of diverting attention were greater for the higher than for the lower spatial frequency, suggesting that the processing of finer spatial detail requires more attention than of coarser detail. MAE velocity and duration were progressively reduced by increases in attentional-cognitive load. Enlarging the blank window around the fixation digits reduced the effects of all attentional conditions, suggesting that the attentional effects are strongest close to the fixation point. Control experiments showed that the effects of reporting the digits on adaptation did not reflect unusually precise fixation, since moving the fixation digit randomly to different locations reduced rather then enhanced MAEs. The results confirm that diverting attention can affect motion-sensitive mechanisms which are thought to occur early in visual processing, and suggest that withdrawal of attention preferentially affects higher spatial frequency processing. 1. A. Chaudhuri (1990) Nature, 344, 60-62. The additive effect of higher-order influences on the visual accommodation response The visual accommodation response appears to be driven by both visual (blur) cues and also higher-order influences such as perceived distance and mental effort. The aim of this study was to examine whether the two higher-level influences mentioned above have an additive effect on the accommodation response. Perceived distance was manipulated by superimposing one stimulus on top of another so that occlusion cues would suggest that the superimposed stimulus was closer. A mental processing task (subtracting seven from a two digit number) was also used that could be dependent on the superimposed stimulus (the information was presented visually) or independent of it (the information to be processed was presented aurally). The data suggest that both the superposition of stimuli and the mental processing task could, independently, affect the level of accommodation and that, when the two elements were present together, the effects on the accommodation response were additive. The data further suggest that the combined effects of different higher-order influences may be mediated by attentional factors. Spatial filters, zero-crossings and the tilt aftereffect. The perceived structure of plaids suggests that oriented edges in these 2-D images are found at the location of zero-crossings (ZCs) in a neural response array ('neural image') which is formed as the sum of the outputs from different oriented filters (1). The summation across oriented filters emulates circular filtering and the model behaves like the Marr-Hildreth model (2), correctly extracting vertical and horizontal ZCs from plaids that contain only oblique Fourier components. This contrasts with the orthodox view that orientation is encoded by the peaks of activity across a set of oriented filters (3). This view makes incorrect predictions for plaids, but accounts well for tilt illusions and the tilt aftereffect (TAE) where the response distribution is shifted by adaptation (4). On the ZC account, however, no TAE is predicted because vertical ZCs would remain vertical in the neural image, no matter how the sensitivity of the filters was affected by adaptation. A revision of the ZC model is clearly required. After adapting to gratings, we found that the ZCs of a plaid appeared to tilt, or shift in their relative positions, in the way that would occur if the Fourier components of the plaid were physically rotated. That is, both the orientation and position of parts of the plaid's components appear to be shifted by adaptation, and the global change in ZC pattern is a direct consequence. We outline a model with the following stages: (i) Oriented spatial filters compute a Gabor or other wavelet transform of local, overlapping patches of the image; (ii) Adaptation distorts this transform (as in the orthodox account), and hence distorts the geometry of each patch; (iii) After inverse transformation, the patches are summed together across space to form a neural image, with adjustment of patch positions to maximize the contrast energy (and hence the overall coherence) of the neural image; (iv) ZCs are found in the neural image. In this model adaptation alters only the sensitivity of the tuned filters, but because of the positional adjustment (or 'jiggling'), distortion of the local patches is expressed in the global ZC pattern. The natural role for 'jiggling' may be to correct for spatial error (receptive field 'scatter') in the mapping from retina to cortex (5). 1. Georgeson, M. A. Proceedings of the Royal Society B 249, 235-245 (1992). The Linearity of Early Visual Analysis Early stages of visual processing are often modelled as linear, although nonlinearities are known to exist. A central issue concerns the stage at which such nonlinearities become significant. Subjects were adapted to a high contrast sinusoidal grating followed by a brief presentation of a contrast modulated test pattern. Thresholds for the detection of the contrast modulation (the beat) were measured. Results show that threshold elevation depends significantly on differences in orientation and spatial frequency between the adapting grating and the principal Fourier frequency (carrier) of the test pattern. Moreover, our data also suggests that the mechanism that mediates contrast detection is itself spatial frequency tuned with a peak tuning frequency at around 0.425 cycles per degree. The data are consistent with a model in which the contrast beats are processed by a frequency-specific manner, after an initial stage of frequency- and orientation-specific linear filtering. Incorporating models of early vision processing into virtual reality Recent models of early visual processing espouse a computational approach to vision. Such computational theories can in principle be implemented on a computer system to demonstrate their utility and efficacy. However with the development of computer graphics technology, another form of computer simulation can be envisioned: that is, to visually experience the effects of a model. Thus, a model of human early visual processing has been incorporated into a real-time, 3D graphics system; commonly referred to as virtual reality (VR). Specifically, it will be demonstrated how models of spatiotemporal contrast sensitivity can be encoded efficiently into a VR system. With this knowledge, the VR system can then decide to remove certain regions of detail from a scene which it believes to be sub-threshold (based upon attributes of the model such as its velocity, eccentricity, distance, orientation etc.). Effectively, by degrading the sensations which are presented to the visual system, we can investigate the information that is available to the higher vision processes. Also, because the inherent latencies of contemporary VR systems are aggravated by the amount of detail in a scene, this also has the potentially beneficial effect of improving the interactivity of VR technology. However, it is hoped that the development of this kind of tool will not only produce a more efficient interactive graphics system, but also provide a flexible mechanism to evaluate models of threshold vision in a simulated, real-world and full-colour environment. Modelling retino-cortical image-conditioning to achieve automatic recognition Many attempts have been made to achieve automatic recognition. Most of the methods attempted, usually in the fields of physical optics or computer-based image processing, were rigid and mechanistic. This inevitably led to difficulties with scaling or orientation, or the techniques otherwise lacked tolerance of variable imaging conditions. In considering the demands upon a recogniser, we concluded that the Self-Similar Stack model of human vision (1) is ideally configured as the front-end image conditioner for an automatic recognition system. The salient reasons for this are, firstly, that this model, which mimics some of the requisite mechanisms of human retinal vision, is simple to program recursively. Secondly, it acts as an automatic normalisation device that feeds the conditioned signals into arrays of small neural networks. We have found that such conditioning is necessary for the stable operation of such simple nets. These nets act in a similar way to the primary visual cortex, which also operates on similarly conditioned neural signals, and selects arrays of primitive features in natural images. The system, as constructed so far, will be described and its activities illustrated by a comparison of the input image features and the output coding of those features. The results of some parametric optimisation experiments will be shown, together with assessments of the efficiency of the feature extraction processes. Full recognition has yet to be achieved, as the system is still incomplete, but we will briefly discuss the proposed way ahead. 1. Burton, G.J., Haig, N.D. & Moorhead, I.R. (1986) Biological Cybernetics, 53, 397-403. The functional role of neuronal synchronization in a large-scale network model of visual processing We present a neurobiologically motivated network model that specifies in detail plausible computational mechanisms for the segmentation, selection, and recognition of objects. A major goal of this work is to analyze putative functions of neuronal synchronization within a large-scale network model performing complex visual tasks. The model consists of 12 simulated modules which are organized into two main processing routes, the 'where-pathway' dealing with location information and the `what-pathway' computing the shape and attributes of objects. Invariant object recognition is supported through the segmentation of the visual field into distinct entities. In order to represent different segmented entities at the same time, the model uses a powerful oscillatory binding mechanism. Connections between the where- pathway, the what-pathway and short-term memory modules lead to a flexible cooperation between different functional subsystems resulting in complex overall behavior including top- down segmentation, texture segregation, spatial and object-based attentional selection, visual search, position- and size-invariant shape recognition, structure-from-motion and attentional selection of different levels of hierarchical stimuli. The model delivers an explanation of many experimental findings made both in physiological recordings from the visual cortex and in psychophysical studies. Neglect phenomena in a neural network model of visual attention and object recognition To see or not to see: on unilateral hysterical blindness We report the curious case of a 27-year-old woman, SB, who reported that over a two week period the vision in her left eye deteriorated from normal to no perception of light. Clinical examination revealed no obvious pathological cause for her unilateral loss of vision. There was no clear evidence either of a psychiatric disorder that may have precipitated hysteria or of any reason that SB may be malingering to gain some advantage by pretending to be visually defective. Visual evoked potentials produced by a reversing checkerboard pattern were normal both in amplitude and latency. Results from a lumbar puncture were normal. An MRI scan did not show any lesion in SB's brain. Optokinetic nystagmus was normal in both eyes. Snellen visual acuity was 6/6 in the right eye but not measurable in the left eye. Similarly, near vision was not measurable in the left eye but N5 in the right eye. Luminance and chromatic sensitivity was assessed psychophysically by measuring increment thresholds to white and red flashes of light, respectively. Motion sensitivity was assessed by measuring motion coherence thresholds for moving random dots. A two-interval forced-choice procedure with the method of constant stimuli was used and data were plotted as a psychometric function. Results from SB's right eye were normal with data for correct responses at approximately 50% (chance performance) for low-intensity stimuli rising to 100% correct for high-intensity stimuli, corresponding to stimuli below and above threshold respectively. Data from SB's left eye for low stimulus intensities were approximately 50% correct as would be expected. For high stimulus intensities, there was evidence that performance was much less than chance. This result for high stimulus intensities is consistent with SB giving incorrect responses suggesting a diagnosis of unilateral hysterical blindness. Furthermore, the strongest evidence to suggest that this was a case of hysterical blindness was that SB had functioning stereoscopic depth perception as assessed by the Lang Stereo Test and TNO Test for Stereoscoptic Vision: SB was able to detect disparities as small as 120 sec arc. Theoretical mechanisms of hysterical blindness will be discussed. Extension of the mLGN hypothesis of dyslexia? Dyslexia is a developmental disorder affecting up to 10% of the population who, despite adequate intelligence, education and opportunity, have a specific disorder in reading. In addition to the evidence that phonological deficits underpin some aspects of dyslexia there is also increasing evidence that some dyslexics have low level visual deficits, for example in coherent motion perception. Correspondingly in the magnocellular layers of lateral geniculate nucleus (mLGN) of dyslexics one sees smaller cells and less organization than in the mLGN of control brains. However, it is difficult to relate the simple properties of mLGN cells to the complex requirements of reading. Given the antomical continuity between a sub-population of mLGN cells and the parietal cortex, one might expect subjects with deficits related to mLGN dysfunction to be impaired on tasks which require the kinds of visuospatial attention to which parietal cortex contributes. To examine this possibility we tested three groups of subjects on eight visual search tasks: a control group of normal reading ability; a dyslexic group with elevated motion coherence thresholds (MD); a dyslexic group with normal motion coherence thresholds (nonMD). On all tasks the MD group were significantly slower than the controls and slower than the nonMDs. The results were not a consequence of simple motor reaction times (RT). On all tasks the MDs showed the same pattern of RTs as the controls and nonMDs (i.e. all groups performed the same tasks as parallel or serial), suggesting that there was no qualitative difference in the strategies used by the groups. On two of the tasks there was an interaction between group and number of distractors. Two tasks were designed to incorporate components of reading operations but the MD's deficits were not exacerbated on these tasks. It appears that there is some continuity, or correlation of deficits in a subgroup of dyslexic subjects (MDs) but it does not necessarily follow that the deficits are related to specific reading disorder. Sensitivity to expressive signals from the human face: Psychophysical and neuropsychological investigations In a 6 way forced choice expression naming task, controls identify the exemplars of the image set with 100% accuracy. S.E., a post-encephalitic patient with bilateral amygdala damage showed a severe impairment in the labelling of fear in the forced choice task, (comparable to that obtained by Young et al using the Ekman faces set) and impaired ability with the expressions of disgust and sadness in a free naming task. However, S.E.'s performance in the psychophysical expression discrimination task was comparable to that of controls suggesting that his difficulties lie in the interpretation of facial expressions rather than in face pattern perception. Hager, J.C. & Ekman, P. (1979) Ethology and Sociobiology 1, 77-82. Would a colour by any other name be found as fast...? Maximally discriminable colour sets are often used in computer displays to facilitate task performance. Such sets are usually derived by ensuring high minimum and high average perceived differences between pairs of colours within the set. These differences are identified by a metric known as (E. If the smallest (E in the set is greater than a critical value, then all discriminations will be equally fast (1). Many algorithms exist to ensure colour sets have the required minimum value (2,3). Ensuring this minimum value in any set of colours becomes more computationally intensive, the greater the set size. Recently it has become apparent that choice-reaction times to colours do not depend only on the physical differences between colours, but also on the names of the colours (4). This points to a possible inadequacy in current colour set production methods since the effect of colour name boundaries is rarely taken into account. Incorporating knowledge of name boundaries in the production of colour sets may mean that they have lower (E values than those based purely on physical colour differences, but have similar or superior performance on name related tasks. In order to investigate this possibility, subjects took part in a colour naming and colour search task. Sets of highly nameable colours were generated from a previous study (5). Reactions to these colours were compared with (E matched sets and maximally discriminable sets in both of which nameability was not constrained. It was found that nameability was an important factor in name based search and identification tasks. The results of the experiment will be discussed in the light of an updated physiologically plausible model (6) linking colour language to colour discrimination. 1. Carter R.C. & Carter E.C. Visual search and color coding. Proceedings of the 23rd meeting of the Human Factors Society, 369-373, (1979). Colour constancy as a function of typical hues Colour constancy differs for various hues (1,2). This is important since colour constancy and hue categories are subserved by different stages of processing (3). We investigated colour constancy for 40 Munsell hues presented in succession, using successive matching, when illumination changed from standard daylight white illuminant C to either tungsten (A) or green G (close to orthogonal on the u'/v' plane). Typical hues (red, yellow, green, blue and violet) were also obtained for all subjects under these illuminants, using the task analogous to testing for unique hues. Colour constancy is locally better for typical, or categorical hues, as though the main colour categories were pivotal reference points. When the degree of colour constancy was conveniently expressed in terms of a relative measure (Brunswick ratio), it showed maxima around the typical hue regions. Subsequently, the dominant wavelengths of hues were estimated. The degree of colour constancy and typical hues could then be compared and correlated with the standard spectral sensitivity curve and the averaged centroid colour obtained using a double 2-alternative forced-choice method (4,5). 1. Troost, J. and de Weert, C.M.M. (1992). Naming versus matching in color constancy. Perception & Psychophysics 50, 591-602. 2. Kulikowski J.J. & Walsh V. (1993). Colour constancy for categorical centroid and border colours. Perception 22, S13. 3.Walsh V., Kulikowski J.J., Butler S.R. & Carden D. (1992). The effects of lesions of area V4 on the visual abilities of macaques: colour categorization. Behav.Brain Res. 52, 81-89. 4. Mullen K.T. & Kulikowski J.J.(1990). Wavelength discrimination at detection threshold. J.Opt.Soc.Am., A7, 733-742. 5. Jordan H. & Kulikowski J.J. (1995). Wavelength discrimination at detection threshold reveals colour opponent mechanisms. J. Physiol. 485, 20P. Estimating surface colour under varying illumination Many computational theories of surface colour estimation have been proposed to account for the visual phenomenon of colour constancy. To deliver good estimations, each theory begins by making simplifying assumptions about the world. The realism of the assumptions determines the value of the theory. Two of the most common assumptions made are that each scene is illuminated by a single light source and that the colour of the light is well described by a linear model. We propose that neither of the assumptions are realistic and assume their opposite in developing a new theory of surface colour estimation. We argue against the idea that each scene is illuminated by a single light source simply because it does not represent the world in which we live. Multiple light sources are the norm. For example, in the outside environment there are two spectrally distinct light sources, the sun and sky, and indoors, light is generally a combination of artificial and daylight illuminants. Moreover in both cases interreflected light (light reflected from other surfaces) contributes to the illumination at each point. We argue against a linear model of light colour not because it is insufficient but rather for the opposite reason; a linear model is infact over general. In particular, we show that while a linear model adequately describes light colour, a non-linear model with fewer parameters is equal to the task. Based on the non-linear model of light colour we develop a new algorithm for surface colour estimation. Simulation experiments demonstrate that the algorithm can reliably estimate surface colour so long as there is at least one surface in every scene illuminated by two lights. Estimation performance is excellent and surpasses, by a large measure, existing computational theories. Spatial separation and the motion aftereffect The motion aftereffect (MAE) reflects both early and late processes in vision: adaptation is local and early, whereas the MAE is dependent on the global structure of the test pattern1. That is, if adaptation is to two moving gratings flanking a stationary central grating, an MAE is seen in the central grating if two gratings surround it, but in the flanking gratings when they are themselves surrounded in the test stimulus. We report two experiments which examined the spatial range over which the early adaptation process operates. MAEs were generated after the method described by Swanston and Wade2: exposure to two peripheral, laterally moving gratings induces an MAE in a central test grating, when three stationary gratings are presented as the test pattern. In Experiment 1 the peripheral test gratings were in the same position as the adaptation gratings, but their separation from the central grating was increased from 0.5 to 5 deg. The durations of MAE did not decline with increasing separation between the inducing and induced gratings. In Experiment 2 the peripheral test gratings were either in the same location as adaptation, or separated by 2.7 or 5 deg from it. The MAE duration decreased monotonically with increasing separation of adaptation and test gratings. The results are interpreted as indicating the envelope of action over which adaptation operates. 1Wade, N.J., Spillmann, L. And Swanston, M.T. Visual motion aftereffects: Critical adaptation and test conditions. Vision Res. in press. The second-order mechanism cannot detect direction at threshold It is well known that for first-order motion (e.g. a drifting grating) direction of motion can be detected at threshold i.e. if a drifting grating is visible at all then its direction is discernible. We have investigated whether this is also the case for second-order motion. We measured thresholds for identifying the orientation and direction of a drifting 1 c/deg sinusoidal contrast modulation. The contrast envelope was shaped in both space and time. Thresholds were measured in terms of contrast modulation depth. Various drift speeds and carrier contrasts were tested. Two different types of carrier were employed: dynamic broadband noise and static high-pass filtered noise (static broadband noise causes first-order artifacts (1)). For both types of carrier, direction thresholds were consistently higher than orientation thresholds (with a static noise carrier the two thresholds were similar, as they are for first- order motion). The modulation depth needed to identify the direction of motion of a drifting contrast envelope is typically around 50% higher than that needed to identify its orientation. This result suggests that second-order motion is detected by a mechanism which cannot identify direction at threshold and which is therefore distinct from that which detects first- order motion. Supported by SERC (Image Interpretation Initiative). 1. Smith, A.T. & Ledgeway, T. Perception, 24, 28 (1995) Detection of motion-in-depth and frontoparallel motion amidst static noise dots Image motion is detected at a very early stage of visual processing. Detecting an object moving in three dimensions is a more complex process, not least because under some circumstances the eyes can provide very different motion signals. For example, if a point moves directly towards the observer it will generate equal and opposite horizontal motions in the two eyes. This study investigated whether the visual system has independent access to the basic monocular motion signals when an object moves directly towards the observer. The stimulus comprised an array of stationary points, randomly scattered throughout a cylindrical volume (axis along the depth-dimension). A centrally positioned bright dot served as the fixation point. Stimuli were shown in two intervals, one of which also contained a single target point moving directly towards the observer. We measured target detection as a function of the number of noise dots in the cylinder. In a control condition, observers viewed a single binocular half-image, in which the dots were positioned in a single plane and the target motion was horizontal. The extent of monocular horizontal displacement was equal in the control and motion-in-depth conditions. For the control condition, the target was easy to detect, even for large numbers of noise points. For motion-in-depth, it was much harder to detect the target, even in the presence of only 8 noise dots. However, when a small amount of horizontal motion was added to the target, it became very much easier to detect. This finding suggests that the visual system does not have independent access to monocular motion signals when they are combined for motion-in-depth detection. Further, the effectiveness of static noise dots in masking the motion suggests that motion-in-depth may be detected using static disparity information rather than motion information. The effect of positional jitter on coherence thresholds in quantised random dot kinematograms Using a 2-AFC direction-discrimination paradigm, coherence thresholds for detecting motion in two-frame random dot kinematograms were measured for human observers and calculated, or obtained by simulation, for the ideal detector. Statistical efficiency depends on the square official threshold/human threshold, and we have found that it approaches 50% under conditions matched to the properties of the motion detecting system (1). These conditions occur when the dots are restricted to positions on a coarse uniform grid ('coarse quantisation'). Using a finer grid has little effect on coherence thresholds for humans, but rapidly improves those for the ideal, so efficiencies fall as the grid becomes finer. We suspect the low efficiencies observed when quantisation is not coarse reflect the inability of the human coherent-motion detecting system to utilise precise positional information. We therefore examined the effect of varying the positional precision of the displays directly. Quantised stimuli that had yielded the highest efficiencies in previous experiments were used, with the difference that the position of each dot was perturbed by a small amount of positional jitter. We hope that the change of coherence threshold with increasing jitter will enable us to estimate the amount of 'intrinsic jitter', or positional uncertainty, in the coherent- motion detecting system, and preliminary results indicate that it is very large. This fits in with the coarse orientation and velocity tuning observed neurophysiologically. 1. Barlow, H.B., Tripathy, S.P. & Baddeley, R. (1995) Perception (suppl.) 24,1. Controlling heading during locomotion: Steering to a different paradigm A question germane to most forms of locomotion is how can we control our direction of motion when we are looking at a point other than where we are heading (1, 2). Linear translation across a ground plane produces a radial expansion of texture elements within the visual field. If an observer fixates a target other than at the focus of expansion, and tracks that target with eye movements, then a rotational component is introduced into the retinal flow field. In such a case there has been disagreement as to whether the optical flow field provides sufficient information to discern the direction of egomotion when the rotational component is large (3-6). There is, however, a further shortfall noted by Nakayama (3) who suggested 'As yet, however, researchers have not gone beyond these psychophysical observations to show that animals actually use this information to perform real locomotor tasks'. Previous studies have used psychophysical judgements of heading direction under passive movement, and the debate has centred upon whether heading judgements are robust in the presence of rotations of the order of 5 deg/s (3-6). The rotation component is a function of depth and orientation of the fixation point and locomotor velocity, such that 5 deg/s would occur when travelling at 5m/s and fixating a target 20m ahead at an eccentricity of 20 deg. We presented human participants with a simulator task where gaze was directed to an eccentric target, and participants were required to actively adjust their locomotor heading to approach this target. The crucial difference between a passive-judgement task and an active- steering task is that, in the latter, participants no longer need to decompose the flow pattern into radial and rotary components, but are simply required to perceive the direction in which they should steer to null the rotary component. We found that participants could adjust their heading to an accuracy of under 1.5 degrees, without access to non-visual information specifying gaze. Of particular interest was the observation that participants were able to steer on course, without encountering rotations of 5 deg/s. This suggests that what is important is that locomotor animals are sensitive to small degrees of rotational asymmetry in the vector flow field that will allow them to adjust heading early during the approach trajectory and hence avoid high degrees of rotation that occur at greater eccentricities. A second finding was that participants were unable to effect this level of control if the rotational asymmetry was, in part, due to motion of the target across the ground plane, suggesting that non-visual information specifying gaze direction may be necessary to steer towards moving targets. These findings question whether the psychophysical techniques used previously (3-6) reflect upon the control of steering in natural contexts, and suggest that a paradigm shift is required for future investigations of the role of vision in the control of animal locomotion. 1. D. Regan & K.I. Beverley, Science 215, 194 (1982) This research was supported by JCI/MRC Grant G911369 and JCI/MRC Grant G9212693. What information are higher-level motion processes receiving ? In the H1 neuron of the fly, the response to a maintained motion stimulus decreases over time while increased sensitivity to variations in speed around the maintained level is reported (1). We investigate the form and time course of motion adaptation, comparing the psychophysical performance of human subjects with existing electrophysiological data on insect vision. Firstly, we investigate the effect of adaptation duration on absolute sensitivity to motion. Subjects are required to perform a 2AFC speed discrimination between a stimulus maintained at a constant velocity and a short burst of a comparison stimulus. Secondly, we investigate the effect of adaptation duration on sensitivity to modulation of velocity around the adapting level. Subjects are required to carry out a 2AFC detection of a brief sinusoidal modulation of the velocity of one of a pair of stimuli maintained at the same constant velocity. For both speed discrimination and velocity modulation detection tasks, psychometric functions were measured for a range of adaptation durations. The perceived speed of a maintained motion stimulus is found to decrease as a function of adaptation duration. Sensitivity to modulation of the velocity of a maintained motion stimulus is found to increase as a function of adaptation duration. The form and time course of the sensitivity changes are comparable with existing evidence of motion adaptation in the fly, suggesting that adaptation to motion in the human visual system performs a similar function to that in the fly. We propose that, in both cases, adaptation serves to improve the transmission of novel motion information along the visual pathways at the expense of maintaining an accurate representation of the unchanging components of the stimulus. By sacrificing absolute sensitivity to motion for relative sensitivity in this way, the adapting system can be seen as devoting a greater bandwidth to the transmission of changes in the motion signal (2). Such a coding strategy would facilitate the computation of higher- order properties of the motion signal through subsequent processing. 1. Maddess, T. & Laughlin, S.B. (1985). "Adaptation of the motion-sensitive neuron H1 is generated locally and governed by contrast frequency", Proc. R. Soc. Lond. B 225, 251-275. 2. Clifford, C.W.G. & Langley, K. (in press). "A model of temporal adaptation in fly motion vision", Vision Res. Thresholds for detecting change in shape Koenderink (1) suggested that the decomposition of optic flow might be accomplished in part by mechanisms sensitive to changing orientation giving an approximation to the rotation component, 'curl' and to changing relative orientation giving an approximation to the deformation component, 'def'. Over the last two years we have presented evidence for specialised motion mechanisms sensitive to changing orientation. Our present experiments, on the other hand, provide no compelling evidence for a special mechanism for analysing changes in relative orientation. One of the simplest stimuli that possess the attribute 'shape' is a plaid consisting of two identical grating components. We may vary the 'shape' of plaid blobs or diamonds by varying the angle between components. In a single-interval, two-alternative, forced-choice procedure, the subject's task, on any trial, was to detect a change in the angle between the components. In order that the task should not be soluble by detecting the rotation of a single component, a random orientation change was added equally to both components. This effectively makes the task into a discrimination between rigid (no change in relative orientation) rotation of the plaid. The subject attends to the 'blobs' or 'diamonds' of the plaid and reports whether they change shape as well as moving. Minimum thresholds were 1-2 deg; i.e. 4-8 times higher than for discrimination of the clockwise or anticlockwise rotation of a single component of the same plaid. The task is rather difficult, the problem being that the 'blobs' or 'diamonds' seem to change shape even when they rotate rigidly. Inspection of the psychometric function confirms this: it is asymmetric; there is a tendency to misidentify rigid rotations as nonrigid. Further experiments confirm that detection of changes in relative orientation is disrupted by common added rotation. This is evidence against a specialised 'def' detector based on orientation change, and against the rotational invariance of dynamic shape changes. The addition of common added translation also affects 'shape change' discrimination in plaids, whereas the detection of component rotation is almost completely immune to translation noise. This may indicate that simple directional mechanisms contribute to the 'shape-change' discrimination task but not the 'rotation' discrimination task. 1. Koenderink, J.J. (1986) Vision Research, 26, 161-180. Masked and unmasked grating detection and orientation discrimination in normal and amblyopic observers The aim of these experiments was to investigate the interactions between spatial frequency channels in normal and amblyopic observers. Grating detection and orientation discrimination performance were compared. In addition the effect of a low spatial frequency mask upon grating detection was examined. All stimuli consisted of monocularly viewed sinewave gratings (75% contrast) of variable spatial frequency. Five normal subjects and five amblyopic observers participated in the experiments. In one temporal 2-AFC task (grating detection), the subject was required to detect the interval in which a grating (vertical or horizontal) was present. A second temporal 2-AFC paradigm was employed (orientation discrimination) in which a vertical grating was presented in one interval and a horizontal grating in the other. The task of the subject in this case was to identify the interval containing the horizontal grating. Grating detection was measured in the presence and absence of a low spatial frequency mask of the same physical contrast and orientation. The spatial frequency of the mask was always one-third the spatial frequency of the test grating, and both the stimulus and the mask had random phase. The reductions in orientation discrimination found for the amblyopic observers were predictable from their reduced grating detection performance. The presence of the mask reduced grating detection performance by 15-30% in all subjects tested. There was no difference in the effect of the mask in the amblyopic subjects when compared to the normals. Amblyopes showed no extra loss in orientation discrimination tasks or in masked grating detection tasks relative to normal observers. These results provide no evidence for abnormal interactions in amblyopic observers. Binocular image integration and stereopsis: A subjective analysis Binocular-rivalry plays a significant part in integrating the two eyes' images to perceive just one. Two hundred and thirty five years ago DuTour (1760) is reputed to have asked "Pourquoi un objet sur lequel nous fixons les yeux paroit-it unique?". With care in following this system of subjective binocular visual analysis, that question can be answered in many, if not in most respects. In 1962 I searched for evidence of end-to-end linkage of the visual fields, using red/green Barlite goggles to view a wide, flat-white featureless ground. With red before the right eye, bi-nasal-retinal dominance was initially apparent, but unstable trans-retinal rivalry supervened centrally, anchored by stable peripheral-temporal fields of the uncorresponded nasal retinae. Other instabilities were noted. Turning the eyes to the right, the nose-contour was either seen, with the binocular field to its left dominated by the left eye, or suppressed, with right eye dominance extending well beyond the midline. Fixating a free-space object, e.g. a pencil, the binocular rivalry was stabilised significantly, though not completely. Crovitz and Lipscomb (1963) reported Köllner's (1914) observations of binocular colour rivalry, including an initial fleeting sensation of binasal hemianopia. They had repeated his experiments tachistoscopically, enhancing and prolonging the effect by foveally combining thin vertical black strips on the coloured lenses. I had noted that, with my single free-space object, overconvergence strengthened bi-nasal-retinal dominance. Underconvergence caused bi-temporal-retinal dominance, each crossed image creating radiating induction, with continued bi-nasal-retinal dominance peripherally. I introduced a second, then a third object in turn, all in the mid-line. The outermost image still radiated dominance outwardly and inwardly, whilst the intermediate ones showed only inwardly radiating dominance. The vertical bisector of a red (right) and green area was observed whilst bi-foveally projecting, with quick side to side saccades, in over- and under-convergence, between and beyond each edge-image. Central-field bi-nasal retinal dominance persisted with bi-nasal uncrossed diplopic edge-contour images and bi-temporal dominance with bi-temporal crossed edge-contour images, the retina imaging the right-hand edge-image dominating more of the binocular ground between the disparate images as bi-foveal projection approached the left- hand image, and vice versa. When the bi-foveal projection point moved beyond either image, that image suffered some suppression, and left the more peripheral image dominant. A complex series of observations followed, with the introduction of an object, first vertical, then tilted, against the vertically bisected red/green background, then both object and edge- contour tilting at different angles relative to one another, from which it was apparent that the term bi-nasal is only appropriate physiologically rather than physically. Panum's Limiting Case exhibited an inherent tendency to unstable stereoscopic signature of the single uncombined line-image. Observations revealed that a monocular image B possesses its own natural stereosign with reference to a binocular fixation object A on the horopter, and depends on the eye in which it is formed. Natural stereosign is assumed to be present when the eye in which the image is formed receives the oculo-motor impulse for transfer of fixation, a normal fixation transfer to that monocular image taking place without change of stereosign. The eye receiving the oculo-motor impulse is assumed, at that time, to dominate the area of binocular-image-ground between that monocular image and the fixation point. When a second, disparate image of B was added, to the same side of, but further away from the fixation object, it imparted its stereosign to the first (inner) image and kept its own. Fixation transfer to either of the disparate images resulted in fusion. With greater disparity, the inner image at times reverted to its own natural stereosign, doubtless due to binocular rivalry changing background dominance. This instability results from un-natural attention factors, and is certainly the cause of the erratic deviations characterising many horopter determinations. The fact that binocular rivalry can change the stereosign of a single monocular image with respect to the fixation point, or that the addition of a second (disparate) image, further removed from the fixation point than the first one, changes the stereosign of the inner one from its natural sign, means that one can consider factors such as aniseikonia, and the Pulfrich Phenomenon in a different light. Contrast invariance in human pattern discrimination It is well known that some pattern discrimination thresholds (e.g. orientation and spatial frequency discrimination) are constant over a wide range of stimulus contrasts. This suggests that at the decision level the signal-to-noise ratio (the mean of the decision variable per its standard deviation) is independent of the contrast of the external signal. This could be explained in two ways. Firstly, the pattern discrimination mechanisms could be sensitive to contrast and be followed by multiplicative noise so that with increasing contrast the noise magnitude increased in the same proportion. Secondly, the pattern discrimination mechanism followed by additive noise could be invariant as to the contrast of the signal but sensitive only to its spatial structure. We tested these hypotheses by measuring pattern discrimination thresholds using stimuli whose contrasts were randomised from one exposure to another. If pattern discrimination mechanisms were sensitive to contrast, the variance of their response would increase resulting in an increase of pattern discrimination thresholds. Thresholds were measured for orientation discrimination of gratings and size discrimination of a facial image as a function of the range of contrast randomisation. The mean Michelson contrast was 0.5 and the randomisation range varied from 0 to ± 0.4. Pattern discrimination thresholds were found to be independent of contrast randomisation in both tasks. This finding, thus, supports the hypothesis that pattern discrimination is performed by mechanisms that code stimulus form independently of contrast. Centroid analysis predicts visual localisation of first- and second-order stimuli The aim of this study was to investigate the characteristics which determine the perceived location for first-order (luminance-defined) and second-order (contrast-defined) stimuli. Perceived Vernier alignment of asymmetric Gaussian-windowed stimuli was measure in 3 experienced and 1 naive observer. The asymmetric Gaussian blobs are similar to convention Gaussian blobs, but have different space constants each side of the vertical midline. These stimuli have the advantage of being continuous in the spatial domain and having well-defined and well-separated characteristics such as luminance peak and centre-of-mass (centroid). The three blobs were identical apart from the fact that the polarity of asymmetry of the central blob was reversed. In order to maintain perceptual alignment, a horizontal shift of the central blob is required, and the magnitude of this shift represents the perceived offset. Results for both luminance- and contrast-defined stimuli are well described on the basis that the centroid of the stimulus envelope represents the spatial primitive which determines perceived visual location. Perceived location of certain types of contrast-defined stimuli was shifted slightly beyond that predicted by the centroid of the Gaussian window. This small departure is probably related to the phenomenon of 'irradiation' in which the perceived size of objects depends not only upon their luminance polarity, but also on their first- or second-order nature. Visual attention over depth and motion in unilateral neglect Depth and motion appear to have a privileged status with respect to visual attention. This is best demonstrated by the findings of Nakayama & Silverman (1) and McLeod, Driver & Crisp (2) on conjunction searches. The selective impairment of perception dependent upon the motion (e.g.3) and depth (e.g.4) attributes of objects provides evidence for the anatomical and functional separation of processing of objects differing along these dimensions. Unilateral visual neglect manifests as a deficit of attention, that is in the absence of visual field deficits, objects in the field contra-lateral to the lesion are ignored. An intriguing interaction between motion and neglect has been demonstrated by Mattingley, Bradshaw & Bradshaw (5) who observed changes in bias on a line bi-section task when moving dots were super-imposed on the test materials. Several simple explanations for such an interaction would be: dragging the eye (optokinetic nystagmus) or attention in the direction of dot motion; the invocation of different perceptual-attentional systems because of the addition of a movement attribute to the test element (relative motion against the background flow); a reduction in attentional bias between the neglected and non-neglected fields due to visual motion acting as common stimulus for attention across the whole test display. The first and third of these maybe investigated by contrasting response to translational and random walk (no global motion) displays. The second through the use of a modified cancellation task: Effects of element density and number have been reported such that the neglect of targets in the contra-lateral field reduces with sparse element displays. If it is possible to group elements by depth or motion attribute and then attend selectively to a particular group then the number of omissions on a cancellation task should drop. Single case study results failed to produce changes in line-bisection bias with superimposed motion. Preliminary results also suggest no reduction of neglect following the division of the elements of stimulus array by adding a motion or depth attribute. Despite the null results, we propose that a principled breakdown of the potential effects of depth and motion is essential to disambiguating findings in this field and the tests outlined afford this level of analysis. 1. Nakayama, K. & Silverman, G.H. (1986) Nature, 320, 264-265. Perception of partly occluded objects by young chicks The world surrounding us is mostly made of opaque objects which partly occlude each other, but human visual perception hardly suffers when objects are partly hidden since the parts that are directly visible usually suffice for recognition of the whole object. The fundamental perceptual process of "amodal" completion (1,2) generates a genuine phenomenal presence of the nonvisible parts. It is unclear, however, whether amodal completion occurs at all in other species: Studies on visual discrimination learning have revealed that animals usually attend to parts and features of the discriminative stimuli rather than to globalobject properties (3). We provide here the first demonstration of recognition of partly occluded objects in a bird species, the domestic chick Gallus gallus, using the naturalistic setting made available by filial imprinting, a process whereby young birds form an attachment to their mother, or some artificial substitute. In a first series of experiments we reared newborn chicks singly with a red cardboard triangle, to which they rapidly imprinted and which they therefore regarded as a true social partner. When chicks were three-days old they were asked to choose between a pair of stimuli composed of either isolated fragments or occluded parts of the imprinting stimulus. Stimuli were equated on features (e.g. area) which the birds might use to make a discrimination. Chicks consistently chose to associate with complete or with partly occluded versions of the imprinting object rather than with separate fragments of it. Similarly, in a second series of experiments, chicks reared with a partly occluded triangle chose to associate with a complete triangle rather than with a fragmented one, whereas chicks reared with a fragmented triangle chose to associate with a fragmented triangle and not with a complete one. Newborn chicks thus appear to behave as if they could experience amodal completion. 1. Michotte, A. (1963). The Perception of Causality. New York: Basic Books. Differences between human scotopic spectral sensitivities in the long-wavelength region Lamb (1) demonstrated over a wide range of species and receptor types a common photoreceptor response when plotted on a logarithmic scale against the reciprocal wavelength, provided this was normalised with respect to the wavelength of maximum sensitivity. He showed this applied to Crawford's averaged data (2) for human scotopic spectral sensitivities. However, Crawford's averages were derived from data for 50 observers, 15 of whom made two sets of measurements by different methods. All these data were published as tables, and when these are examined, it is found that there are considerable differences between the individual responses; in the shapes, the wavelength of maximum response and the slopes in the long-wavelength region. Even when reduced to a common maximum by Lamb's method of plotting, the differences in slope remain. These features are shown in both sets of those data relating to the same individuals, so they probably represent personal characteristics. Why these individual departures from the interesting Lamb model arise remains to be explained. 1. Lamb, T.D., (1995) Vision Res. 35:3083-3091 Visual adaptation to dichoptic brightness disparity induced by artificial anisocoria and neutral density filters Previous work has shown adaptation to interocular brightness differences induced by neutral density filters. The purpose of this study is to investigate the time course of visual adaptation to interocular brightness differences created by artificial anisocoria and neutral density filters. Various levels of anisocoria were created using prosthetic contact lenses (clear pupil, occlusive iris) in subjects previously dilated with 10% phenylephrine. Neutral density filters were employed to produce similar levels of interocular brightness difference under two conditions: (1) with fixed artificial pupils and (2) mobile pupils. Interocular brightness matches were obtained every 15 minutes over a 3 hour period, using a simultaneous interocular brightness matching technique. There was a significant difference in the initial level of interocular brightness difference achieved due to anisocoria and neutral density filters (p<0.01); the brightness differences were smaller in the anisocoric condition. With time the interocular brightness difference caused by the neutral density filters was found to adapt to reach a level similar to that of the anisocoria condition. The induced interocular brightness difference in the anisocoric condition was found to be invariant over the 3 hour test period. Adaptation to interocular brightness differences induced by neutral density filters in this present study concurs with previous reports. The level of adaptation can be anticipated by Fechners' paradox. Brightness matches found with artificial anisocoria are predictable when the Stiles-Crawford effect (Type 1) and Fechners' paradox are taken into account. Computer simulation of interactions responsible for length and width tuning in the striate cortex This presentation describes a simulation of the long-range inhibitory interactions that produce end-stopping, surround-stopping and side-stopping in the primary visual cortex. Unlike previous models, but in line with recent understanding, the approach used here incorporates a divisive rather than a subtractive influence from the receptive field surround, and includes phase independent inhibition from a neural pool with broad orientation and spatial frequency selectivity. This approach is successful at producing realistic tuning for stimulus length and curvature without the problems of false responses or critical dependencies on stimulus position that other researchers have experienced. The simulated interactions also give rise to saliency effects, such as an enhancement of the cortical response to a texture element differing in orientation from its surrounding texture. Additionally, the population orientation estimate given in response to a single pattern element is biased by the presence of suitably oriented surround elements. The resulting orientation tilt illusion can be readily perceived and is reproduced faithfully by the computer simulation. |
||||||||
| webdesign by ablen | The AVA is a registered charity (No: 1049146) |