Trends in Neurosciences
Review
A matter of time: internal delays in binaural processing
Introduction
Spatial hearing offers a unique window on temporal processing in the nervous system. In contrast to the receptor organs for vision and touch, the cochlea does not have an explicit representation of the spatial position of sound sources, because this organ performs frequency analysis rather than spatial analysis. The spatial position of a sound source is computed in the CNS from implicit information sent downstream by the cochlea. It has long been known that the computation of azimuth (horizontal position of a source) is predominantly based on temporal differences between the two ears, but the underlying mechanisms are currently a matter of much controversy.
Sound from sources off the midsagittal plane travels different distances to the two ears and thereby generates interaural time delays (ITDs), both in the arrival time of the stimulus wavefront (‘onset ITD’) and throughout the stimulus (‘ongoing ITD’) (Figure 1d). In humans, ongoing ITDs of low frequencies are the main source of information used to determine horizontal localization of sound [1, 2, 3]. Even the largest ITDs, which occur for sound sources that face one ear, are tiny. Their extreme values (henceforth referred to as the ‘ecological range’) are ±700 μs in humans and ±400 μs in cats, but ITDs can be discriminated at values of 10–20 μs [4]. Considering that the duration of an action potential is ∼50 times longer, this acuity is an intriguing biological feat.
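To give a feel for how head size sets the ecological range, the following minimal sketch computes ITD as a function of azimuth. It assumes the classic rigid spherical-head (Woodworth) approximation, which is not part of the text above, and the head radius and speed of sound are illustrative values rather than measurements from the studies cited here.

```python
import math

def woodworth_itd_us(azimuth_deg, head_radius_m=0.0875, speed_of_sound_m_s=343.0):
    """ITD in microseconds for a distant source and a rigid spherical head.

    Assumption: Woodworth approximation, ITD = (r / c) * (theta + sin(theta)),
    intended for azimuths between -90 and +90 degrees.
    The default radius and sound speed are illustrative, not taken from the review.
    """
    theta = math.radians(azimuth_deg)
    return 1e6 * head_radius_m / speed_of_sound_m_s * (theta + math.sin(theta))

# ITD grows from zero at the midline to its maximum when the source faces one ear.
for azimuth in (0, 30, 60, 90):
    print(azimuth, round(woodworth_itd_us(azimuth)))
```

Under these assumptions the maximum lands in the same range as the human value quoted above, and a smaller radius shrinks the ecological range proportionally.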
Neural sensitivity to ITDs was discovered in the 1960s [5, 6] in the midbrain inferior colliculus and brainstem medial superior olive (MSO), which have binaural neurons whose average firing rate depends on ITD (Figure 1a). Each neuron is tuned to a ‘best delay’ (BD), at which its response is maximal. Neurons differ in their BD, and are maximally excited by sound sources at correspondingly different positions in space. A general finding across studies is a clear bias for tuning to the contralateral hemifield: BDs are mostly at ‘positive’ ITDs, defined as ITDs at which the ear that is contralateral to the neuron is the first to receive the sound. For example, a sound source directly in front of a cat maximally excites neurons on both sides of the brain that have a BD of 0 μs, whereas a source placed to the extreme right will excite neurons on the left (i.e. contralateral) side of the brain that have BDs near 400 μs. For each intermediary horizontal position between extreme right and the midline, there are neurons on the left side that are maximally excited.
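The cat example can be made concrete with a toy tuning function. This is only an illustrative sketch: the cosine shape, the 500 Hz tone and the peak rate are assumptions chosen for simplicity, and `firing_rate` is a hypothetical helper, not a fit to recorded data.

```python
import math

def firing_rate(itd_us, best_delay_us, tone_hz=500.0, max_rate=100.0):
    """Toy ITD tuning of one left-side binaural neuron (assumed cosine shape).

    Positive ITD means the contralateral (right) ear leads.
    The rate peaks when the stimulus ITD equals the neuron's best delay.
    """
    phase = 2 * math.pi * tone_hz * (itd_us - best_delay_us) * 1e-6
    return max_rate * 0.5 * (1 + math.cos(phase))

# Two hypothetical left-side neurons, with BDs of 0 and 400 microseconds.
for source, itd_us in (("straight ahead", 0.0), ("extreme right", 400.0)):
    rates = {bd: round(firing_rate(itd_us, bd), 1) for bd in (0.0, 400.0)}
    print(source, rates)
```

In this sketch the BD = 0 μs neuron responds most to the frontal source and the BD = 400 μs neuron to the source at the extreme right, mirroring the description above.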
These physiological observations, in combination with psychophysical work and an influential qualitative model [7], led to a general framework that seemed congruent with general neurobiological principles and that is commonly referred to as ‘the Jeffress model’. This model holds that populations of binaural neurons are tuned both to frequency and to ITD, and that there is a neural ‘display’ in which these neurons are arranged topographically in terms of the frequency by which they are maximally excited (best frequency, BF) and BD. Sound sources cause activity patterns on this BD–BF plane according to their spatial location and frequency characteristics.
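The idea of a neural ‘display’ along the BD axis can be sketched, for a single frequency channel, as a bank of detectors that each apply a different internal delay to the contralateral input. This is a minimal sketch under stated assumptions: the detector is modeled as a mean product of the two ear signals (a cross-correlation stand-in for coincidence detection), the stimulus is a pure tone, and all names and parameter values (`FS_HZ`, `TONE_HZ`, `BEST_DELAYS_US`, the helper functions) are hypothetical.

```python
import math

FS_HZ = 100_000    # sampling rate (assumed); one sample = 10 microseconds
TONE_HZ = 500.0    # low-frequency tone (assumed)
N_SAMPLES = 2000   # 20 ms, i.e. exactly 10 cycles of the tone

def ear_signals(itd_us):
    """Return (contra, ipsi) waveforms; positive ITD means the contralateral ear leads."""
    itd_s = itd_us * 1e-6
    contra = [math.sin(2 * math.pi * TONE_HZ * (i / FS_HZ)) for i in range(N_SAMPLES)]
    ipsi = [math.sin(2 * math.pi * TONE_HZ * (i / FS_HZ - itd_s)) for i in range(N_SAMPLES)]
    return contra, ipsi

def detector_output(contra, ipsi, best_delay_us):
    """Mean product of the inputs after delaying the contralateral one by the BD."""
    lag = round(best_delay_us * 1e-6 * FS_HZ)
    # A circular shift is acceptable here because the tone spans whole cycles.
    total = sum(contra[(i - lag) % N_SAMPLES] * ipsi[i] for i in range(N_SAMPLES))
    return total / N_SAMPLES

def estimate_itd_us(contra, ipsi, best_delays_us):
    """Read out the display: the BD of the most active detector."""
    return max(best_delays_us, key=lambda bd: detector_output(contra, ipsi, bd))

# Hypothetical array of best delays covering the cat's ecological range.
BEST_DELAYS_US = list(range(-400, 401, 20))

contra, ipsi = ear_signals(300)
print(estimate_itd_us(contra, ipsi, BEST_DELAYS_US))
```

The readout picks the detector whose internal delay compensates the external ITD. A full BD–BF display would repeat this array across frequency channels, which the sketch omits.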
In various incarnations, this general model has dominated the field [8] and is the basis of most models of binaural hearing, even though not all of its components are equally well established. However, new data have spawned alternative ideas, for which we here review the evidence. The existence of a BD–BF plane has been questioned, and there are several competing proposals for the physiological mechanisms that underlie the existence of BDs. Because these controversies mostly concern mammals, we do not cover the extensive work on binaural hearing in barn owls [9].
Section snippets
The Jeffress model and axonal delay lines
ITD sensitivity (Figure 1a) is found throughout the central auditory system, and there is evidence that the sensitivity sharpens between the superior olivary complex and the auditory cortex [10, 11]. How does ITD sensitivity arise, and why is the BD at a positive ITD for most neurons? A low-frequency sound source off the midline (Figure 2a) induces a temporal spike pattern that encodes the stimulus waveform, first in the near ear, followed by a similar pattern in the far ear with a delay that
Distribution of best delays
A bias of BDs to positive ITDs (i.e. tuning to contralateral space) has been a consistent finding in many species and at many anatomical levels. In the cat, the range of BDs is largely restricted to ITDs within the ecological range (0–400 μs, with the full range of ±400 μs subserved by having a left and right MSO) [18, 24]. Surprisingly, the overall distribution of BDs in guinea pigs is similar to that of cats, even though their ecological range is much smaller because of head size [25]. Even more
The inhibitory model
It is well documented that inhibition can underlie or shape ITD sensitivity [34, 35, 36, 37, 38, 39, 40, 41, 42, 43]. The MSO also receives bilateral inhibition [44], which is tightly phase-locked for the contralateral ear [39, 45]. Brand et al. [20] blocked inhibition of both sides in vivo, by iontophoretic application of strychnine. This gave an increase in response rate and a shift of the BD to 0 ms (Box 1). From these observations, Brand et al. concluded that precise inhibition is essential for
Cochlear disparity
Sound vibrations of the eardrum and middle-ear generate a vibration pattern of the cochlear basilar membrane in the shape of a wave that travels from cochlear base to apex. This traveling wave generates delays, so that low-frequency (apical) nerve fibers are activated later than high-frequency (basal) fibers. If binaural neurons receive a perfectly symmetrical tonotopic innervation, these cochlear delays are inconsequential. Schroeder [50] first proposed that asymmetries in frequency tuning of
Localization versus detection
In the debate on internal delays, teleological arguments are often used. Such arguments are difficult to put to experimental test but are important because they touch on the nature of ITD coding. The existence of large BDs in small-headed animals led to the ‘two-channel’ proposal [26] that horizontal sound position is encoded by the overall activity of one side of the brain relative to the other. In this scheme, BDs are positioned such that the steeply sloping part of the ITD-tuning function is
Concluding remarks: the quest for internal delays
The nature of internal delays and coding of ITDs are still uncertain, and the debate about them touches on many key neurobiological issues. None of the current proposals for the source of internal delay can satisfactorily explain the relationship between BD and BF, which has now been described in several mammals. The multitude of alternatives reflects the facts that extremely small binaural temporal differences can be detected behaviorally and that many processes that have comparatively slow
Acknowledgements
We thank the anonymous reviewers and the following readers for their comments: S. Kuwada, E. Monzack, M. McLaughlin, J. Ruhland and D. Tollin. P.X.J. is supported by the Fund for Scientific Research – Flanders (G.0392.05 and G.0633.07), and Research Fund K.U. Leuven (OT/01/42 and OT/05/57). T.C.T.Y. is supported by NIH grants DC02840 and DC07177.
References (70)
Coincidence detection in the auditory system: 50 years after Jeffress
Neuron
(1998)
Interaural delay sensitivity and the classification of low best-frequency binaural responses in the inferior colliculus of the guinea pig
Hear. Res.
(1996)
Time is traded for intensity in the bat's auditory system
Hear. Res.
(1988)
Projections from the lateral nucleus of the trapezoid body to the medial superior olivary nucleus in the gerbil
Hear. Res.
(1992)
Sound localization: the role of the commissural pathways of the auditory system of the cat
Brain Res.
(1974)
Functional role of the human inferior colliculus in binaural hearing
Hear. Res.
(2002)
The dominant role of low-frequency interaural time differences in sound localization
J. Acoust. Soc. Am.
(1992)
Listener weighting of cues for lateral angle: the duplex theory of sound localization revisited
J. Acoust. Soc. Am.
(2002)
Lateralization of low-frequency tones: relative potency of gating and ongoing interaural delays
J. Acoust. Soc. Am.
(1991)
Some measurements of interaural time differences thresholds
J. Acoust. Soc. Am.
(1956)
Some neural mechanisms in the inferior colliculus of the cat which may be relevant to localization of a sound source
J. Neurophysiol.
Response of binaural neurons of dog superior olivary complex to dichotic tonal stimuli: some physiological mechanisms of sound localization
J. Neurophysiol.
A place theory of sound localization
J. Comp. Physiol. Psychol.
Coding of auditory space
Annu. Rev. Neurosci.
A neuronal population code for sound localization
Nature
Neural sensitivity to interaural time differences: beyond the Jeffress model
J. Neurosci.
Projections from the anteroventral cochlear nucleus to the lateral and medial superior olivary nuclei
J. Comp. Neurol.
Posthearing developmental refinement of temporal processing in principal neurons of the medial superior olive
J. Neurosci.
Phase-locked response to low-frequency tones in single auditory nerve fibers of the squirrel monkey
J. Neurophysiol.
Enhancement of synchronization in the anteroventral cochlear nucleus. I. Responses to tonebursts at characteristic frequency
J. Neurophysiol.
Enhanced temporal response properties of anteroventral cochlear nucleus neurons to broadband noise
J. Neurosci.
Interaural time sensitivity in medial superior olive of cat
J. Neurophysiol.
Transformation of binaural response properties in the ascending auditory pathway: influence of time-varying interaural phase disparity
J. Neurophysiol.
Precise inhibition is essential for microsecond interaural time difference coding
Nature
Projections of physiologically characterized spherical bushy cell axons from the cochlear nucleus of the cat: evidence for delay lines to the medial superior olive
J. Comp. Neurol.
Axons from anteroventral cochlear nucleus that terminate in medial superior olive of cat: observations related to delay lines
J. Neurosci.
Topography of interaural temporal disparity coding in projections of medial superior olive to inferior colliculus
J. Neurosci.
Binaural interaction in low-frequency neurons in inferior colliculus of the cat. III. Effects of changing frequency
J. Neurophysiol.
Binaural masking and sensitivity to interaural delay in the inferior colliculus
Philos. Trans. R. Soc. Lond. B Biol. Sci.
A neural code for low-frequency sound localization in mammals
Nat. Neurosci.
A physiologically based model of interaural time difference discrimination
J. Neurosci.
Dependence of binaural and cochlear ‘best delays’ on characteristic frequency
Optimal neural population coding of an auditory spatial cue
Nature
Creating a sense of auditory space
J. Physiol.