See me, hear me, touch me: multisensory integration in lateral occipital-temporal cortex

doi:10.1016/j.conb.2005.03.011

Current Opinion in Neurobiology

Volume 15, Issue 2, April 2005, Pages 145-153

https://doi.org/10.1016/j.conb.2005.03.011 Get rights and content

Our understanding of multisensory integration has advanced because of recent functional neuroimaging studies of three areas in human lateral occipito-temporal cortex: superior temporal sulcus, area LO and area MT (V5). Superior temporal sulcus is activated strongly in response to meaningful auditory and visual stimuli, but responses to tactile stimuli have not been well studied. Area LO shows strong activation in response to both visual and tactile shape information, but not to auditory representations of objects. Area MT, an important region for processing visual motion, also shows weak activation in response to tactile motion, and a signal that drops below resting baseline in response to auditory motion. Within superior temporal sulcus, a patchy organization of regions is activated in response to auditory, visual and multisensory stimuli. This organization appears similar to that observed in polysensory areas in macaque superior temporal sulcus, suggesting that it is an anatomical substrate for multisensory integration. A patchy organization might also be a neural mechanism for integrating disparate representations within individual sensory modalities, such as representations of visual form and visual motion.

Introduction

In everyday life, perceptual events often occur in multiple sensory modalities at once: we hear someone speaking as we see their mouth move. Most scientific investigations have focused on single modalities (frequently vision) in isolation. Recently, there has been increasing interest in studying integration across sensory modalities. In this review, I discuss progress in studying the brain mechanisms of multisensory integration in human lateral occipital-temporal cortex, especially functional magnetic resonance imaging (fMRI) studies of superior temporal sulcus (STS), area LO and area MT (see glossary for a brief definition of these terms). Links between human neuroimaging studies and studies in non-human primates are made using techniques from computational neuroanatomy that permit alignment of human and monkey brains.

An ongoing discussion concerns the appropriate methods for studying multisensory integration using fMRI [1•, 2•, 3]. One important method is to contrast unisensory stimulation conditions with multisensory conditions. The hallmark of multisensory integration is that unisensory stimuli presented in combination produce an effect different from the linear combination of the unisensory stimuli presented separately. In individual neurons, these differences can be quite dramatic, with multisensory responses that are much greater than the sum of individual unisensory responses (‘super-additivity’). However, because fMRI measurements integrate across thousands or millions of neurons, the super-additivity measure might not be appropriate [2^•]. Instead, increasingly liberal criteria might be more suitable, such as requiring only that multisensory responses are greater than the maximum or mean of the individual unisensory responses [1^•].

Another important issue is the high degree of inter-subject and -laboratory variability observed in fMRI studies. STS, LO and MT are attractive targets for a review because there is some consensus on their anatomical location. This is either because they constitute an anatomical structure observed in every normal human hemisphere (such as STS) or because their response properties make it possible to identify them with functional localizers (somewhat ambiguously for LO, unambiguously for MT). By starting out with well-defined regions, a review can sidestep some of the difficulties inherent in deciding if a stereotaxic coordinate reported in one study of multisensory integration corresponds to the same cortical region as a coordinate from a different study.

Although STS, LO and MT are found in relative proximity, within the space of a few centimeters in human lateral occipital temporal cortex, their multisensory response properties are quite different, as is our level of knowledge about their role in multisensory perception. Therefore, this review attempts to compare and contrast the activity in these three areas in response to stimuli in three sensory modalities — visual, auditory and tactile. Figure 1 illustrates the location of STS, LO and MT in folded and inflated versions of a human brain, and their relationship to Brodmann's cytoarchitectonic classification scheme.

Glossary

Area MT (V5): A region in extrastriate visual cortex distinguished by its heavy myelination and specialization for processing visual motion. It was first described in the posterior middle temporal cortex of owl monkey [53], leading to the designation MT. In macaque monkeys, this region lies in the posterior bank of the superior temporal sulcus, where some investigators have designated it V5 [54]. A homologous region has been found in many other species, including humans, where it lies near the junction of the inferior temporal sulcus and the lateral occipital sulcus [55].
Congruent and incongruent stimuli: Because different sensory modalities can be stimulated independently in an experimental setting, multisensory stimuli can be congruent (such as a picture of a car presented with the sound of a car) or incongruent (such as a picture of a car presented with the sound of a telephone).
fMRI (functional magnetic resonance imaging): A non-invasive method for measuring neuronal activity, typically with an indirect measure such as blood-oxygenation level dependent (BOLD) contrast.
Localizer: There is only a rough correlation between visible anatomical structures (such as specific sulci or gyri) and the functional areas that comprise the computational organization of the brain. However, in order to make inferences about organization, it is important to compare the same functional area across subjects. A common technique is to use a localizer fMRI scan (for instance, alternating moving and static stimuli) in order to identify a specific region of interest (for instance, area MT). Additional experiments are then performed and the results compared across subjects within this region.
Multisensory: Refers to the processing of stimuli presented in multiple sensory modalities at once. Although the term ‘multimodal’ is sometimes used as a synonym for multisensory, it is also used to describe studies that use multiple measurement techniques, such as fMRI and magnetoencephalography (MEG). Therefore, the term multisensory is preferred.
Synchronous and asynchronous stimuli: An experimental manipulation that involves artificially changing the temporal offset between stimuli presented in different sensory modalities in order to measure the effect on multisensory integration. For instance, the discomforting sensation when the dialogue in the sound track of a movie is offset from the images.

Section snippets

Multisensory integration in superior temporal sulcus

There is compelling evidence for auditory and visual responses in human STS to a variety of stimuli. (For a review of all regions important for multisensory identification and object recognition, please see Amedi et al. [4]). Because it extends over a large area of cortex, STS certainly contains several functional regions. However, the parcellation of human STS is poorly understood, and in this review STS is used as shorthand for ‘the constellation of cortical areas with multisensory response

Multisensory responses in human and monkey superior temporal sulcus

In macaque, an important multisensory region lies along the fundus of the STS. This region was functionally defined as the superior temporal polysensory (STP) area on the basis of single cell recordings [11] and probably corresponds to the region in macaques that was anatomically defined as temporal–parietal–occipital (TPO) [12]. Although the visual responses of many areas in macaque STS have been characterized, recent neuroimaging studies in macaque demonstrate that complex, behaviorally

Area LO

Area LO was first described as a region of human lateral occipital cortex, just ventral and posterior to area MT, that responded preferentially to images of objects versus those of textured patterns [22]. LO is thought to be important for processing visual shape information [23]. More recently, studies showing that an extended band of visual cortex responds preferentially to images versus patterns [24•, 25] has led to confusion over the location and identity of LO. Figure 2 illustrates the

Area MT

Area MT is recognized as a key locus for visual motion processing in the primate brain (see glossary). In macaque monkeys, MT is located in the lower bank of the STS (Figure 2b), whereas in humans, MT is located in lateral occipital cortex (Figure 2c). This review refers to ‘MT’ as a single area for simplicity, although this region of cortex contains several motion-responsive areas that are grouped together in most imaging studies, often under the rubric MT+ [35].

There are strong

Commonalities between integration across and within modalities

As discussed above, one of the neural substrates for multisensory integration in STS might be a patchy organization, in which neighboring patches respond primarily to unisensory auditory or visual information. Unisensory information might be translated into a common code and integrated in multisensory regions that lie between the unisensory patches. Such an organization might also be amenable to integration of other types of information. Neurons in STS can be selective to both visual form and

The object property model

It is also useful to consider the relationship between multisensory and category-related responses. One of the most surprising findings to arise from recent functional neuroimaging studies is that specific regions of human visual cortex respond preferentially to specific categories of objects. For instance, parts of lateral temporal cortex (middle temporal gyrus and inferior temporal sulcus, including portions of areas MT and LO) respond preferentially to images of man-made graspable objects

Conclusions and future directions

Neuroimaging studies in humans and non-human primates using the same multisensory stimuli will be crucial for forming a link between human neurobiology and the anatomical and physiological insight that can only be obtained from invasive studies. The results of these experiments, combined with advances in neuroimaging methods applicable in humans, such as high-resolution fMRI and MEG, mean that the next few years will surely see further great strides in our understanding of multisensory

References and recommended reading

Papers of particular interest, published within the annual period of review, have been highlighted as:

• of special interest
•• of outstanding interest

Acknowledgements

D Van Essen, D Hanlon, J Dickson, P Christidis and Z Saad were instrumental in figure preparation. A Martin and A Amedi provided helpful comments on the manuscript. This work was supported by the National Institute of Mental Health Intramural Research Program.

References (55)

M.S. Beauchamp et al.
Parallel visual motion processing streams for manipulable objects and human movements
Neuron
(2002)
N. Van Atteveldt et al.
Integration of letters and speech sounds in the human brain
Neuron
(2004)
Argall BD, Saad ZS, Beauchamp MS: A simplified method for intersubject averaging on the cortical surface using SUMA....
J. Padberg et al.
Architectonics and cortical connections of the upper bank of the superior temporal sulcus in the rhesus monkey: an analysis in the tangential plane
J Comp Neurol
(2003)
A. Poremba et al.
Functional mapping of the primate auditory system
Science
(2003)
J. Bodurka et al.
Scalable multichannel MRI data acquisition system
Magn Reson Med
(2004)
P.J. Laurienti et al.
Deactivation of sensory-specific cortex by cross-modal stimuli
J Cogn Neurosci
(2002)
S. Soto-Faraco et al.
Multisensory contributions to the perception of motion
Neuropsychologia
(2003)
Saad ZS, Reynolds RC, Argall BD, Japee S, Cox RW: Suma: an interface for surface-based intra- and inter-subject...
J.M. Allman et al.
Representation of the visual field in striate and adjoining cortex of the owl monkey (Aotus trivirgatus)
Brain Res
(1971)

S. Zeki

The distribution of wavelength and orientation selective cells in different areas of monkey visual cortex

Proc R Soc Lond B Biol Sci

(1983)

Beauchamp MS: Statistical criteria in fMRI studies of multisensory integration. Neuroinformatics 2005. In...

Laurienti PJ, Perrault TJ Jr, Stanford TR, Wallace MT, Stein BE: On the use of superadditivity as a metric for...

G.A. Calvert et al.

Multisensory integration: methodological approaches and emerging principles in the human brain

J Physiol (Paris)

(2004)

Amedi A, Kriegstein KV, Van Atteveldt N, Beauchamp MS, Naumer M: Functional imaging of human crossmodal identification...

T.M. Wright et al.

Polysensory interactions along lateral temporal regions evoked by audiovisual speech

Cereb Cortex

(2003)

E. Macaluso et al.

Spatial and temporal factors during processing of audiovisual speech: a PET study

Neuroimage

(2004)

M.S. Beauchamp et al.

Integration of auditory and visual information about objects in superior temporal sulcus

Neuron

(2004)

C. Bruce et al.

Visual properties of neurons in a polysensory area in superior temporal sulcus of the macaque

J Neurophysiol

(1981)

R. Gil-da-Costa et al.

Toward an evolutionary perspective on conceptual representation: species-specific calls activate visual and affective processing systems in the macaque

Proc Natl Acad Sci USA

(2004)

D.C. Van Essen

Surface-based approaches to spatial localization and registration in primate cerebral cortex

Neuroimage

(2004)

H. Burton et al.

Cortical activity to vibrotactile stimulation: an fMRI study in blind and sighted individuals

Hum Brain Mapp

(2004)

B. Seltzer et al.

Overlapping and nonoverlapping cortical projections to cortex of the superior temporal sulcus in the rhesus monkey: double anterograde tracer studies

J Comp Neurol

(1996)

C.G. Cusick et al.

Chemoarchitectonics and corticocortical terminations within the superior temporal sulcus of the rhesus monkey: evidence for subdivisions of superior temporal polysensory cortex

J Comp Neurol

(1995)

M.S. Beauchamp et al.

Unraveling multisensory integration: patchy organization within human STS multisensory cortex

Nat Neurosci

(2004)

J.A. De Zwart et al.

Signal to- noise ratio and parallel imaging performance of a 16-channel receive-only brain coil array at 3.0 Tesla

Magn Reson Med

(2004)

R. Malach et al.

Object-related activity revealed by functional magnetic resonance imaging in human occipital cortex

Proc Natl Acad Sci USA

(1995)

Cited by (311)

Unravelling the multisensory learning advantage: Different patterns of within and across frequency-specific interactions drive uni- and multisensory neuroplasticity
2024, NeuroImage
In the field of learning theory and practice, the superior efficacy of multisensory learning over uni-sensory is well-accepted. However, the underlying neural mechanisms at the macro-level of the human brain remain largely unexplored. This study addresses this gap by providing novel empirical evidence and a theoretical framework for understanding the superiority of multisensory learning. Through a cognitive, behavioral, and electroencephalographic assessment of carefully controlled uni-sensory and multisensory training interventions, our study uncovers a fundamental distinction in their neuroplastic patterns. A multilayered network analysis of pre- and post- training EEG data allowed us to model connectivity within and across different frequency bands at the cortical level. Pre-training EEG analysis unveils a complex network of distributed sources communicating through cross-frequency coupling, while comparison of pre- and post-training EEG data demonstrates significant differences in the reorganizational patterns of uni-sensory and multisensory learning. Uni-sensory training primarily modifies cross-frequency coupling between lower and higher frequencies, whereas multisensory training induces changes within the beta band in a more focused network, implying the development of a unified representation of audiovisual stimuli. In combination with behavioural and cognitive findings this suggests that, multisensory learning benefits from an automatic top-down transfer of training, while uni-sensory training relies mainly on limited bottom-up generalization. Our findings offer a compelling theoretical framework for understanding the advantage of multisensory learning.
Effects of noise and noise reduction on audiovisual speech perception in cochlear implant users: An ERP study
2023, Clinical Neurophysiology
Hearing with a cochlear implant (CI) is difficult in noisy environments, but the use of noise reduction algorithms, specifically ForwardFocus, can improve speech intelligibility. The current event-related potentials (ERP) study examined the electrophysiological correlates of this perceptual improvement.
Ten bimodal CI users performed a syllable-identification task in auditory and audiovisual conditions, with syllables presented from the front and stationary noise presented from the sides. Brainstorm was used for spatio-temporal evaluation of ERPs.
CI users revealed an audiovisual benefit as reflected by shorter response times and greater activation in temporal and occipital regions at P2 latency. However, in auditory and audiovisual conditions, background noise hampered speech processing, leading to longer response times and delayed auditory-cortex-activation at N1 latency. Nevertheless, activating ForwardFocus resulted in shorter response times, reduced listening effort and enhanced superior-frontal-cortex-activation at P2 latency, particularly in audiovisual conditions.
ForwardFocus enhances speech intelligibility in audiovisual speech conditions by potentially allowing the reallocation of attentional resources to relevant auditory speech cues.
This study shows for CI users that background noise and ForwardFocus differentially affect spatio-temporal cortical response patterns, both in auditory and audiovisual speech conditions.
Similarities and differences in the neural correlates of letter and speech sound integration in blind and sighted readers
2023, NeuroImage
Learning letter and speech sound (LS) associations is a major step in reading acquisition common for all alphabetic scripts, including Braille used by blind readers. The left superior temporal cortex (STC) plays an important role in audiovisual LS integration in sighted people, but it is still unknown what neural mechanisms are responsible for audiotactile LS integration in blind individuals. Here, we investigated the similarities and differences between LS integration in blind Braille (N = 42, age range: 9–60 y.o.) and sighted print (N = 47, age range: 9–60 y.o.) readers who acquired reading using different sensory modalities. In both groups, the STC responded to both isolated letters and isolated speech sounds, showed enhanced activation when they were presented together, and distinguished between congruent and incongruent letter and speech sound pairs. However, the direction of the congruency effect was different between the groups. Sighted subjects showed higher activity for incongruent LS pairs in the bilateral STC, similarly to previously studied typical readers of transparent orthographies. In the blind, congruent pairs resulted in an increased response in the right STC. These differences may be related to more sequential processing of Braille as compared to print reading. At the same time, behavioral efficiency in LS discrimination decisions and the congruency effect were found to be related to age and reading skill only in sighted participants, suggesting potential differences in the developmental trajectories of LS integration between blind and sighted readers.
Benefit of visual speech information for word comprehension in post-stroke aphasia
2023, Cortex
Aphasia is a language disorder that often involves speech comprehension impairments affecting communication. In face-to-face settings, speech is accompanied by mouth and facial movements, but little is known about the extent to which they benefit aphasic comprehension. This study investigated the benefit of visual information accompanying speech for word comprehension in people with aphasia (PWA) and the neuroanatomic substrates of any benefit. Thirty-six PWA and 13 neurotypical matched control participants performed a picture-word verification task in which they indicated whether a picture of an animate/inanimate object matched a subsequent word produced by an actress in a video. Stimuli were either audiovisual (with visible mouth and facial movements) or auditory-only (still picture of a silhouette) with audio being clear (unedited) or degraded (6-band noise-vocoding). We found that visual speech information was more beneficial for neurotypical participants than PWA, and more beneficial for both groups when speech was degraded. A multivariate lesion-symptom mapping analysis for the degraded speech condition showed that lesions to superior temporal gyrus, underlying insula, primary and secondary somatosensory cortices, and inferior frontal gyrus were associated with reduced benefit of audiovisual compared to auditory-only speech, suggesting that the integrity of these fronto-temporo-parietal regions may facilitate cross-modal mapping. These findings provide initial insights into our understanding of the impact of audiovisual information on comprehension in aphasia and the brain regions mediating any benefit.
10-Year trajectories of depressive symptoms and subsequent brain health in middle-aged adults
2023, Journal of Psychiatric Research
Depressive symptoms differ in severity and stability over time. Trajectories depicting these changes, particularly those with high late-life depressive symptoms, have been associated with poor brain health at old age. To better understand these associations across the lifespan, we examined depressive symptoms trajectories in relation to brain health in middle age. We included 1676 participants from the ORACLE Study, all were expecting a child at baseline (mean age 32.8, 66.6% women). Depressive symptoms were assessed at baseline, 3 years and 10 years after baseline. Brain health (global brain volume, subcortical structures volume, white matter lesions, cerebral microbleeds, cortical thickness, cortical surface area) was assessed 15 years after baseline. Using k-means clustering, four depressive symptoms trajectories were identified: low, low increasing, decreasing, and high increasing symptoms. The high increasing trajectory was associated with smaller brain volume compared to low symptoms, not surviving multiple testing correction. The low increasing trajectory was associated with more cortical thickness in a small region encompassing the right lateral occipital cortex compared to low symptoms. These findings show that longitudinal depressive symptoms trajectories are only minimally associated with brain health in middle age, suggesting that associations may only emerge later in life.
Knowing what you feel: Inferior frontal gyrus-based structural and functional neural patterns underpinning adaptive body awareness
2022, Journal of Affective Disorders
Heightened body awareness (BA) is conducive for increasing understanding of bodily state and improves individuals' health and well-being. Although there has been cumulative research concentrating on the self-perceived tendency to focus on negatively valenced interoceptive sensations, the specific structural and functional neural patterns underlying BA and their role in the relationship between BA and individual well-being remain unclear.
Voxel-based morphometry and whole brain functional connectivity analyses were conducted to examine the structural and functional neural patterns, respectively, in 686 healthy subjects. BA and subjective well-being were assessed using questionnaires.
BA was inversely related to gray matter volume of the right inferior frontal gyrus, opercular part (IFGoperc). Higher BA was correlated with enhanced IFGoperc-precuneus and IFGoperc-anterior supramarginal gyrus connectivities, and with decreased IFGoperc-lateral occipital cortex and IFGoperc-medial frontal cortex connectivities. The inferior frontal gyrus, triangular part (in the fronto-parietal task control network) acted as the hub that linked the sensory/somatomotor network, the default mode network, and the dorsal and ventral attention network. The IFGoperc-precuneus connectivity moderated the association between BA and subjective well-being.
We were unable to rank all the networks by their relative importance, because the absolute weighted value in each module was not calculated.
Our findings demonstrated that BA was reflected by specific neural patterns mainly involved in cognitive-affective control, attentional and self-referential processing, as well as multisensory integration, which could offer some references for current therapies (e.g., mindfulness, yoga training) that are dedicated to solving health problems and improving individual well-being.

View all citing articles on Scopus

View full text

See me, hear me, touch me: multisensory integration in lateral occipital-temporal cortex

Introduction

Section snippets

Multisensory integration in superior temporal sulcus

Multisensory responses in human and monkey superior temporal sulcus

Area LO

Area MT

Commonalities between integration across and within modalities

The object property model

Conclusions and future directions

References and recommended reading

Acknowledgements

Neuron

Neuron

J Comp Neurol

Science

Magn Reson Med

J Cogn Neurosci

Neuropsychologia

Brain Res

Proc R Soc Lond B Biol Sci

Multisensory integration: methodological approaches and emerging principles in the human brain

J Physiol (Paris)

Polysensory interactions along lateral temporal regions evoked by audiovisual speech

Cereb Cortex

Spatial and temporal factors during processing of audiovisual speech: a PET study

Neuroimage

Integration of auditory and visual information about objects in superior temporal sulcus

Neuron

Visual properties of neurons in a polysensory area in superior temporal sulcus of the macaque

J Neurophysiol

Toward an evolutionary perspective on conceptual representation: species-specific calls activate visual and affective processing systems in the macaque

Proc Natl Acad Sci USA

Surface-based approaches to spatial localization and registration in primate cerebral cortex

Neuroimage

Cortical activity to vibrotactile stimulation: an fMRI study in blind and sighted individuals

Hum Brain Mapp

Overlapping and nonoverlapping cortical projections to cortex of the superior temporal sulcus in the rhesus monkey: double anterograde tracer studies

J Comp Neurol

Chemoarchitectonics and corticocortical terminations within the superior temporal sulcus of the rhesus monkey: evidence for subdivisions of superior temporal polysensory cortex

J Comp Neurol

Unraveling multisensory integration: patchy organization within human STS multisensory cortex

Nat Neurosci

Signal to- noise ratio and parallel imaging performance of a 16-channel receive-only brain coil array at 3.0 Tesla

Magn Reson Med

Object-related activity revealed by functional magnetic resonance imaging in human occipital cortex

Proc Natl Acad Sci USA