The Invariance Hypothesis Implies Domain-Specific Regions in Visual Cortex

Joel Z Leibo; Qianli Liao; Fabio Anselmi; Tomaso Poggio

doi:10.1371/journal.pcbi.1004390

The Invariance Hypothesis Implies Domain-Specific Regions in Visual Cortex

PLoS Comput Biol. 2015 Oct 23;11(10):e1004390. doi: 10.1371/journal.pcbi.1004390. eCollection 2015 Oct.

Authors

Joel Z Leibo¹, Qianli Liao¹, Fabio Anselmi², Tomaso Poggio²

Affiliations

¹ Center for Brains, Minds, and Machines, MIT, Cambridge, Massachusetts, United States of America; McGovern Institute for Brain Research, MIT, Cambridge, Massachusetts, United States of America.
² Center for Brains, Minds, and Machines, MIT, Cambridge, Massachusetts, United States of America; McGovern Institute for Brain Research, MIT, Cambridge, Massachusetts, United States of America; Istituto Italiano di Tecnologia, Genova, Italy.

Abstract

Is visual cortex made up of general-purpose information processing machinery, or does it consist of a collection of specialized modules? If prior knowledge, acquired from learning a set of objects is only transferable to new objects that share properties with the old, then the recognition system's optimal organization must be one containing specialized modules for different object classes. Our analysis starts from a premise we call the invariance hypothesis: that the computational goal of the ventral stream is to compute an invariant-to-transformations and discriminative signature for recognition. The key condition enabling approximate transfer of invariance without sacrificing discriminability turns out to be that the learned and novel objects transform similarly. This implies that the optimal recognition system must contain subsystems trained only with data from similarly-transforming objects and suggests a novel interpretation of domain-specific regions like the fusiform face area (FFA). Furthermore, we can define an index of transformation-compatibility, computable from videos, that can be combined with information about the statistics of natural vision to yield predictions for which object categories ought to have domain-specific regions in agreement with the available data. The result is a unifying account linking the large literature on view-based recognition with the wealth of experimental evidence concerning domain-specific regions.

Publication types

Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Animals
Computer Simulation
Humans
Models, Neurological*
Nerve Net / physiology*
Pattern Recognition, Visual / physiology*
Recognition, Psychology / physiology*
Visual Cortex / physiology*
Visual Pathways / physiology*

Grants and funding

This material is based upon work supported by the Center for Brains, Minds, and Machines (CBMM), funded by NSF STC award CCF-1231216. URL: http://cbmm.mit.edu/ (TP). This research was also sponsored by grants from the National Science Foundation (NSF-0640097, NSF-0827427) URL: http://www.nsf.gov/ (TP), and the Air Force Office of Scientific Research AFOSR-THRL (FA8650-05-C-7262) URL: www.afosr.af.mil (TP). Additional support was provided by the Eugene McDermott Foundation (TP). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.