TY - JOUR
T1 - Simple and complex cells revisited
T2 - toward a selectivity-invariance model of object recognition
AU - Li, Xin
AU - Wang, Shuo
N1 - Publisher Copyright:
Copyright © 2023 Li and Wang.
PY - 2023
Y1 - 2023
N2 - This paper presents a theoretical perspective on modeling ventral stream processing by revisiting the computational abstraction of simple and complex cells. In parallel to David Marr's vision theory, we organize the new perspective into three levels. At the computational level, we abstract simple and complex cells into space partitioning and composition in a topological space based on the redundancy exploitation hypothesis of Horace Barlow. At the algorithmic level, we present a hierarchical extension of sparse coding by exploiting the manifold constraint in high-dimensional space (i.e., the blessing of dimensionality). The resulting over-parameterized models for object recognition differ from existing hierarchical models by disentangling the objectives of selectivity and invariance computation. It is possible to interpret our hierarchical construction as a computational implementation of cortically local subspace untangling for object recognition and face representation, which are closely related to exemplar-based and axis-based coding in the medial temporal lobe. At the implementation level, we briefly discuss two possible implementations based on asymmetric sparse autoencoders and divergent spiking neural networks.
AB - This paper presents a theoretical perspective on modeling ventral stream processing by revisiting the computational abstraction of simple and complex cells. In parallel to David Marr's vision theory, we organize the new perspective into three levels. At the computational level, we abstract simple and complex cells into space partitioning and composition in a topological space based on the redundancy exploitation hypothesis of Horace Barlow. At the algorithmic level, we present a hierarchical extension of sparse coding by exploiting the manifold constraint in high-dimensional space (i.e., the blessing of dimensionality). The resulting over-parameterized models for object recognition differ from existing hierarchical models by disentangling the objectives of selectivity and invariance computation. It is possible to interpret our hierarchical construction as a computational implementation of cortically local subspace untangling for object recognition and face representation, which are closely related to exemplar-based and axis-based coding in the medial temporal lobe. At the implementation level, we briefly discuss two possible implementations based on asymmetric sparse autoencoders and divergent spiking neural networks.
KW - cortically local subspace untangling
KW - invariance computation
KW - object recognition
KW - selectivity computation
KW - simple and complex cells
UR - http://www.scopus.com/inward/record.url?scp=85175065491&partnerID=8YFLogxK
U2 - 10.3389/fncom.2023.1282828
DO - 10.3389/fncom.2023.1282828
M3 - Article
C2 - 37905187
AN - SCOPUS:85175065491
SN - 1662-5188
VL - 17
JO - Frontiers in Computational Neuroscience
JF - Frontiers in Computational Neuroscience
M1 - 1282828
ER -