Visual Schemas In Neural Networks For Object Recognition And Scene Analysis (1997)
VISOR is a large connectionist system that shows how visual schemas can be learned, represented, and used through mechanisms natural to neural networks. Processing in VISOR is based on cooperation, competition, and parallel bottom-up and top-down activation of schema representations. Simulations show that VISOR is robust against noise and variations in the inputs and parameters. It can indicate the confidence of its analysis, pay attention to important minor differences, and use context to recognize ambiguous objects. Experiments also suggest that the representation and learning are stable, and its behavior is consistent with human processes such as priming, perceptual reversal, and circular reaction in learning. The schema mechanisms of VISOR can serve as a starting point for building robust high-level vision systems, and perhaps for schema-based motor control and natural language processing systems as well.
In Michael A. Arbib, editors, The Handbook of Brain Theory and Neural Networks, 1029-1031, Cambridge, MA, 1997. MIT Press.

Wee Kheng Leow Ph.D. Alumni leowwk [at] comp nus edu sg
Risto Miikkulainen Faculty risto [at] cs utexas edu