English
 
User Manual Privacy Policy Disclaimer Contact us
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Journal Article

A simple algorithm that discovers efficient perceptual codes

MPS-Authors
There are no MPG-Authors available
External Ressource
Fulltext (public)
There are no public fulltexts stored in PuRe
Supplementary Material (public)
There is no public supplementary material available
Citation

Frey, B., Dayan, P., & Hinton, G. (1997). A simple algorithm that discovers efficient perceptual codes. Computational and Psychophysical Mechanisms of Visual Coding, 296-315.


Cite as: http://hdl.handle.net/21.11116/0000-0007-55A3-1
Abstract
We describe the 'wake-sleep' algorithm that allows a multilayer, unsupervised, neural network to build a hierarchy of representations of sensory input. The network has bottom-up 'recognition' connections that are used to convert sensory input into underlying representations. Unlike most artificial neural networks, it also has top-down 'generative' connections that can be used to reconstruct the sensory input from the representations. In the 'wake' phase of the learning algorithm, the network is driven by the bottom-up recognition connections and the top-down generative connections are trained to be better at reconstructing the sensory input from the representation chosen by the recognition process. In the 'sleep' phase, the network is driven top-down by the generative connections to produce a fantasized representation and a fantasized sensory input. The recognition connections are then trained to be better at recovering the fantasized representation from the fantasized sensory input. In both phases, the synaptic learning rule is simple and local. The combined effect of the two phases is to create representations of the sensory input that are efficient in the following sense: On average, it takes more bits to describe each sensory input vector directly than to first describe the representation of the sensory input chosen by the recognition process and then describe the difference between the sensory input and its reconstruction from the chosen representation.