Abstract:
How do our brains transform patterns of light striking the retina into useful knowledge about objects and events of the external world? Thanks to intense research into the mechanisms of vision, much is now known about this process. However, we do not yet have anything close to a complete picture, and many questions remain unanswered. In addition to its clinical relevance and purely academic significance, research on vision is important because a thorough understanding of biological vision would probably help solve many major problems in computer vision.
A major framework for investigating the computational basis of vision is what might be called the probabilistic view of vision. This approach emphasizes the general importance of uncertainty and probabilities in perception and, in particular, suggests that perception is tightly linked to the statistical structure of the natural environment. This thesis investigates this link by building statistical models of natural images, and relating these to what is known of the information processing performed by the early stages of the primate visual system.
Recently, it was suggested that the response properties of simple cells in the primary visual cortex could be interpreted as the result of the cells performing an independent component analysis of the natural visual sensory input. This thesis provides some further support for that proposal, and, more importantly, extends the theory to also account for complex cell properties and the columnar organization of the primary visual cortex. Finally, the application of these methods to predicting neural response properties further along the visual pathway is considered.
Although the models considered account for only a relatively small part of known facts concerning early visual information processing, it is nonetheless a rather impressive amount considering the simplicity of the models. This is encouraging, and suggests that many of the intricacies of visual information processing might be understood using fairly simple probabilistic models of natural sensory input.
|
Parts:
P. O. Hoyer and A. Hyvärinen, Independent component analysis applied to feature extraction from colour and stereo images, Network: Computation in Neural Systems, vol. 11, no. 3, pp. 191-210, 2000.A. Hyvärinen and P. O. Hoyer, Emergence of phase and shift invariant features by decomposition of natural images into independent feature subspaces, Neural Computation, vol. 12, no. 7, pp. 1705-1720, 2000.A. Hyvärinen and P. O. Hoyer, A two-layer sparse coding model learns simple and complex cell receptive fields and topography from natural images, Vision Research, vol. 41, no. 18, pp. 2413-2423, 2001.P. O. Hoyer, Non-negative sparse coding, in Neural Networks for Signal Processing XII (Proc. IEEE Workshop on Neural Networks for Signal Processing 2002, Martigny, Switzerland), pp. 557-565, 2002.P. O. Hoyer, Modeling receptive fields with non-negative sparse coding, in Computational Neuroscience: Trends in Research 2003, Elsevier, Amsterdam, 2003. In press.P. O. Hoyer and A. Hyvärinen, A multi-layer sparse coding network learns contour coding from natural images, Vision Research, vol. 42, no. 12, pp. 1593-1605, 2002.
|