Research articles

An information theory perspective on computational vision

Expand
  • Department of Statistics, University of California at Los Angeles, Los Angeles, CA 90095, USA;

Published date: 05 Sep 2010

Abstract

This paper introduces computer vision from an information theory perspective. We discuss how vision can be thought of as a decoding problem where the goal is to find the most efficient encoding of the visual scene. This requires probabilistic models which are capable of capturing the complexity and ambiguities of natural images. We start by describing classic Markov Random Field (MRF) models of images. We stress the importance of having efficient inference and learning algorithms for these models and emphasize those approaches which use concepts from information theory. Next we introduce more powerful image models that have recently been developed and which are better able to deal with the complexities of natural images. These models use stochastic grammars and hierarchical representations. They are trained using images from increasingly large databases. Finally, we described how techniques from information theory can be used to analyze vision models and measure the effectiveness of different visual cues.

Cite this article

Alan YUILLE, . An information theory perspective on computational vision[J]. Frontiers of Electrical and Electronic Engineering, 2010 , 5(3) : 329 -346 . DOI: 10.1007/s11460-010-0107-x

Outlines

/