Abstract
Most view-based vision algorithms are based on strong assumptions about the disposition of the objects in the image. To safely apply those algorithms in real world image sequences, we propose that a vision system should be divided into two components. The first component contains an approximate world model of the scene - a low accuracy, coarse description of the objects and actions in the world. Approximate world models are constructed and updated by simple vision routines and by the use of action information provided by an external source. The second component employs view-based algorithms to perform required perceptual tasks; the selection and control of the view-based methods are determined by the information provided by the approximate world model. We demonstrate the approximate world model approach in a project to control cameras in a TV studio where the external context is provided by a script.
| Original language | English |
|---|---|
| Pages (from-to) | 955-961 |
| Number of pages | 7 |
| Journal | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition |
| State | Published - 1997 |
| Event | Proceedings of the 1997 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - San Juan, PR, USA Duration: Jun 17 1997 → Jun 19 1997 |