Recognition of visual activities and interactions by stochastic parsing

  • Yuri A. Ivanov
  • Aaron F. Bobick

Research output: Contribution to journal (article, peer-reviewed)

547 Scopus citations

Abstract

This paper describes a probabilistic syntactic approach to the detection and recognition of temporally extended activities and interactions between multiple agents. The fundamental idea is to divide the recognition problem into two levels. The lower level detections are performed using standard independent probabilistic event detectors to propose candidate detections of low-level features. The outputs of these detectors provide the input stream for a stochastic context-free grammar parsing mechanism. The grammar and parser provide longer range temporal constraints, disambiguate uncertain low-level detections, and allow the inclusion of a priori knowledge about the structure of temporal events in a given domain. To achieve such a system we: 1) provide techniques for generating a discrete symbol stream from continuous low-level detectors; 2) extend stochastic context-free parsing to handle uncertainty in the input symbol stream; 3) augment a run-time parsing algorithm to enforce intersymbol constraints such as requiring temporal consistency between primitives; and 4) extend the consistency filtering to maintain consistent multiobject interactions. We develop a real-time system and demonstrate the approach in several experiments on gesture recognition and in video surveillance. In the surveillance application, we show how the system correctly interprets activities of multiple, interacting objects.
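The two-level idea in the abstract can be illustrated with a minimal sketch. It is not the authors' parser: it swaps in a plain Viterbi-style CYK pass over a grammar in Chomsky normal form, and the grammar, the symbol names (`GESTURE`, `UP`, `DOWN`), the rule probabilities, and the function `viterbi_cyk` are all hypothetical. What it shows is only how uncertain low-level detections can enter a stochastic grammar: each input position carries a likelihood for every candidate terminal rather than a single hard symbol, and the grammar's rule probabilities pick the most probable reading.

```python
from collections import defaultdict

# Hypothetical toy grammar in Chomsky normal form (an assumption, not from the paper).
# Binary rules: (parent, left, right) -> rule probability.
BINARY = {
    ("GESTURE", "UP", "DOWN"): 0.6,
    ("GESTURE", "DOWN", "UP"): 0.4,
}
# Lexical rules: (parent, terminal) -> rule probability.
LEXICAL = {
    ("UP", "up"): 1.0,
    ("DOWN", "down"): 1.0,
}


def viterbi_cyk(observations, start="GESTURE"):
    """Return the probability of the best parse of an uncertain symbol stream.

    observations: list of dicts, one per time step, mapping each candidate
    terminal to the likelihood reported by a low-level detector.
    """
    n = len(observations)
    # best[(i, j, A)] = highest probability that nonterminal A spans positions i..j-1.
    best = defaultdict(float)

    # Width-1 spans: weight each lexical rule by the detector likelihood.
    for i, obs in enumerate(observations):
        for (parent, terminal), p in LEXICAL.items():
            score = p * obs.get(terminal, 0.0)
            if score > best[(i, i + 1, parent)]:
                best[(i, i + 1, parent)] = score

    # Wider spans: combine two adjacent sub-spans under each binary rule.
    for width in range(2, n + 1):
        for i in range(0, n - width + 1):
            j = i + width
            for k in range(i + 1, j):
                for (parent, left, right), p in BINARY.items():
                    score = p * best[(i, k, left)] * best[(k, j, right)]
                    if score > best[(i, j, parent)]:
                        best[(i, j, parent)] = score

    return best[(0, n, start)]


if __name__ == "__main__":
    # Two time steps; neither detection is certain.
    stream = [{"up": 0.7, "down": 0.3}, {"up": 0.4, "down": 0.6}]
    print(viterbi_cyk(stream))
```

In this toy stream, the reading "up then down" wins because both the detector likelihoods and the rule probability favor it. The temporal-consistency and multiobject constraints listed in the abstract are not modeled here.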

Original language: English
Pages (from-to): 852-872
Number of pages: 21
Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence
Volume: 22
Issue number: 8
State: Published - Aug 2000
