Why did the person cross the road (there)? Scene understanding using probabilistic logic models and common sense reasoning

Aniruddha Kembhavi, Tom Yeh, Larry S. Davis

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

18 Scopus citations

Abstract

We develop a video understanding system for scene elements, such as bus stops, crosswalks, and intersections, that are characterized more by qualitative activities and geometry than by intrinsic appearance. The domain models for scene elements are not learned from a corpus of video, but are instead elicited from humans and represented as probabilistic logic rules within a Markov Logic Network framework. Human-elicited models, however, represent object interactions as they occur in the 3D world rather than as they appear when projected onto a specific 2D image plane. We bridge this gap by recovering qualitative scene geometry to analyze object interactions in the 3D world and then reasoning about scene geometry, occlusions, and common-sense domain knowledge using a set of meta-rules. The effectiveness of this approach is demonstrated on a set of videos of public spaces.
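As a rough illustration of what "probabilistic logic rules within a Markov Logic Network" can mean, the sketch below scores each possible world by the exponential of the summed weights of the rules it satisfies, then answers a query by brute-force enumeration. The predicate names, rules, and weights are hypothetical and chosen only for illustration; they are not taken from the paper, and exhaustive enumeration stands in for whatever inference procedure the authors actually used.

```python
import itertools
import math

# Hypothetical ground atoms for one image region (not from the paper).
ATOMS = ["PeopleWait", "BusHalts", "BusStop"]


def implies(a, b):
    """Material implication: a => b."""
    return (not a) or b


# Hypothetical weighted rules: (weight, test on a world).
# A world is a dict mapping each atom name to True or False.
RULES = [
    # People waiting in a region suggests the region is a bus stop.
    (1.5, lambda w: implies(w["PeopleWait"], w["BusStop"])),
    # A bus halting beside a region suggests the region is a bus stop.
    (2.0, lambda w: implies(w["BusHalts"], w["BusStop"])),
    # Weak prior: most regions are not bus stops.
    (1.0, lambda w: not w["BusStop"]),
]


def world_score(world):
    """Unnormalized MLN score: exp(sum of weights of satisfied rules)."""
    return math.exp(sum(weight for weight, rule in RULES if rule(world)))


def marginal(query, evidence):
    """P(query is True | evidence), by enumerating all unobserved atoms."""
    free = [a for a in ATOMS if a not in evidence]
    numerator = 0.0
    denominator = 0.0
    for values in itertools.product([False, True], repeat=len(free)):
        world = dict(evidence)
        world.update(zip(free, values))
        score = world_score(world)
        denominator += score
        if world[query]:
            numerator += score
    return numerator / denominator


if __name__ == "__main__":
    busy = marginal("BusStop", {"PeopleWait": True, "BusHalts": True})
    quiet = marginal("BusStop", {"PeopleWait": False, "BusHalts": False})
    print(f"P(BusStop | waiting, bus halts) = {busy:.3f}")
    print(f"P(BusStop | neither observed)   = {quiet:.3f}")
```

Because the rules are soft, a world that violates one is penalized rather than ruled out, which is what lets loosely stated human knowledge coexist with noisy video observations. Enumeration is exponential in the number of atoms and is only workable for toy cases like this one.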

Original language: English
Title of host publication: Computer Vision, ECCV 2010 - 11th European Conference on Computer Vision, Proceedings
Publisher: Springer Verlag
Pages: 693-706
Number of pages: 14
Edition: PART 2
ISBN (Print): 3642155510, 9783642155512
State: Published - 2010
Event: 11th European Conference on Computer Vision, ECCV 2010 - Heraklion, Crete, Greece
Duration: 10 Sep 2010 to 11 Sep 2010

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Number: PART 2
Volume: 6312 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 11th European Conference on Computer Vision, ECCV 2010
Country/Territory: Greece
City: Heraklion, Crete
Period: 10/09/10 to 11/09/10

Keywords

  • Markov Logic Networks
  • Scene Understanding
