Liu, Tengyu

Learning Interaction Grammar for Action Recognition

2018

Liu, Tengyu
Advisor(s): Zhu, Song-Chun

Abstract

Video action recognition has been in the center of the stage since its introduction in 2004 [SLC04]. During the past 15 years, countless methods had been proposed to understand what human is doing in a video clip. While some works infer action label directly from pixel information, some other works propose to learn multi-level hierarchical structure that composes actions. In this work, we propose the learning of a two-layer grammar model for action recognition that is based on human object interaction. To evaluate the idea, we proposed an interaction grammar for action recognition on video. The model is evaluated by simulated annealing with MCMC and by deep learning.

Main Content

For improved accessibility of PDF content, download the file to your device.

UCLA

Learning Interaction Grammar for Action Recognition