Hierarchical dynamic parsing and encoding for action recognition

Bing Su*, Jiahuan Zhou, Xiaoqing Ding, Hao Wang, Ying Wu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

17 Scopus citations

Abstract

A video action generally exhibits quite complex rhythms and non-stationary dynamics. To model such non-uniform dynamics, this paper describes a novel hierarchical dynamic encoding method to capture both the locally smooth dynamics and globally drastic dynamic changes. It provides a multi-layer joint representation for temporal modeling for action recognition. At the first layer, the action sequence is parsed in an unsupervised manner into several smooth-changing stages corresponding to different key poses or temporal structures. The dynamics within each stage are encoded by mean-pooling or learning to rank based encoding. At the second layer, the temporal information of the ordered dynamics extracted from the previous layer is encoded again to form the overall representation. Extensive experiments on a gesture action dataset (Chalearn) and several generic action datasets (Olympic Sports and Hol-lywood2) have demonstrated the effectiveness of the proposed method.

Original languageEnglish (US)
Title of host publicationComputer Vision - 14th European Conference, ECCV 2016, Proceedings
EditorsBastian Leibe, Jiri Matas, Nicu Sebe, Max Welling
PublisherSpringer Verlag
Pages202-217
Number of pages16
ISBN (Print)9783319464923
DOIs
StatePublished - 2016
Event14th European Conference on Computer Vision, ECCV 2016 - Amsterdam, Netherlands
Duration: Oct 8 2016Oct 16 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9908 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other14th European Conference on Computer Vision, ECCV 2016
Country/TerritoryNetherlands
CityAmsterdam
Period10/8/1610/16/16

Funding

This work was supported by National Basic Research Program of China (2013CB329305, 2013CB329403), Natural Science Foundation of China (61303164, 61402447, 61471214, 61502466) and Development Plan of Outstanding Young Talent from Institute of Software, Chinese Academy of Sciences (ISCAS2014-JQ02). This work was also supported by National Science Foundation grant IIS-0916607, IIS-1217302.

Keywords

  • Action recognition
  • Dynamic encoding
  • Hierarchical modeling

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Hierarchical dynamic parsing and encoding for action recognition'. Together they form a unique fingerprint.

Cite this