Learned Spatio-Temporal Texture Descriptors for RGB-D Human Action Recognition

Authors

  • Zhengyuan Zhai Beijing University of Posts and Telecommunications, Beijing University of Posts and Telecommunications, 100 876 Beijing
  • Chunxiao Fan Beijing University of Posts and Telecommunications, Beijing University of Posts and Telecommunications, 100 876 Beijing
  • Yue Ming Beijing University of Posts and Telecommunications, Beijing University of Posts and Telecommunications, 100 876 Beijing

Keywords:

3D pixel differences vectors, compact binary face descriptor, feature fusion, human action recognition, RGB-depth videos

Abstract

Due to the recent arrival of Kinect, action recognition with depth images has attracted researchers' wide attentions and various descriptors have been proposed, where Local Binary Patterns (LBP) texture descriptors possess the properties of appearance invariance. However, the LBP and its variants are most artificially-designed, demanding engineers' strong prior knowledge and not discriminative enough for recognition tasks. To this end, this paper develops compact spatio-temporal texture descriptors, i.e. 3D-compact LBP (3D-CLBP) and local depth patterns (3D-CLDP), for color and depth videos in the light of compact binary face descriptor learning in face recognition. Extensive experiments performed on three standard datasets, 3D Online Action, MSR Action Pairs and MSR Daily Activity 3D, demonstrate that our method is superior to most comparative methods in respects of performance and can capture spatial-temporal texture cues in videos.

Downloads

Download data is not yet available.

Downloads

Published

2019-02-04

How to Cite

Zhai, Z., Fan, C., & Ming, Y. (2019). Learned Spatio-Temporal Texture Descriptors for RGB-D Human Action Recognition. COMPUTING AND INFORMATICS, 37(6), 1339–1362. Retrieved from https://www.cai.sk/ojs/index.php/cai/article/view/2018_6_1339