TY - JOUR
T1 - Action Recognition Using 3D Histograms of Texture and A Multi-Class Boosting Classifier
AU - Zhang, Baochang
AU - Yang, Yun
AU - Chen, Chen
AU - Yang, Linlin
AU - Han, Jungong
AU - Shao, Ling
N1 - Publisher Copyright:
© 1992-2012 IEEE.
PY - 2017/10/1
Y1 - 2017/10/1
N2 - Human action recognition is an important yet challenging task. This paper presents a low-cost descriptor called 3D histograms of texture (3DHoTs) to extract discriminant features from a sequence of depth maps. 3DHoTs are derived from projecting depth frames onto three orthogonal Cartesian planes, i.e., the frontal, side, and top planes, and thus compactly characterize the salient information of a specific action, on which texture features are calculated to represent the action. Besides this fast feature descriptor, a new multi-class boosting classifier (MBC) is also proposed to efficiently exploit different kinds of features in a unified framework for action classification. Compared with the existing boosting frameworks, we add a new multi-class constraint into the objective function, which helps to maintain a better margin distribution by maximizing the mean of margin, whereas still minimizing the variance of margin. Experiments on the MSRAction3D, MSRGesture3D, MSRActivity3D, and UTD-MHAD data sets demonstrate that the proposed system combining 3DHoTs and MBC is superior to the state of the art.
AB - Human action recognition is an important yet challenging task. This paper presents a low-cost descriptor called 3D histograms of texture (3DHoTs) to extract discriminant features from a sequence of depth maps. 3DHoTs are derived from projecting depth frames onto three orthogonal Cartesian planes, i.e., the frontal, side, and top planes, and thus compactly characterize the salient information of a specific action, on which texture features are calculated to represent the action. Besides this fast feature descriptor, a new multi-class boosting classifier (MBC) is also proposed to efficiently exploit different kinds of features in a unified framework for action classification. Compared with the existing boosting frameworks, we add a new multi-class constraint into the objective function, which helps to maintain a better margin distribution by maximizing the mean of margin, whereas still minimizing the variance of margin. Experiments on the MSRAction3D, MSRGesture3D, MSRActivity3D, and UTD-MHAD data sets demonstrate that the proposed system combining 3DHoTs and MBC is superior to the state of the art.
KW - Action recognition
KW - boosting classifier
KW - depth image
KW - multi-class classification
KW - texture feature
UR - http://www.scopus.com/inward/record.url?scp=85021832532&partnerID=8YFLogxK
U2 - 10.1109/TIP.2017.2718189
DO - 10.1109/TIP.2017.2718189
M3 - Article
SN - 1057-7149
VL - 26
SP - 4648
EP - 4660
JO - IEEE Transactions on Image Processing
JF - IEEE Transactions on Image Processing
IS - 10
M1 - 7954740
ER -