Action Recognition: First-and Second-Order 3D Feature in Bi-Directional Attention Network

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 187
  • Download : 0
This paper considers a 3D convolutional neural network (CNN) that learns spatial and temporal regions of higher importance through a bi-direction long short-term memory (bi-LSTM) attention for action recognition. First- and second-order differences of spatially most relevant C3D features (sp-m-C3D) are obtained, and the concatenation of the two differences with the sp-m-C3D is used to generate a temporal attention on the sp-m-C3D using a bi-LSTM. Temporally most relevant sp-m-C3D features are fed into another bi-LSTM for action recognition. Essentially, the network learns spatial and temporal regions of high importance for action recognition. We evaluate the network on two public action recognition datasets: UCF-101 (YouTube Action) and HMDB51. The proposed network performs better compared to other state-of-the-art networks.
Publisher
IEEE
Issue Date
2018-10-10
Language
English
Citation

2018 25th IEEE International Conference on Image Processing (ICIP), pp.3453 - 3457

DOI
10.1109/icip.2018.8451493
URI
http://hdl.handle.net/10203/247329
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0