Squeeze-and-Excitation on Spatial and Temporal Deep Feature Space for Action Recognition

An, Gaoyun; Zhou, Wen; Wu, Yuxuan; Zheng, Zhenxing; Liu, Yongwen

arXiv:1806.00631 (cs)

[Submitted on 2 Jun 2018 (v1), last revised 20 Jul 2018 (this version, v2)]

Abstract: Spatial and temporal features are two key, complementary sources of information for human action recognition. To make full use of intra-frame spatial characteristics and inter-frame temporal relationships, we propose the Squeeze-and-Excitation Long-term Recurrent Convolutional Networks (SE-LRCN) for human action recognition. The Squeeze and Excitation operations are used to implement feature recalibration. In SE-LRCN, a Squeeze-and-Excitation ResNet-34 (SE-ResNet-34) network is adopted to extract spatial features and to enhance the dependencies and importance of feature channels at pixel granularity. We also propose the Squeeze-and-Excitation Long Short-Term Memory (SE-LSTM) network to model temporal relationships and to enhance the dependencies and importance of feature channels at frame granularity. We evaluate the proposed model on two challenging benchmarks, HMDB51 and UCF101, where SE-LRCN achieves results competitive with the state of the art.
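
The feature recalibration mentioned in the abstract can be illustrated with a minimal sketch of a generic Squeeze-and-Excitation step on a single feature map. This is not the authors' implementation: the abstract does not give layer sizes, the reduction ratio, or how SE-LSTM applies the gates at frame granularity, so the function names, the random weights, and the tiny dimensions below are all hypothetical and only show the usual squeeze (global average pooling), excitation (bottleneck, ReLU, sigmoid), and channel-wise scaling.

```python
import math
import random


def squeeze(feature_map):
    """Squeeze: global average pooling, one scalar per channel."""
    return [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]


def dense(vector, weights, biases):
    """Fully connected layer; weights holds one row per output unit."""
    return [
        sum(w * x for w, x in zip(row, vector)) + b
        for row, b in zip(weights, biases)
    ]


def excite(descriptor, w1, b1, w2, b2):
    """Excitation: bottleneck FC -> ReLU -> FC -> sigmoid, giving gates in (0, 1)."""
    hidden = [max(0.0, h) for h in dense(descriptor, w1, b1)]
    return [1.0 / (1.0 + math.exp(-z)) for z in dense(hidden, w2, b2)]


def se_recalibrate(feature_map, w1, b1, w2, b2):
    """Scale every channel of feature_map by its gate."""
    gates = excite(squeeze(feature_map), w1, b1, w2, b2)
    scaled = [
        [[gate * value for value in row] for row in channel]
        for channel, gate in zip(feature_map, gates)
    ]
    return scaled, gates


def random_matrix(rows, cols, rng):
    """Hypothetical stand-in for trained weights."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
channels, height, width, reduction = 4, 2, 2, 2  # illustrative sizes only
features = [
    [[rng.random() for _ in range(width)] for _ in range(height)]
    for _ in range(channels)
]
w1 = random_matrix(channels // reduction, channels, rng)
b1 = [0.0] * (channels // reduction)
w2 = random_matrix(channels, channels // reduction, rng)
b2 = [0.0] * channels

recalibrated, gates = se_recalibrate(features, w1, b1, w2, b2)
```

Each gate lies strictly between 0 and 1, so the step can only attenuate a channel relative to the others; in a trained network the weights would be learned jointly with the backbone rather than drawn at random as here.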

Comments:	Need to be Revised
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1806.00631 [cs.CV]
	(or arXiv:1806.00631v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1806.00631

Submission history

From: Zhenxing Zheng
[v1] Sat, 2 Jun 2018 13:09:50 UTC (1,276 KB)
[v2] Fri, 20 Jul 2018 02:14:33 UTC (561 KB)
