Facial Action Unit Detection via Adaptive Attention and Relation

Shao, Zhiwen; Zhou, Yong; Cai, Jianfei; Zhu, Hancheng; Yao, Rui

doi:10.1109/TIP.2023.3277794

arXiv:2001.01168 (cs)

[Submitted on 5 Jan 2020 (v1), last revised 17 May 2023 (this version, v2)]

Abstract: Facial action unit (AU) detection is challenging because it is difficult to capture correlated information from subtle and dynamic AUs. Existing methods often resort to localizing the correlated regions of AUs, but predefining local AU attentions from correlated facial landmarks often discards essential parts, while learned global attention maps often contain irrelevant areas. Furthermore, existing relational reasoning methods often apply common patterns to all AUs and ignore the specific pattern of each AU. To tackle these limitations, we propose a novel adaptive attention and relation (AAR) framework for facial AU detection. Specifically, we propose an adaptive attention regression network that regresses the global attention map of each AU under the constraint of attention predefinition and the guidance of AU detection, which helps capture both landmark-specified dependencies in strongly correlated regions and globally distributed facial dependencies in weakly correlated regions. Moreover, considering the diversity and dynamics of AUs, we propose an adaptive spatio-temporal graph convolutional network that simultaneously reasons about the independent pattern of each AU, the inter-dependencies among AUs, and the temporal dependencies. Extensive experiments show that our approach (i) achieves competitive performance on challenging benchmarks including BP4D, DISFA, and GFT in constrained scenarios and Aff-Wild2 in unconstrained scenarios, and (ii) can precisely learn the regional correlation distribution of each AU.
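
The idea of regressing an attention map "under the constraint of attention predefinition" can be illustrated with a small sketch. The abstract does not give the formulation, so everything below is an assumption made for illustration: the Gaussian fall-off around landmark centers, the `sigma` parameter, the mean-squared-error penalty, and the function names `predefined_attention` and `constraint_loss` are hypothetical and are not taken from the paper.

```python
import math


def predefined_attention(height, width, centers, sigma=2.0):
    """Build a landmark-centered attention map with values in (0, 1].

    Assumption: a Gaussian fall-off around each (cx, cy) center; the
    paper's actual predefinition is not stated in the abstract.
    """
    attention = []
    for y in range(height):
        row = []
        for x in range(width):
            # Keep the strongest response over all centers of this AU.
            row.append(max(
                math.exp(-((x - cx) ** 2 + (y - cy) ** 2) / (2.0 * sigma ** 2))
                for cx, cy in centers
            ))
        attention.append(row)
    return attention


def constraint_loss(predicted, predefined):
    """Mean squared difference between a regressed map and the predefined map.

    Assumption: an MSE penalty stands in for the unspecified constraint.
    """
    total = 0.0
    count = 0
    for pred_row, ref_row in zip(predicted, predefined):
        for p, r in zip(pred_row, ref_row):
            total += (p - r) ** 2
            count += 1
    return total / count


if __name__ == "__main__":
    # Two hypothetical landmark centers on a 5x5 grid.
    reference = predefined_attention(5, 5, [(1, 1), (3, 3)])
    print(constraint_loss(reference, reference))
```

In a setup like this, the penalty would be only one term of the training objective: the map stays free to move away from the landmark-based prior wherever the AU detection loss favors other regions, which is one way to read the "guidance of AU detection" mentioned in the abstract.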

Comments:	This paper has been accepted by IEEE Transactions on Image Processing (TIP)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2001.01168 [cs.CV]
	(or arXiv:2001.01168v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2001.01168
Related DOI:	https://doi.org/10.1109/TIP.2023.3277794

Submission history

From: Zhiwen Shao
[v1] Sun, 5 Jan 2020 05:14:03 UTC (4,653 KB)
[v2] Wed, 17 May 2023 03:18:31 UTC (6,721 KB)
