Spatio-Temporal Analysis of Facial Actions using Lifecycle-Aware Capsule Networks

Churamani, Nikhil; Kalkan, Sinan; Gunes, Hatice

arXiv:2011.08819 (cs)

[Submitted on 17 Nov 2020 (v1), last revised 4 Mar 2021 (this version, v2)]

Abstract: Most state-of-the-art approaches for Facial Action Unit (AU) detection rely upon evaluating facial expressions from static frames, encoding a snapshot of heightened facial activity. In real-world interactions, however, facial expressions are usually more subtle and evolve over time, requiring AU detection models to learn spatial as well as temporal information. In this paper, we focus on both spatial and spatio-temporal features encoding the temporal evolution of facial AU activation. For this purpose, we propose the Action Unit Lifecycle-Aware Capsule Network (AULA-Caps), which performs AU detection using both frame-level and sequence-level features. At the frame level, the capsule layers of AULA-Caps learn spatial feature primitives to determine AU activations; at the sequence level, the model learns temporal dependencies between contiguous frames by focusing on relevant spatio-temporal segments in the sequence. The learnt feature capsules are routed together such that the model learns to selectively focus more on spatial or spatio-temporal information depending upon the AU lifecycle. The proposed model is evaluated on the commonly used BP4D and GFT benchmark datasets, obtaining state-of-the-art results on both datasets.

Comments:	Updated Figure 6 and the Acknowledgements. Corrected typos. 11 pages, 6 figures, 3 tables
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2011.08819 [cs.CV]
	(or arXiv:2011.08819v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2011.08819

Submission history

From: Nikhil Churamani
[v1] Tue, 17 Nov 2020 18:36:38 UTC (7,031 KB) (withdrawn)
[v2] Thu, 4 Mar 2021 02:41:43 UTC (7,698 KB)
