Sparse Local Patch Transformer for Robust Face Alignment and Landmarks Inherent Relation Learning

Xia, Jiahao; Qu, Weiwei; Huang, Wenjian; Zhang, Jianguo; Wang, Xi; Xu, Min

arXiv:2203.06541 (cs)

[Submitted on 13 Mar 2022 (v1), last revised 26 Mar 2022 (this version, v2)]

Abstract: Heatmap regression methods have dominated face alignment in recent years, but they ignore the inherent relation between different landmarks. In this paper, we propose a Sparse Local Patch Transformer (SLPT) to learn this inherent relation. The SLPT generates a representation of each landmark from a local patch and aggregates these representations through an adaptive inherent relation based on the attention mechanism. The subpixel coordinate of each landmark is then predicted independently from the aggregated feature. Moreover, a coarse-to-fine framework is combined with the SLPT, which enables the initial landmarks to gradually converge to the target facial landmarks using fine-grained features from dynamically resized local patches. Extensive experiments on three popular benchmarks, WFLW, 300W and COFW, demonstrate that, by learning the inherent relation between facial landmarks, the proposed method performs at the state-of-the-art level with much lower computational complexity. The code is available at the project website.
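
The two ideas in the abstract, attention-based aggregation of per-landmark features and coarse-to-fine refinement with shrinking patches, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the single-head attention, the halving patch schedule, the three stages, and the `oracle` predictor (a stand-in for a learned network) are all assumptions made for illustration, and coordinates are assumed to be normalized to the image size.

```python
import numpy as np

def attention_aggregate(tokens, w_q, w_k, w_v):
    """Single-head self-attention over N landmark tokens of shape (N, d).

    Hypothetical stand-in for the "adaptive inherent relation": every
    landmark feature is re-expressed as a weighted mix of all the others.
    """
    q, k, v = tokens @ w_q, tokens @ w_k, tokens @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])
    scores -= scores.max(axis=-1, keepdims=True)    # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # each row sums to 1
    return weights @ v, weights

def refine(landmarks, patch_size, predict_offset):
    """One stage: move each landmark to a predicted spot inside its patch.

    predict_offset must return values in [-0.5, 0.5], i.e. a position
    relative to the patch centred on the current landmark estimate.
    """
    offsets = predict_offset(landmarks, patch_size)
    return landmarks + offsets * patch_size

def coarse_to_fine(initial, predict_offset, first_patch=0.5, stages=3):
    """Repeat refinement while halving the patch size (assumed schedule)."""
    landmarks = initial.copy()
    patch = first_patch
    for _ in range(stages):
        landmarks = refine(landmarks, patch, predict_offset)
        patch /= 2  # smaller patch gives finer-grained local features
    return landmarks

if __name__ == "__main__":
    rng = np.random.default_rng(0)

    # Five landmark tokens with 8-dimensional (random, untrained) features.
    tokens = rng.normal(size=(5, 8))
    w_q, w_k, w_v = (rng.normal(size=(8, 8)) for _ in range(3))
    aggregated, relation = attention_aggregate(tokens, w_q, w_k, w_v)
    print("relation matrix shape:", relation.shape)

    # Toy refinement: an oracle that knows the target replaces the network.
    target = np.array([[0.3, 0.4], [0.7, 0.4], [0.5, 0.8]])
    initial = target + 0.3

    def oracle(landmarks, patch_size):
        return np.clip((target - landmarks) / patch_size, -0.5, 0.5)

    print("max error:", np.abs(coarse_to_fine(initial, oracle) - target).max())
```

Because each stage can only move a landmark within its current patch, the early, larger patches absorb large initial errors while the later, smaller ones allow finer corrections; the real method would replace `oracle` with a prediction made from the aggregated features.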

Comments:	Accepted to CVPR 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2203.06541 [cs.CV]
	(or arXiv:2203.06541v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2203.06541

Submission history

From: Jiahao Xia
[v1] Sun, 13 Mar 2022 01:15:23 UTC (3,442 KB)
[v2] Sat, 26 Mar 2022 13:46:46 UTC (3,441 KB)
