AbHE: All Attention-based Homography Estimation

Huo, Mingxiao; Zhang, Zhihao; Ren, Xinyang; Yang, Xianqiang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2212.03029 (cs)

[Submitted on 6 Dec 2022 (v1), last revised 5 Feb 2023 (this version, v3)]

Title:AbHE: All Attention-based Homography Estimation

Authors:Mingxiao Huo, Zhihao Zhang, Xinyang Ren, Xianqiang Yang

View PDF

Abstract:Homography estimation is a basic computer vision task, which aims to obtain the transformation from multi-view images for image alignment. Unsupervised learning homography estimation trains a convolution neural network for feature extraction and transformation matrix regression. While the state-of-theart homography method is based on convolution neural networks, few work focuses on transformer which shows superiority in highlevel vision tasks. In this paper, we propose a strong-baseline model based on the Swin Transformer, which combines convolution neural network for local features and transformer module for global features. Moreover, a cross non-local layer is introduced to search the matched features within the feature maps coarsely. In the homography regression stage, we adopt an attention layer for the channels of correlation volume, which can drop out some weak correlation feature points. The experiment shows that in 8 Degree-of-Freedoms(DOFs) homography estimation our method overperforms the state-of-the-art method.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2212.03029 [cs.CV]
	(or arXiv:2212.03029v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2212.03029

Submission history

From: Mingxiao Huo [view email]
[v1] Tue, 6 Dec 2022 15:00:00 UTC (187 KB)
[v2] Wed, 7 Dec 2022 02:04:41 UTC (187 KB)
[v3] Sun, 5 Feb 2023 18:41:36 UTC (4,388 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:AbHE: All Attention-based Homography Estimation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:AbHE: All Attention-based Homography Estimation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators