Translational Equivariance in Kernelizable Attention

Horn, Max; Shridhar, Kumar; Groenewald, Elrich; Baumann, Philipp F. M.

arXiv:2102.07680 (cs)

[Submitted on 15 Feb 2021]

Abstract: While Transformer architectures have shown remarkable success, they are bound to the computation of all pairwise interactions between input elements and thus suffer from limited scalability. Recent work has succeeded in avoiding the computation of the complete attention matrix, yet this leads to problems down the line: the absence of an explicit attention matrix makes it more challenging to include inductive biases that rely on relative interactions between elements. An extremely powerful inductive bias is translational equivariance, which has been conjectured to be responsible for much of the success of Convolutional Neural Networks on image recognition tasks. In this work we show how translational equivariance can be implemented in Performers, efficient Transformers based on kernelizable attention. Our experiments highlight that the devised approach significantly improves the robustness of Performers to shifts of input images compared to their naive application. This represents an important step on the path of replacing Convolutional Neural Networks with more expressive Transformer architectures and will help to improve sample efficiency and robustness in this realm.
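
The two ideas in the abstract can be made concrete with a small NumPy sketch. It is an illustration under stated assumptions, not the method of the paper: the feature map `elu(x) + 1` is a simple stand-in for the random features used by Performers, and `pos` is a hypothetical table of absolute positional embeddings. The sketch checks that kernelized attention reproduces the output of the explicit N x N computation without forming that matrix, and that adding absolute positional embeddings breaks equivariance to circular shifts of the input.

```python
import numpy as np

def feature_map(x):
    # elu(x) + 1, a simple positive feature map. This is an assumption made
    # for illustration; Performers use random features instead.
    return np.where(x > 0, x + 1.0, np.exp(x))

def explicit_attention(q, k, v):
    # Forms the full (N, N) matrix of kernel scores and normalizes each row.
    scores = feature_map(q) @ feature_map(k).T
    weights = scores / scores.sum(axis=1, keepdims=True)
    return weights @ v

def kernelized_attention(q, k, v):
    # Same result, but only (d, d_v) and (d,) summaries of keys/values are built.
    phi_q, phi_k = feature_map(q), feature_map(k)
    kv = phi_k.T @ v           # shape (d, d_v)
    z = phi_k.sum(axis=0)      # shape (d,)
    return (phi_q @ kv) / (phi_q @ z)[:, None]

rng = np.random.default_rng(0)
n, d, shift = 16, 8, 3
x = rng.normal(size=(n, d))
pos = rng.normal(size=(n, d))  # hypothetical absolute positional embeddings

def layer(tokens, use_pos):
    # Self-attention over the tokens, optionally with absolute positions added.
    h = tokens + pos if use_pos else tokens
    return kernelized_attention(h, h, h)

def is_shift_equivariant(use_pos):
    # Equivariance: shifting the input should shift the output identically.
    out_of_shifted = layer(np.roll(x, shift, axis=0), use_pos)
    shifted_out = np.roll(layer(x, use_pos), shift, axis=0)
    return bool(np.allclose(out_of_shifted, shifted_out))

same_output = bool(np.allclose(explicit_attention(x, x, x),
                               kernelized_attention(x, x, x)))
equivariant_without_pos = is_shift_equivariant(use_pos=False)
equivariant_with_pos = is_shift_equivariant(use_pos=True)
print(same_output, equivariant_without_pos, equivariant_with_pos)
```

Dropping positional information altogether restores equivariance only because the layer then ignores token order entirely, which is not a useful model for images. This is why relative interactions between elements matter, and why their inclusion is harder when no explicit attention matrix exists.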

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2102.07680 [cs.LG]
	(or arXiv:2102.07680v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2102.07680

Submission history

From: Max Horn
[v1] Mon, 15 Feb 2021 17:14:15 UTC (49 KB)
