STAR: Sparse Transformer-based Action Recognition

Shi, Feng; Lee, Chonghan; Qiu, Liang; Zhao, Yizhou; Shen, Tianyi; Muralidhar, Shivran; Han, Tian; Zhu, Song-Chun; Narayanan, Vijaykrishnan

arXiv:2107.07089 (cs)

[Submitted on 15 Jul 2021]

Abstract: Cognitive systems for human action and behavior have moved into the deep learning regime, and the advent of Graph Convolution Networks in particular has transformed the field in recent years. However, previous works have mainly focused on over-parameterized, complex models based on dense graph convolution networks, which makes training and inference inefficient. Meanwhile, Transformer-based models have not yet been well explored for cognitive applications in human action and behavior estimation. This work proposes a novel skeleton-based human action recognition model that applies sparse attention along the spatial dimension and segmented linear attention along the temporal dimension of the data. The model can also process video clips of variable length grouped into a single batch. Experiments show that the model reaches competitive accuracy while using far fewer trainable parameters and running faster in both training and inference: it achieves a 4x to 18x speedup and 1/7 to 1/15 of the model size compared with the baseline models.
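To make "sparse attention on the spatial dimension" concrete, the following is a minimal illustrative sketch, not the authors' implementation. It assumes that sparsity means each joint attends only to itself and to joints it shares a skeleton edge with; the function name `sparse_joint_attention`, the edge-list input, and the toy three-joint chain are all hypothetical choices made for this example.

```python
import math


def sparse_joint_attention(q, k, v, edges):
    """Scaled dot-product attention restricted to skeleton edges.

    q, k, v: lists of per-joint feature vectors (lists of floats).
    edges:   iterable of (i, j) joint pairs, treated as undirected.
             Assumption for this sketch: a joint attends only to itself
             and to its direct neighbors in the skeleton graph.
    """
    n, d = len(q), len(q[0])
    neighbors = [{i} for i in range(n)]  # self-loop for every joint
    for i, j in edges:
        neighbors[i].add(j)
        neighbors[j].add(i)

    out = []
    for i in range(n):
        nbrs = sorted(neighbors[i])
        # Scores are computed only for existing edges, never for all n*n pairs.
        scores = [
            sum(a * b for a, b in zip(q[i], k[j])) / math.sqrt(d) for j in nbrs
        ]
        top = max(scores)  # subtract the max for a numerically stable softmax
        weights = [math.exp(s - top) for s in scores]
        total = sum(weights)
        out.append(
            [
                sum(w / total * v[j][c] for w, j in zip(weights, nbrs))
                for c in range(len(v[0]))
            ]
        )
    return out


# Toy skeleton: a chain of three joints, 0 - 1 - 2.
chain_edges = [(0, 1), (1, 2)]
zeros = [[0.0, 0.0] for _ in range(3)]  # equal scores give uniform weights
values = [[1.0], [2.0], [3.0]]
result = sparse_joint_attention(zeros, zeros, values, chain_edges)
```

With identical queries and keys, each joint simply averages the values of its neighborhood, so the end joints of the chain never see each other's features. The cost grows with the number of skeleton edges rather than with the square of the number of joints, which is the general motivation for sparse attention on graph-structured inputs.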

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2107.07089 [cs.CV]
	(or arXiv:2107.07089v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2107.07089

Submission history

From: Feng Shi
[v1] Thu, 15 Jul 2021 02:53:11 UTC (547 KB)