×
Oct 2, 2023 · In this article, we propose a Global and Local Spatio-Temporal Encoder (GLSTE) to model the Spatio-temporal correlation.
Feb 23, 2024 · In this article, we propose a Global and Local Spatio-. Temporal Encoder for 3D human pose estimation from monoc- ular videos with a parallel ...
The results of the method with 27 frames as input are better than the vast majority of recent SOTA methods with 81 and 243 frames as input, which indicates ...
The current state-of-the-art on HumanEva-I is GLA-GCN (T=27, GT). See a full comparison of 31 papers with code.
Mar 2, 2022 · We propose MixSTE (Mixed Spatio-Temporal Encoder), which has a temporal transformer block to separately model the temporal motion of each joint.
Aug 7, 2024 · In this paper, we propose PoseMamba, a novel purely SSM-based approach with linear complexity for 3D human pose estimation in monocular video.
MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video. 2022. 24. SoloPose (H36M+HeatPose+H71M). 26.0, No, Monocular, 11.5.
May 23, 2024 · We propose a method termed GlPaLo (Global-Part-Local), which integrates global, part-level, and local information. It consists of two key modules: uMLPGraph ...
Next, our temporal transformer module analyzes global dependencies between each spatial feature represen- tation, and generates an accurate 3D pose estimation.
This paper presents some inspiring observations on the human body properties that hold heuristic patterns of human poses.
Missing: Spatio- | Show results with:Spatio-