Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Nonnegative tensor-based linear dynamical systems for recognizing human action from 3D skeletons - MaRDI portal

Nonnegative tensor-based linear dynamical systems for recognizing human action from 3D skeletons (Q2298940)

From MaRDI portal





scientific article
Language Label Description Also known as
English
Nonnegative tensor-based linear dynamical systems for recognizing human action from 3D skeletons
scientific article

    Statements

    Nonnegative tensor-based linear dynamical systems for recognizing human action from 3D skeletons (English)
    0 references
    0 references
    20 February 2020
    0 references
    Summary: Recently, skeleton-based action recognition has become a very important topic in the field of computer vision. It is a challenging task to accurately build a human action model and precisely distinguish similar human actions. In this paper, an action (skeleton sequence) is represented as a third-order nonnegative tensor time series to capture the original spatiotemporal information of the action. As a linear dynamical system (LDS) is an efficient tool for encoding the spatiotemporal data in various disciplines, this paper proposes a nonnegative tensor-based LDS (nLDS) to model the third-order nonnegative tensor time series. Nonnegative Tucker decomposition (NTD) is utilized to estimate the parameters of the nLDS model. These parameters are used to build extended observability sequence \(\mathbf{O}_{\infty}^T\) for the action, which implies that \(\mathbf{O}_{\infty}^T\) can be considered as the feature descriptor of the action. To avoid the limitations introduced by approximating \(\mathbf{O}_{\infty}^T\) with a finite-order matrix, we represent an action as a point on infinite Grassmann manifold comprising the orthonormalized extended observability sequences. The classification task can be performed by dictionary learning and sparse coding on the infinite Grassmann manifold. The experimental results on the MSR-Action3D, UTKinect-Action, and G3D-Gaming datasets demonstrate that the proposed approach achieves a better performance in comparison with the state-of-the-art methods.
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references