Subspace Kernel Learning on Tensor Sequences
About
Learning from structured multi-way data, represented as higher-order tensors, requires capturing complex interactions across tensor modes while remaining computationally efficient. We introduce Uncertainty-driven Kernel Tensor Learning (UKTL), a novel kernel framework for $M$-mode tensors that compares mode-wise subspaces derived from tensor unfoldings, enabling expressive and robust similarity measure. To handle large-scale tensor data, we propose a scalable Nystr\"{o}m kernel linearization with dynamically learned pivot tensors obtained via soft $k$-means clustering. A key innovation of UKTL is its uncertainty-aware subspace weighting, which adaptively down-weights unreliable mode components based on estimated confidence, improving robustness and interpretability in comparisons between input and pivot tensors. Our framework is fully end-to-end trainable and naturally incorporates both multi-way and multi-mode interactions through structured kernel compositions. Extensive evaluations on action recognition benchmarks (NTU-60, NTU-120, Kinetics-Skeleton) show that UKTL achieves state-of-the-art performance, superior generalization, and meaningful mode-wise insights. This work establishes a principled, scalable, and interpretable kernel learning paradigm for structured multi-way and multi-modal tensor sequences.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Action Recognition | NTU-60 (xsub) | Accuracy93.1 | 223 | |
| Action Recognition | NTU-120 (cross-subject (xsub)) | Accuracy90 | 211 | |
| Action Recognition | NTU 120 (Cross-Setup) | Accuracy91.4 | 203 | |
| Action Recognition | NTU-60 (xview) | Accuracy97.3 | 117 | |
| Skeleton-based Action Recognition | Kinetics-Skeleton | Top-1 Acc39.2 | 102 | |
| Skeleton-based Action Recognition | NTU-60 | FPS450 | 3 |