Guided Attention for Next Active Object @ EGO4D STA Challenge
About
In this technical report, we describe the Guided-Attention mechanism based solution for the short-term anticipation (STA) challenge for the EGO4D challenge. It combines the object detections, and the spatiotemporal features extracted from video clips, enhancing the motion and contextual information, and further decoding the object-centric and motion-centric information to address the problem of STA in egocentric videos. For the challenge, we build our model on top of StillFast with Guided Attention applied on fast network. Our model obtains better performance on the validation set and also achieves state-of-the-art (SOTA) results on the challenge test set for EGO4D Short-Term Object Interaction Anticipation Challenge.
Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue• 2023
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Short-Term Anticipation | Ego4D STA v2 (val) | N mAP20.52 | 16 | |
| Spatial-Temporal Anticipation | Ego4D STA v1, v2 (val) | Base Performance (B)45.3 | 14 | |
| Short-term object interaction anticipation | Ego4D | -- | 9 | |
| Short-term object interaction anticipation | EGO4D v2 (test) | Noun Top-5 mAP25.67 | 8 | |
| Spatio-Temporal Anticipation | Ego4D-STA v2 (test) | mAP (Noun)25.67 | 8 | |
| Short-term object interaction anticipation | EGO4D v2 (val) | Top-5 Noun mAP20.52 | 3 |
Showing 6 of 6 rows