Uncertainty-Aware Anticipation of Activities
About
Anticipating future activities in video is a task with many practical applications. While earlier approaches are limited to just a few seconds in the future, the prediction time horizon has just recently been extended to several minutes in the future. However, as increasing the predicted time horizon, the future becomes more uncertain and models that generate a single prediction fail at capturing the different possible future activities. In this paper, we address the uncertainty modelling for predicting long-term future activities. Both an action model and a length model are trained to model the probability distribution of the future activities. At test time, we sample from the predicted distributions multiple samples that correspond to the different possible sequences of future activities. Our model is evaluated on two challenging datasets and shows a good performance in capturing the multi-modal future activities without compromising the accuracy when predicting a single sequence of future activities.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Action Anticipation | Breakfast | MoC Accuracy20.73 | 64 | |
| Long-term Action Anticipation | 50 Salads | MoC Accuracy29.1 | 56 | |
| Procedure Planning | CrossTask | Success Rate (SR)2.15 | 35 | |
| Procedure Planning | CrossTask short horizon T=3 | SR2.15 | 11 | |
| Procedure Planning | CrossTask short horizon T=4 | SR0.98 | 10 | |
| Long-term Action Anticipation | 50 Salads (test) | MoC (alpha=0.2, beta=0.1)24.86 | 10 | |
| Procedure Planning | CrossTask T=3 (test) | SR2.15 | 9 | |
| Long-term Action Anticipation | Breakfast (test) | MoC (alpha=0.2, beta=0.1)16.71 | 9 | |
| Dense Action Anticipation | 50 Salads (50S) | Top-1 Acc (tau_o=20%, tau_a=10%)28.9 | 8 | |
| Procedure Planning | CrossTask T=4 (test) | SR0.98 | 8 |