Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Masked Autoencoders for Egocentric Video Understanding @ Ego4D Challenge 2022

About

In this report, we present our approach and empirical results of applying masked autoencoders in two egocentric video understanding tasks, namely, Object State Change Classification and PNR Temporal Localization, of Ego4D Challenge 2022. As team TheSSVL, we ranked 2nd place in both tasks. Our code will be made available.

Jiachen Lei, Shuang Ma, Zhongjie Ba, Sai Vemprala, Ashish Kapoor, Kui Ren• 2022

Related benchmarks

TaskDatasetResultRank
Temporal AttributionEgo4D-M (test)
MAE (frames)48.96
3
Showing 1 of 1 rows

Other info

Follow for update