Ego4D

Benchmarks

| Task Name | Dataset Name | Metric | SOTA Result | Trend |
| --- | --- | --- | --- | --- |
| Hand Pose Estimation | Ego4D HInt v1 (test) | PCK @ 0.05 | 59.3 | 32 |
| Long-term action anticipation | Ego4D v1 (test) | ED@Z=20 Verb | 0.679 | 31 |
| State change classification | Ego4D v1 (test) | Accuracy | 75 | 29 |
| Video Grounding | Ego4D-NLQ v1 (test) | Recall@1 (Avg) | 24.96 | 27 |
| Temporal Grounding | Ego4D-NLQ | R@1 (IoU=0.3) | 29.06 | 25 |
| Action Recognition | Ego4D v1 (test) | Top-1 Accuracy (Verb) | 25.1 | 23 |
| Natural Language Queries | Ego4D NLQ (val) | Recall@1 (IoU=0.3) | 21.97 | 23 |
| Point-of-no-return temporal localization | Ego4D v1 (test) | Error | 0.61 | 21 |
| Natural Language Queries | Ego4D NLQ (test) | R@1 (IoU=0.3) | 26.67 | 21 |
| Online Action Detection | Ego4D GoalStep | Segment F1 | 11 | 20 |
| Temporal Grounding | Ego4D-NLQ (test) | R@1 (IoU=0.3) | 22.21 | 20 |
| Moment Query | Ego4D Moment Query (val) | R@1 (IoU=0.5) | 51.04 | 19 |
| Common and General Video Commentary | Ego4d | F1 | 45.82 | 18 |
| Long-Term Anticipation | Ego4D LTA v1 (test) | ED@Z=20 Verb | 0.65 | 18 |
| Verb Recognition | Ego4D | Top-1 Acc | 28.93 | 17 |
| Noun Recognition | Ego4D | Top-1 Acc | 35.85 | 17 |
| Short-Term Anticipation | Ego4D STA v2 (val) | N mAP | 37.41 | 16 |
| Task Progress Estimation | Ego4D | PMAE | 19.25 | 15 |
| Spatial-Temporal Anticipation | Ego4D STA v1, v2 (val) | Base Performance (B) | 55.98 | 14 |
| Narrative Reasoning | Ego4D (test) | BLEURT | 0.48 | 14 |
| Action Recognition | MMG-Ego4D 1.0 (test) | Accuracy (5-way 5-shot) | 63 | 13 |
| Object State Change Classification (OSCC) | Ego4D (test) | Accuracy | 75 | 13 |
| Video Temporal Grounding | Ego4D NLQ v1 (val) | R@1 (IoU=0.3) | 18.81 | 12 |
| Pronoun Coreference Resolution | Ego4D (test) | Accuracy | 52.7 | 12 |
| Speaking Target Identification | Ego4D v1.0 (test) | Accuracy | 66.5 | 12 |
Showing 25 of 109 rows