Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ALFRED

Benchmarks

Task NameDataset NameSOTA ResultTrend
Embodied AI Task PlanningEB-ALFRED
Average Score82
72
Instruction FollowingALFRED
Accuracy19.84
57
Embodied Task CompletionALFRED EB
Avg Score92
36
Instruction followingALFRED (test-unseen)
GC94.5
31
Continual Instruction FollowingALFRED
Success Rate (SR)69.9
28
Embodied Task CompletionALFRED unseen (test)
Success Rate4,572
26
Embodied Task CompletionALFRED seen (test)
Success Rate (SR)53.23
26
Task PlanningEB-ALFRED (Long)
Success Rate (SR)74
23
Embodied Instruction FollowingALFRED seen 1.0 (test)
GC54.81
20
Mobile ManipulationALFRED (test unseen)
Success Rate (SR)60.79
18
Mobile ManipulationALFRED seen (test)
Success Rate (SR)65.09
18
Task Progress EstimationAlfred
pmae2.19
15
Skill EvaluationALFRED
Object Perception (Grounding) Accuracy78.01
12
Embodied PlanningALFRED
Success Rate (SR)45.81
11
Embodied AI Task ExecutionEB-ALFRED online unsupervised setting
Success Rate (Avg)61
10
3D Instruction FollowingALFRED
Accuracy62
8
Interactive PlanningALFRED unseen (val)
Success Rate (SR)67.8
8
Instruction FollowingALFRED seen (test)
Task Success Rate29.16
7
Language-driven scene representationALFRED Object Shift [OS]
F1 Score83.92
7
Language-driven scene representationALFRED Template Shift [TS]
F1 Score84.9
7
Language-driven scene representationALFRED In-Distribution [ID]
F1 Score84.28
7
Instruction FollowingALFRED unseen (val)
Task Success Rate9.7
6
Instruction FollowingALFRED seen (val)
Task Success Rate33.7
6
Interactive PlanningALFRED (val seen)
SR46.59
6
Action LearningALFRED (Q3)
Accuracy54.8
5
Showing 25 of 38 rows