Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ALFRED

Benchmarks

Task NameDataset NameSOTA ResultTrend
Instruction FollowingALFRED
Accuracy19.84
36
Embodied AI Task PlanningEB-ALFRED
Average Score70.8
28
Continual Instruction FollowingALFRED
Success Rate (SR)69.9
28
Instruction followingALFRED (test-unseen)
GC61.6
23
Embodied Instruction FollowingALFRED seen 1.0 (test)
GC54.81
20
Mobile ManipulationALFRED (test unseen)
Success Rate (SR)60.79
18
Mobile ManipulationALFRED seen (test)
Success Rate (SR)65.09
18
Task PlanningEB-ALFRED (Long)
Success Rate (SR)70
17
Task Progress EstimationAlfred
pmae2.19
15
Embodied Task CompletionALFRED unseen (test)
Success Rate4,572
14
Embodied Task CompletionALFRED seen (test)
Success Rate (SR)53.23
14
Embodied PlanningALFRED
Success Rate (SR)45.81
11
Embodied AI Task ExecutionEB-ALFRED online unsupervised setting
Success Rate (Avg)61
10
Embodied Task CompletionALFRED EB
Avg Score32.2
8
3D Instruction FollowingALFRED
Accuracy62
8
Interactive PlanningALFRED unseen (val)
Success Rate (SR)67.8
8
Interactive PlanningALFRED (val seen)
SR46.59
6
Delivery (Pick-Place)ALFRED
TSR71.2
4
InspectionALFRED
TSR (%)78.5
4
NavigationALFRED
Task Success Rate (TSR)84
4
Action Sequence GenerationALFRED (val unseen)
Exact Match91
4
High-level PlanningALFRED (val unseen)
EM64
4
Subtask CompletionALFRED
Avg Completion Rate0.53
4
Embodied Task CompletionALFRED Unseen (val)
Task Success Rate (TSR)20
3
Embodied Task CompletionALFRED seen (val)
Task Success Rate (SR)3.4
3
Showing 25 of 26 rows