Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Action Sequence Generation on ALFRED (val unseen)
Loading...
91
Exact Match
Alfred
15.08
34.79
54.5
74.21
Dec 25, 2025
Exact Match
LCSS Similarity
LCSA Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Exact Match
LCSS Similarity
LCSA Score
Alfred
Model=Vicuna, Size=7B,...
2025.12
91
96
94
Alfred
Model=Vicuna, Size=7B,...
2025.12
64
89
83
InnerMonologue
Model=Vicuna, Size=7B,...
2025.12
21
65
49
ProgPrompt
Model=Vicuna, Size=7B,...
2025.12
18
61
49
Feedback
Search any
task
Search any
task