Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-turn embodied reasoning on BabyAI

73Success Rate

ReMem

21.218434.661748.10561.5483Nov 25, 2025Dec 25, 2025Jan 24, 2026Feb 23, 2026Mar 25, 2026Apr 24, 2026May 24, 2026
Updated 8d ago

Evaluation Results

MethodLinks
2025.11
7383-
2025.11
6373-
2025.11
6272-
2025.11
6171-
2025.11
5772-
2025.11
5665-
2026.05
55.35-70.07
2025.11
5466-
2025.11
5364-
2025.11
5364-
2025.11
5366-
2025.11
5364-
2025.11
5361-
2025.11
5368-
2025.11
5264-
2025.11
5265-
2025.11
5264-
2025.11
5268-
2025.11
5166-
2025.11
5067-
2026.05
50-63.12
2026.05
48.21-60.25
2026.05
48.2-64.1
2025.11
4863-
2025.11
4866-
2025.11
4866-
2026.05
47.3-61.4
2025.11
4664-
2025.11
4664-
2026.05
41.96-58.15
2026.05
40.17-50.62
2026.05
37.5-50.4
2026.05
37.5-50.16
2026.05
35.71-46.41
2026.05
26.78-42.44
2026.05
25-35.3
2026.05
23.21-35.55