Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Combined Task Navigation on Real-world SCOUT 2.0 platform
Loading...
90
Success Rate (SR)
NORM-Nav
17.2
36.1
55
73.9
May 16, 2026
Success Rate (SR)
SPL
Failure Distance (FD)
Failure Behavior Analysis (BFA)
Updated 15d ago
Evaluation Results
Method
Method
Links
Success Rate (SR)
SPL
Failure Distance (FD)
Failure Behavior Analysis (BFA)
NORM-Nav
2026.05
90
54.87
3.01
85
BehAV
2026.05
30
18.62
7.02
39
InstructNav
2026.05
20
12.21
6.45
36
Feedback
Search any
task
Search any
task