Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Visual Navigation on Full GOAT-Bench Synonyms (val)
Loading...
58.4
Success Rate (SR)
GOAT-GTSem
16.904
27.677
38.45
49.223
Oct 28, 2025
Success Rate (SR)
Success weighted by Path Length (SPL)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Success Rate (SR)
Success weighted by Path Length (SPL)
GOAT-GTSem
2025.10
58.4
43.5
LagMemo
2025.10
44.8
32.1
SenseAct-NN
variant=SC
2025.10
38.2
15.2
Modular GOAT
2025.10
33.8
24.4
Modular CoWs
2025.10
18.5
11.5
SenseAct-NN
variant=Mono
2025.10
18.5
10.1
Feedback
Search any
task
Search any
task