Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GSA-R2R

Benchmarks

Task NameDataset NameSOTA ResultTrend
Vision-and-Language NavigationGSA-R2R N-Scene (test)
SR56.67
26
Vision-Language NavigationGSA-R2R Basic Instructions Non-Residential v1 (test)
SR57.7
12
Vision-Language NavigationGSA-R2R Basic Instructions Residential v1 (test)
SR69.8
12
Vision-Language NavigationGSA-R2R User Instructions Residential v1 (test)
SR66.1
12
Vision-and-Language NavigationGSA-R2R N-Basic (test)
TL7.9
10
Vision-and-Language NavigationGSA-R2R R-Basic (test)
Trajectory Length8
10
Vision-Language NavigationGSA-R2R Sheldon instructions
SR65.5
10
Vision-Language NavigationGSA-R2R Rachel instructions
SR68.6
10
Vision-Language NavigationGSA-R2R Moira instructions
SR62.3
10
Vision-Language NavigationGSA-R2R Keith instructions
SR68.3
10
Vision-Language NavigationGSA-R2R Child instructions
SR65.5
10
Vision-Language NavigationGSA-R2R (test-n-user)
SR58.65
4
Vision-Language NavigationGSA-R2R N-Basic (test)
SR75.36
4
Showing 13 of 13 rows