Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GSA-R2R

Benchmarks

Task NameDataset NameSOTA ResultTrend
Vision-and-Language NavigationGSA-R2R N-Scene (test)
SR57
37
Vision-and-Language NavigationGSA-R2R N-Basic (test)
SR72
21
Vision-and-Language NavigationGSA-R2R R-Basic (test)
Success Rate79
21
Vision-Language NavigationGSA-R2R Basic Instructions Non-Residential v1 (test)
SR57.7
12
Vision-Language NavigationGSA-R2R Basic Instructions Residential v1 (test)
SR69.8
12
Vision-Language NavigationGSA-R2R User Instructions Residential v1 (test)
SR66.1
12
Vision-Language NavigationGSA-R2R Sheldon instructions
SR65.5
10
Vision-Language NavigationGSA-R2R Rachel instructions
SR68.6
10
Vision-Language NavigationGSA-R2R Moira instructions
SR62.3
10
Vision-Language NavigationGSA-R2R Keith instructions
SR68.3
10
Vision-Language NavigationGSA-R2R Child instructions
SR65.5
10
Vision-Language NavigationGSA-R2R (test-n-user)
SR58.65
4
Vision-Language NavigationGSA-R2R N-Basic (test)
SR75.36
4
Showing 13 of 13 rows