Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

VLM-Social-Nav: Socially Aware Robot Navigation through Scoring using Vision-Language Models

About

We propose VLM-Social-Nav, a novel Vision-Language Model (VLM) based navigation approach to compute a robot's motion in human-centered environments. Our goal is to make real-time decisions on robot actions that are socially compliant with human expectations. We utilize a perception model to detect important social entities and prompt a VLM to generate guidance for socially compliant robot behavior. VLM-Social-Nav uses a VLM-based scoring module that computes a cost term that ensures socially appropriate and effective robot actions generated by the underlying planner. Our overall approach reduces reliance on large training datasets and enhances adaptability in decision-making. In practice, it results in improved socially compliant navigation in human-shared environments. We demonstrate and evaluate our system in four different real-world social navigation scenarios with a Turtlebot robot. We observe at least 27.38% improvement in the average success rate and 19.05% improvement in the average collision rate in the four social navigation scenarios. Our user study score shows that VLM-Social-Nav generates the most socially compliant navigation behavior.

Daeun Song, Jing Liang, Amirreza Payandeh, Amir Hossain Raj, Xuesu Xiao, Dinesh Manocha• 2024

Related benchmarks

TaskDatasetResultRank
Social Robot NavigationWalking-talk experimental scenario 1.0
NT90
12
Social Robot NavigationQueuing experimental scenario 1.0
NT90
6
Socially-compliant Robot NavigationQueuing scenario 1.0 (test)
NT90
6
Social Robot NavigationWiping glass-wall experimental scenario 1.0
NT32.21
6
Robot navigationPhotography experimental scenario 1.0 (test)
NT27.01
6
Social Robot NavigationPhotography experimental scenario 1.0
NT27.01
6
Social NavigationLISN Reception Desk Arena 3.0 (Scenario b)
Success Rate50
3
Social NavigationLISN Go Forklift in Hurry Arena 3.0 (Scenario d)
Success Rate60
3
Social NavigationLISN Follow Doctor Arena 3.0 (Scenario a)
Success Rate0.00e+0
3
Social NavigationLISN Public Area Arena 3.0 (Scenario c)
Success Rate0.3
3
Showing 10 of 11 rows

Other info

Follow for update