Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Social Interaction on SOTOPIA all social scenarios
Loading...
7.62
Goal Score
Expert (GPT-4)
4.968
5.6565
6.345
7.0335
Mar 13, 2024
Goal Score
Overall Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Goal Score
Overall Score
Expert (GPT-4)
Backbone=GPT-4
2024.03
7.62
3.31
BC+SR
Backbone=Mistral-7B, M...
2024.03
7.62
3.44
Behavior Cloning (BC)
Backbone=Mistral-7B, M...
2024.03
7.27
3.41
Self-Reinforcement (SR)
Backbone=Mistral-7B, M...
2024.03
5.83
2.57
Base (Mistral-7B)
Backbone=Mistral-7B
2024.03
5.07
2.33
Feedback
Search any
task
Search any
task