Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Social Dialogue on SOTOPIA Interaction with GPT-4o-mini

7.53GOAL Score

GPT-4-turbo

6.63566.86787.17.3322Jan 3, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
7.532.54
2025.01
7.532.71
2025.01
7.472.4
2025.01
7.412.54
2025.01
7.382.56
2025.01
7.322.7
2025.01
7.192.13
2025.01
7.182.59
2025.01
7.182.52
6.982.11
6.671.84