Share your thoughts, 1 month free Claude Pro on usSee more

Conversation Evaluation on Robot Domain

88.57Human Score

GOOD

Updated 2mo ago

Evaluation Results

Method	Links
GOOD 2025.08		88.57	86.9
GOOD 2025.08		87.22	89.1
Full Context 2025.08		66.54	84.63