Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Conversational Behavior Reasoning on CANDOR
Loading...
0.58
BLEU-1
GoT
0.551
0.5655
0.58
0.5945
Dec 25, 2025
BLEU-1
ROUGE-1
ROUGE-L
Similarity Score
Updated 4d ago
Evaluation Results
Method
Method
Links
BLEU-1
ROUGE-1
ROUGE-L
Similarity Score
GoT
2025.12
0.58
0.56
0.49
0.66
Feedback
Search any
task
Search any
task