Conversational Behavior Reasoning on Synthetic
[Chart: BLEU-1 by method; GoT at 0.48 as of Dec 25, 2025. Selectable metrics: BLEU-1, ROUGE-1, ROUGE-L, Similarity Score.]
Evaluation Results
Method          BLEU-1   ROUGE-1   ROUGE-L   Similarity Score
GoT (2025.12)   0.48     0.47      0.42      0.52