Conclusion Generation on Crafted Rules Dataset (test)
[Chart: BLEU by date. Top score: 0.739 (Engine), Feb 18, 2024.]
Updated 4d ago
Evaluation Results

Method    Details                     Date      BLEU
Engine    backbone=Mistral-7b, m...   2024.02   0.739
GPT-4     mode=prompting              2024.02   0.414
GPT-3.5   mode=prompting              2024.02   0.338