Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Premise Generation on Rule Distillation Dataset LLM evaluation 1.0
Loading...
2.77
Accuracy
GPT-4
1.8756
2.1078
2.34
2.5722
Feb 18, 2024
Accuracy
Diversity
Complexity
Abstractness
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Diversity
Complexity
Abstractness
GPT-4
Prompting=Symbolic and...
2024.02
2.77
2.64
1.4
2.32
Engine
Backbone=Mistral-7b, F...
2024.02
2.34
1.89
1.62
2.43
GPT-3.5-Turbo
Prompting=Symbolic and...
2024.02
1.91
1.72
1.06
2.3
Feedback
Search any
task
Search any
task