Conclusion Generation on Rule Distillation Dataset LLM evaluation 1.0
[Chart: Accuracy over time. Top point: GPT-4, Accuracy 2.53, Feb 18, 2024]
Evaluation Results
Method          Details                    Date      Accuracy
GPT-4           Prompting=Symbolic and...  2024.02   2.53
Engine          Backbone=Mistral-7b, F...  2024.02   2.44
GPT-3.5-Turbo   Prompting=Symbolic and...  2024.02   2.38