Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Binary Inconsistency Detection on LLM
Loading...
70.27
Accuracy
ChatGPT-SpanMoE
38.6228
46.8389
55.055
63.2711
Jun 5, 2024
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
ChatGPT-SpanMoE
Setting=Few-Shot
2024.06
70.27
ChatGPT-SpanMoE
Setting=Zero-Shot
2024.06
67.96
ChatGPT-Span
Setting=Few-Shot
2024.06
64.84
ChatGPT-Span
Setting=Zero-Shot
2024.06
63.89
ChatGPT-DA
Setting=Few-Shot
2024.06
61.61
ChatGPT-DA
Setting=Zero-Shot
2024.06
60.34
SummaC
Setting=Zero-Shot
2024.06
49.7
QuestEval
2024.06
49.47
SummaC-Conv
2024.06
46.92
QAFactEval
2024.06
39.84
Feedback
Search any
task
Search any
task