Sentence-level Error Detection on HQ2A 1.0 (test)
Exact Accuracy: 25.49 (GPT-3.5-Turbo, Jul 16, 2024)
Updated 4d ago
Evaluation Results
Method | Exact Accuracy | Adjacent Accuracy | Mismatch Rate | Weighted Accuracy | Consistency Score (SRC)
GPT-3.5-Turbo (Approach=Zero-shot, 2024.07) | 25.49 | 11.76 | 62.75 | 37.65 | 0.99