Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Omission Detection on Custom Dataset
Loading...
64.5
Accuracy
Features+ML
34.236
42.093
49.95
57.807
Mar 17, 2026
Accuracy
Precision
Recall
F1 Score
Updated 9d ago
Evaluation Results
Method
Method
Links
Accuracy
Precision
Recall
F1 Score
Features+ML
2026.03
64.5
66
62.9
63.7
GPT-4o-mini
2026.03
54.7
53.8
66.7
59.5
GPT-5
2026.03
53.3
24.1
71.8
36
Non-LLM Baseline
2026.03
53.2
53.1
53.3
53.2
SelfCheckGPT
2026.03
49.6
48.4
12.9
20.3
SAC3
Backbone=L3.3-70B
2026.03
45.1
38.4
16.1
22.7
O3
2026.03
35.4
20.2
85.6
32.8
Feedback
Search any
task
Search any
task