Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Confidence Estimation on Infeasible Benchmark
Loading...
0.961
Kaware
Verb
0.64276
0.72538
0.808
0.89062
Jan 14, 2026
Kaware
Uaware
Saware
Updated 25d ago
Evaluation Results
Method
Method
Links
Kaware
Uaware
Saware
Verb
2026.01
0.961
0.302
0.631
IC-IDK
2026.01
0.909
0.185
0.547
Prod-Prob
2026.01
0.864
0.187
0.525
Posterior
2026.01
0.852
0.174
0.513
Entropy
2026.01
0.801
0.603
0.702
Min-Prob
2026.01
0.763
0.256
0.51
Prior
2026.01
0.748
0.429
0.589
Fst-Prob
2026.01
0.655
0.271
0.413
Feedback
Search any
task
Search any
task