Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Temporal Question Answering on FreshQA
Loading...
0.657
AUROC
B1 entropy
0.38868
0.45834
0.528
0.59766
Mar 25, 2026
AUROC
AUROC 95% CI Lower Bound
Updated 23d ago
Evaluation Results
Method
Method
Links
AUROC
AUROC 95% CI Lower Bound
B1 entropy
Cost=Free
2026.03
0.657
0.61
P(True)
Cost=1 call
2026.03
0.399
0.366
Feedback
Search any
task
Search any
task