Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Uncertainty Estimation on CSQA
Loading...
0.769
AUROC
BSDETECTOR
0.47884
0.55417
0.6295
0.70483
Aug 30, 2023
AUROC
Updated 3d ago
Evaluation Results
Method
Method
Links
AUROC
BSDETECTOR
LLM=GPT-3.5 Turbo
2023.08
0.769
BSDETECTOR
LLM=Text-Davinci-003
2023.08
0.743
Temperature Sampling
LLM=GPT-3.5 Turbo
2023.08
0.583
Temperature Sampling
LLM=Text-Davinci-003
2023.08
0.54
Self-reflection Certainty
LLM=Text-Davinci-003
2023.08
0.539
Self-reflection Certainty
LLM=GPT-3.5 Turbo
2023.08
0.506
Likelihood Based Uncertainty
LLM=Text-Davinci-003
2023.08
0.49
Feedback
Search any
task
Search any
task