Share your thoughts, 1 month free Claude Pro on usSee more

Uncertainty Estimation on AmbigQA

78.5AUROC

Kernel Lang. Ent.

Updated 3mo ago

Evaluation Results

Method	Links
Kernel Lang. Ent. 2026.04		78.5
Total 2026.04		76.8
SelfCheckGPT 2026.04		73.3
Aleatoric 2026.04		71.6
Closeness Centrality 2026.04		68.3
SemanticEntropy 2026.04		67.8
SC + VC 2026.04		67.1
Perplexity 2026.04		66.1
SC Based VC 2026.04		65.8
Max Token Prob. 2026.04		65.8
Mean Token Entropy 2026.04		65.4
Token Entropy 2026.04		65.4
Max Sequence Prob. 2026.04		65.1
SC Score 2026.04		64.9
PTrue 2026.04		60.4
Self Certainty 2026.04		56.9