Share your thoughts, 1 month free Claude Pro on usSee more

Selective Generation on TruthfulQA (PRR AlignScore)

35.3PRR (AlignScore)

SATRMD+MSP

Updated 2mo ago

Evaluation Results

Method	Links
SATRMD+MSP 2025.02		35.3
HUQ-SATRMD 2025.02		30.8
Maximum Sequence Probability 2025.02		27.7
SentenceSAR 2025.02		18.5
Perplexity 2025.02		17.8
Semantic Entropy 2025.02		17.1
DegMat NLI Score Entail. 2025.02		15.6
EigValLaplacian NLI Score Entail. 2025.02		15.2
Eccentricity NLI Score Entail. 2025.02		12.2
SAPLMA 2025.02		11.2
SAR 2025.02		10.5
EigenScore 2025.02		2.3
Factoscope 2025.02		1.7
Lexical Similarity ROUGE-L 2025.02		0.8