
Selective Generation on SciQ (PRR (AlignScore))

65.3 PRR (AlignScore)

HUQ-SATRMD

Updated 2mo ago

Evaluation Results

Method	Date	PRR (AlignScore)
HUQ-SATRMD	2025.02	65.3
Maximum Sequence Probability	2025.02	58.2
SentenceSAR	2025.02	54.3
SATRMD+MSP	2025.02	54.2
Semantic Entropy	2025.02	46.6
DegMat NLI Score Entail.	2025.02	44.6
Eccentricity NLI Score Entail.	2025.02	44.4
SAR	2025.02	44
EigValLaplacian NLI Score Entail.	2025.02	39.8
SAPLMA	2025.02	38.8
EigenScore	2025.02	37.3
Lexical Similarity ROUGE-L	2025.02	36
Factoscope	2025.02	31.6
Perplexity	2025.02	19.7