Selective Generation on MedQUAD (PRR Metrics)

46.6 PRR (ROUGE-L)

SATRMD+MSP

Updated 2mo ago

Evaluation Results

Method	Date	PRR (ROUGE-L)	PRR (second metric, unlabeled in source)
SATRMD+MSP	2025.02	46.6	57.5
Perplexity	2025.02	42.5	43.8
SAPLMA	2025.02	40.7	49
HUQ-SATRMD	2025.02	38.6	50.6
Factoscope	2025.02	35.8	42.8
Maximum Sequence Probability	2025.02	29.7	35.6
SAR	2025.02	28.6	19.2
Lexical Similarity ROUGE-L	2025.02	25.2	13.2
Semantic Entropy	2025.02	7.5	0.7
Eccentricity NLI Score Entail.	2025.02	7	6
DegMat NLI Score Entail.	2025.02	6.6	16.2
EigValLaplacian NLI Score Entail.	2025.02	5.6	16
EigenScore	2025.02	5	4.3
SentenceSAR	2025.02	1.5	3.3