Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PARAREL

Benchmarks

Task NameDataset NameSOTA ResultTrend
Knowledge ElicitationPARAREL (test)
Success Rate71.36
28
Hallucination predictionParaRel + domain
Accuracy69.24
10
Hallucination predictionParaRel original
Accuracy82.29
10
Fact AttributionPARAREL never seen before
Macro-F180.6
6
Showing 4 of 4 rows