Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PARAREL

Benchmarks

Task NameDataset NameSOTA ResultTrend
Knowledge ElicitationPARAREL (test)
Success Rate71.36
28
Hallucination predictionParaRel + domain
Accuracy69.24
10
Hallucination predictionParaRel original
Accuracy82.29
10
Showing 3 of 3 rows