Human Judgments

Benchmarks

Task Name	Dataset Name	SOTA Result
Attribution Coverage	Human Judgments	Pearson Correlation (r): 0.97	8
MURGAT-SCORE	Human Judgments	Pearson Correlation (r): 0.86	4
Attribution Precision	Human Judgments	Pearson Correlation (r): 0.65	4
