Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DocPairBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Human Preference PredictionDocPairBench
Gov. Preference Score89.3
12
Showing 1 of 1 rows