Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reddit Summary

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reddit Summary AlignmentReddit Summary normalized rewards (test)
Faithfulness Reward0.55
60
Reddit SummaryReddit Summary (Faithful vs Preference 1) (test)
Hypervolume1.23
4
SummarizationReddit Summary
Hyper-volume17.556
2
Showing 3 of 3 rows