Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

WaterBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
SummarizationWaterBench (test)
GM22.03
11
Reasoning & CodingWaterBench (test)
GM59.82
11
Long-form QAWaterBench (test)
GM Score24.06
11
Diffusion Language Model WatermarkingWaterBench 600 prompts 2024
PPL2.8
9
Text Generation Quality EvaluationWaterBench 1000 prompts
PPL9.878
6
Watermarking DetectionWaterBench 1000 prompts
Completeness98.3
5
Showing 6 of 6 rows