
LongForm

Benchmarks

Task Name            Dataset Name   SOTA Result            Trend
Watermark Detection  longform_qa    Accuracy: 100          48
Detection Accuracy   LongForm QA    Accuracy: 99.88        24
Detection            LongForm       Score (gpt-5.1): 100   5
Prevention           LongForm       Score (gpt-5.1): 100   5