
T/F

Benchmarks

Task Name    Dataset Name    SOTA Result                  Trend
Detection    T/F             GPT-5.1 Score (T/F): 94.7    5
Prevention   T/F             gpt-5.1 Score: 100           5