
Audit

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Counterfactual Explanations | Audit | Coverage: 1 | 6 |
| Log-Investigation Query Generation | Audit Complex (test) | F1 Score: 91 | 2 |
| Log-Investigation Query Generation | Audit Simple (test) | F1 Score: 98.2 | 2 |