Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SecGenEval

Benchmarks

Task NameDataset NameSOTA ResultTrend
Issue-level LocalizationSecGenEval-PS CodeAnalysis
Success Rate @ Issue59.1
11
Binary AccuracySecGenEval-PS CodeAnalysis
Accuracy100
11
Showing 2 of 2 rows