
Cybermetric

Benchmarks

Task Name                                 | Dataset Name     | SOTA Result   | Trend
Cybersecurity Knowledge Evaluation        | Cybermetric 2000 | Accuracy 94.1 | 17
Domain-specific language task evaluation  | CyberMetric      | Accuracy 86.8 | 12