Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Performance Bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
General PerformancePerformance Bench Reasoning & Knowledge
Average Score78.37
9
General Performance EvaluationPerformance Bench Aggregate
Average Score82.49
9
Showing 2 of 2 rows