Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SGI-bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Idea Generation EvaluationSGI-bench
Novelty80.53
7
Scientific General IntelligenceSGI-bench
Deep Research37.74
6
Showing 2 of 2 rows