Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Scientific Ideation

Benchmarks

Task NameDataset NameSOTA ResultTrend
Scientific IdeationScientific Ideation 60 samples human-validated (test)
Novelty3.63
9
Scientific IdeationScientific Ideation Out-of-Domain
GPT-5.2 Score59
2
Showing 2 of 2 rows