Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM-simulated scientists

Benchmarks

Task NameDataset NameSOTA ResultTrend
Automated Research20 LLM-simulated scientists
Alignment Score8.963
7
Automated Research Efficiency Analysis20 LLM-simulated scientists
Average API Calls15.8
5
Showing 2 of 2 rows