Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reasoning, Knowledge, and Biomedicine

Benchmarks

Task NameDataset NameSOTA ResultTrend
General EvaluationReasoning, Knowledge, and Biomedicine combined datasets (test)
Average Score60.47
9
Showing 1 of 1 rows