
Science

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Task Routing | Science | Cost ($): 0.0276 | 15 |
| Named Entity Recognition | Science | F1 Score: 61.84 | 12 |
| Taxonomy Expansion | Science (SCI) SemEval-2016 Task 13 | Chi-Squared: 13.2 | 10 |
| Science Reasoning | Science (out-of-distribution) | Accuracy: 65.12 | 6 |
| Text-to-SQL | Science Benchmark | Execution Accuracy: 51.8 | 4 |
| Task-Efficient Routing | Science Curated Task Benchmark 1.0 (test) | Average Cost: 0.0054 | 3 |
| Taxonomy Expansion | Science | Prec@1: 44.7 | 3 |
| Named Entity Recognition | Science English | F1 Score: 62.29 | 2 |