Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Science

Benchmarks

Task NameDataset NameSOTA ResultTrend
Task RoutingScience
Cost ($)0.0276
15
Taxonomy ExpansionScience (SCI) SemEval-2016 Task 13
Chi-Squared13.2
10
Science ReasoningScience (out-of-distribution)
Accuracy65.12
6
Named Entity RecognitionScience
F1 Score56.3
5
Text-to-SQLScience Benchmark
Execution Accuracy51.8
4
Task-Efficient RoutingScience Curated Task Benchmark 1.0 (test)
Average Cost0.0054
3
Taxonomy ExpansionScience
Prec@144.7
3
Named Entity RecognitionScience English
F1 Score62.29
2
Showing 8 of 8 rows