SC

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Narrative Understanding | SC | First-Token Accuracy 98.7 | 24 |
| Integer Linear Programming | SC | Feasibility Rate (FR) 100 | 13 |
| Set Covering | SC-4000 | Objective 170.3 | 10 |
| Set Covering | SC-2000 | Objective Value 291.5 | 10 |
| Scientific Claims | SC | R Score 87.3 | 10 |
| Temporal scRNA-seq data modeling | SC Hard Schiebinger | Performance at t=513.66 | 9 |
| scRNA-seq Extrapolation | SC Schier (Medium (Extrapolation)) | Performance at t=1615.24 | 9 |
| scRNA-seq Interpolation | SC Easy Interpolation | Error at t=510.19 | 9 |
| Multi-objective shortest path | SC 100 | Hypervolume (HV) 0.7 | 9 |
| Sentence Classification | SC | MNLI Accuracy 90.53 | 9 |
| Resource Constrained Shortest Path | SC 100 | Objective Value 7.77 | 8 |
| Set Covering | SC Large Scale | Gap Improvement 400 | 8 |
| Set Covering | SC Medium Scale | Gap Improvement 200 | 8 |
| Set Cover | SC 2000 elements (Out-of-Distribution) | Objective Value 9.3 | 7 |
| Set Cover | SC In-Distribution 2000 elements | Objective Value 291.5 | 7 |
| Regression | SC | Average Training Time (s) 2.92 | 6 |
| Speech Classification | SC10 Raw (test) | Accuracy 98.32 | 6 |
| Unconditional generation of event sequences | SC synthetic (test) | MMD 5.8 | 5 |
| Temporal Point Process Generation | SC (test) | MMD 5.8 | 5 |
| Adversarial NSFW Image Generation | SC (test) | ASR-2570 | 5 |
| Density Estimation | SC (test) | MMD 0.08 | 5 |
| Density estimation | SC Synthetic (test) | Wasserstein Distance 0 | 5 |
| Speech Classification | SC10 Raw 0.5x (test) | Accuracy 96.3 | 5 |
| Sarcasm Detection | SC GEN v2 | F1 Score 75 | 5 |
| Sarcasm Detection | SC v1 | F1 Score 69 | 5 |
Showing 25 of 29 rows