BC

Benchmarks

| Task Name | Dataset Name | Metric | SOTA Result | Trend |
| --- | --- | --- | --- | --- |
| Web Browsing | BC-plus | EM | 29.6 | 30 |
| Causal Inference | BC out-of-sample | sqrt(PEHE) | 0.69 | 18 |
| Causal Inference | BC within-sample | sqrt(PEHE) | 0.73 | 18 |
| Classification | bc | Accuracy | 99 | 14 |
| RF compression | bc | Performance Score | 98.7 | 14 |
| Model Compression | bc | Accuracy / R2 | 0.99 | 13 |
| Minority class representation | BC | Minority Class Percentage | 30.5 | 13 |
| Speech Quality Assessment | BC 19 | LCC | 0.87 | 12 |
| DCR baseline protection analysis | BC | DCR Baseline Protection | 51.1 | 12 |
| Membership Inference Attack | BC | Success Rate | 53.6 | 12 |
| Synthetic Data Evaluation (Column Pair Trends) | BC | Column Pair Trends Score | 97.6 | 12 |
| Overfitting Protection Evaluation | BC | DCR Overfitting Protection | 0.965 | 12 |
| Tabular Synthetic Data Generation | BC | Column Shape Score | 0.987 | 12 |
| Utility evaluation | BC | Balanced Acc | 72.1 | 11 |
| Question Answering | BC | Performance Score | 6.2 | 8 |
| Deep Research | BC (test) | Mean Correct Answer Rate | 620 | 8 |
| Clustering | BC | Avg Silhouette Score | 0.557 | 7 |
| Clustering | BC | ARI | 77.9 | 7 |
| Point-level consensus correctness prediction | BC | AUPRC | 97.1 | 4 |
| Named Entity Recognition | BC (test) | Average F1 | 81.54 | 4 |
| Vector Optimization (Obtuse Cone) | BC | Epsilon-F1 | 99 | 3 |
| Vector Optimization (Right Cone) | BC | Epsilon-F1 | 96 | 3 |
| Vector Optimization (Acute Cone) | BC | Epsilon-F1 | 95 | 3 |
| Classification | BC (test) | Parameters | 15 | 3 |