Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

BC

Benchmarks

Task NameDataset NameSOTA ResultTrend
Causal InferenceBC out-of-sample
sqrt(PEHE)0.69
18
Causal InferenceBC within-sample
sqrt(PEHE)0.73
18
Minority class representationBC
Minority Class Percentage30.5
13
Speech Quality AssessmentBC 19
LCC0.87
12
DCR baseline protection analysisBC
DCR Baseline Protection51.1
12
Membership Inference AttackBC
Success Rate53.6
12
Synthetic Data Evaluation (Column Pair Trends)BC
Column Pair Trends Score97.6
12
Overfitting Protection EvaluationBC
DCR Overfitting Protection0.965
12
Tabular Synthetic Data GenerationBC
Column Shape Score0.987
12
Utility evaluationBC
Balanced Acc72.1
11
Question AnsweringBC
Performance Score6.2
8
Deep ResearchBC (test)
Mean Correct Answer Rate620
8
ClusteringBC
Avg Silhouette Score0.557
7
ClusteringBC
ARI77.9
7
Point-level consensus correctness predictionBC
AUPRC97.1
4
Named Entity RecognitionBC (test)
Average F181.54
4
Showing 16 of 16 rows