Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Synthetic Benchmark

Benchmarks

Task NameDataset NameSOTA ResultTrend
Triton Kernel GenerationSynthetic Benchmark Overall All Levels
Average Speedup1.57
7
Triton Kernel GenerationSynthetic Benchmark Level 20
Accuracy99
7
Triton Kernel GenerationSynthetic Benchmark (Level 5)
Acc99
7
Triton Kernel GenerationSynthetic Benchmark Level 2
Accuracy96
7
Triton Kernel GenerationSynthetic Benchmark Level 1
Accuracy86.8
7
Shortest Pathsynthetic benchmark
Accuracy95
7
Edge ExistenceSynthetic Benchmark
Accuracy99.7
7
Node DegreeSynthetic Benchmark
Accuracy99.75
7
Triangle Countsynthetic benchmark
Accuracy74.35
7
Cycle Checksynthetic benchmark
Accuracy99.9
7
Edge CountSynthetic Benchmark
Accuracy94.95
7
Node Countsynthetic benchmark 1.0 (test)
Accuracy100
7
Feature AttributionSynthetic benchmark softplus aggregator nonlinear f (test)
MAE0.365
6
Dynamic causal graph trackingSynthetic benchmark semi-synthetic health data (test)
Direction Accuracy91
6
Learning to DeferSynthetic benchmark (test)
Test True Risk28.1
6
CATE estimationSynthetic Benchmark range do(D) ∈ [-2.5, 2.5] (in-sample)
RMSE0.36
5
RegressionSynthetic benchmark with planted ground truth N=1,000, d=8 (test)
R20.961
5
Cluster Validity Index Evaluation10 Synthetic Benchmark Datasets varying d from 10 to 500
Mean SCOPE96.3
5
Online Bayesian calibrationSynthetic benchmark Mixed(3)
Theta RMSE0.02
5
Online Bayesian calibrationSynthetic benchmark Sudden(3)
RMSE (Theta)0.018
5
Online Bayesian calibrationSynthetic benchmark Drifting
RMSE ($ heta$)0.014
5
Bokeh RenderingSynthetic Benchmark
RMSE0.0133
5
Domain AdaptationSynthetic Benchmark
Geometry Score58
4
Generative model evaluation metric validationSynthetic benchmark 2025 (test)
Metric-
0
Showing 24 of 24 rows