Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TabArena

Benchmarks

Task NameDataset NameSOTA ResultTrend
Binary ClassificationTabArena
Elo Rating1,782
74
Multiclass ClassificationTabArena Lite
Elo Rating1,662
63
Tabular ClassificationTabArena Binary
Avg Balanced Accuracy89.32
56
RegressionTabArena Regression
Avg Balanced Acc70.11
56
Multiclass ClassificationTabArena Multiclass (6)
Average Balanced Accuracy0.8109
56
Tabular LearningTabArena
Elo1,663
54
RegressionTabArena Lite
Elo1,779
48
Tabular PredictionTabArena all 51 datasets
Elo Rating1,800
38
Tabular PredictionTabArena small (full 681 tasks)
Elo Rating1,723
26
Tabular PredictionTabArena medium 10k-100k (full)
Elo Rating2,146
26
RegressionTabArena
Elo Rating1,959
26
Tabular Data LearningTabArena-Lite
Elo Rating1,651
11
Binary ClassificationTabArena Customer v0.1 (test)
ROC AUC0.738
10
ClassificationTabarena CLS
AUC0.8638
9
RegressionTabarena REG
RMSE0.3919
9
Tabular ClassificationTabArena
Mean Rank AUC OVO6.3
7
Showing 16 of 16 rows