Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TabArena

Benchmarks

Task NameDataset NameSOTA ResultTrend
Tabular ClassificationTabArena Binary
Avg Balanced Accuracy89.32
56
RegressionTabArena Regression
Avg Balanced Acc70.11
56
Multiclass ClassificationTabArena Multiclass (6)
Average Balanced Accuracy0.8109
56
Binary ClassificationTabArena
Elo Rating1,501
48
Multiclass ClassificationTabArena Lite
Elo Rating1,521
48
RegressionTabArena Lite
Elo1,779
48
Binary ClassificationTabArena Customer v0.1 (test)
ROC AUC0.738
10
ClassificationTabarena CLS
AUC0.8638
9
RegressionTabarena REG
RMSE0.3919
9
Tabular ClassificationTabArena
Mean Rank AUC OVO6.3
7
Showing 10 of 10 rows