Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Boston

Benchmarks

Task NameDataset NameSOTA ResultTrend
RegressionBoston UCI (test)
RMSE0.2
36
Kernel regressionBoston 20% (test)
RMSE3.303
28
Model Compressionboston
Accuracy / R2 Score87
26
Kernel RegressionBoston 20% n=506 (test)
NLL2.593
20
RegressionBoston
RMSE2.61
17
Regressionboston
R^20.85
13
RF compressionboston
Performance Score85.5
13
RegressionBoston (UCI)
Log-Likelihood-2.301
13
RegressionBoston (test)
NLL2.589
12
RegressionBoston 10% outlier contamination
RMSE3.96
11
RegressionBoston no contamination
RMSE3.24
11
RegressionBoston
RAE0.41
10
RegressionBoston
Average RRSE0.48
10
RegressionBoston
MSE11.789
9
Regressionboston
Training Loss0.3075
8
Bayesian Neural Network InferenceBoston (UCI) (test)
Test Log-Likelihood-2.42
8
Multivariate RegressionBoston
MSE0.0301
6
Prediction Interval estimationboston (test)
PICP91.58
6
RegressionBoston
Average Training Time (s)1.479
6
RegressionBoston
Avg Relative Absolute Error44
6
RegressionBoston
NLL2.35
6
Uncertainty CalibrationBoston
MACE5
6
Faithfulness under retrainingBoston
AURC1.809
5
Out-of-distribution detectionBoston (UCI) (test)
OOD Detection Accuracy96.9
5
RegressionBoston
QICE3.37
5
Showing 25 of 51 rows