Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Letter

Benchmarks

Task NameDataset NameSOTA ResultTrend
Symbolic ReasoningLetter
Accuracy92.4
67
ClassificationLETTER (test)
Accuracy96
45
Graph ClassificationLETTER-L TU Dataset
Accuracy98
20
Graph ClassificationLETTER-H TU Dataset
Accuracy81.6
20
LTL Instruction FollowingLetter Finite-horizon (full)
Success Rate (SR)100
19
Constrained Clusteringletter
Success Rate100
18
Multiclass classificationletter (test)
Log Loss (Posterior)0.2656
18
Clusteringletter
CPU Time1.67
17
Clusteringletter
Clustering Inertia123,150.01
17
Binary ClassificationLetter UCI (test)
Accuracy97.5
17
Outlier Detectionletter (historical)
AUROC90.09
17
Off-policy evaluation for classification errorletter
Bias-0.085
15
Outlier Detectionletter (Group II)
AUROC0.9009
14
Binary ClassificationLetter (test)
AUC91.4
13
LTL Instruction FollowingLetter Infinite-horizon (full)
µAcc7.13
10
LTL-guided Reinforcement LearningLetter Finite-horizon (test)
Success Rate (SR)100
9
ClassificationLetter pi+=0.8 UCI (test)
Accuracy97.6
9
ClassificationLetter pi+=0.5 UCI (test)
Accuracy97.5
9
ClassificationLetter (pi+=0.2) UCI (test)
Accuracy97.8
9
Continual ClusteringLetter
AI NMI57.1
8
Outlier Detectionletter (1600, 32) (full)
Recall33
7
Binary Classificationletter LIBSVM (test)
Average AUC0.811
7
ClassificationLETTER (10-fold cross-val)
Test F1 Score65.9
7
ClusteringLetter
Accuracy (ACC)68
6
Off-Policy EvaluationLetter (UCI)
MSE0.0018
6
Showing 25 of 36 rows