
Letter

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Symbolic Reasoning | Letter | Accuracy 92.4 | 67 |
| Classification | LETTER (test) | Accuracy 96 | 52 |
| Online Class-Incremental Learning | Letter | Final Mean Accuracy 89.5 | 26 |
| Model Extraction | Letter-low | Fidelity 89.2 | 24 |
| Graph Classification | LETTER-L TU Dataset | Accuracy 98 | 20 |
| Graph Classification | LETTER-H TU Dataset | Accuracy 81.6 | 20 |
| LTL Instruction Following | Letter Finite-horizon (full) | Success Rate (SR) 100 | 19 |
| Constrained Clustering | letter | Success Rate 100 | 18 |
| Multiclass classification | letter (test) | Log Loss (Posterior) 0.2656 | 18 |
| Clustering | Letter | ARI 0.189 | 18 |
| Clustering | letter | CPU Time 1.67 | 17 |
| Clustering | letter | Clustering Inertia 123,150.01 | 17 |
| Binary Classification | Letter UCI (test) | Accuracy 97.5 | 17 |
| Outlier Detection | letter (historical) | AUROC 90.09 | 17 |
| Off-policy evaluation for classification error | letter | Bias -0.085 | 15 |
| Continual Clustering | Letter | AI NMI 57.1 | 15 |
| Outlier Detection | letter (Group II) | AUROC 0.9009 | 14 |
| Binary Classification | Letter (test) | AUC 91.4 | 13 |
| Online Class Incremental Learning | Letter | Average Forgetting 4.1 | 11 |
| LTL Instruction Following | Letter Infinite-horizon (full) | µAcc 7.13 | 10 |
| LTL-guided Reinforcement Learning | Letter Finite-horizon (test) | Success Rate (SR) 100 | 9 |
| Classification | Letter pi+=0.8 UCI (test) | Accuracy 97.6 | 9 |
| Classification | Letter pi+=0.5 UCI (test) | Accuracy 97.5 | 9 |
| Classification | Letter (pi+=0.2) UCI (test) | Accuracy 97.8 | 9 |
| Nonstationary backward transfer | Letter | BWT-ARI -0.038 | 7 |
Showing 25 of 45 rows