Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Letter

Benchmarks

Task NameDataset NameSOTA ResultTrend
Symbolic ReasoningLetter
Accuracy92.4
33
ClassificationLETTER (test)
Accuracy87.1
33
LTL Instruction FollowingLetter Finite-horizon (full)
Success Rate (SR)100
19
Multiclass classificationletter (test)
Log Loss (Posterior)0.2656
18
LTL Instruction FollowingLetter Infinite-horizon (full)
µAcc7.13
10
LTL-guided Reinforcement LearningLetter Finite-horizon (test)
Success Rate (SR)100
9
ClassificationLetter pi+=0.8 UCI (test)
Accuracy97.6
9
ClassificationLetter pi+=0.5 UCI (test)
Accuracy97.5
9
ClassificationLetter (pi+=0.2) UCI (test)
Accuracy97.8
9
Continual ClusteringLetter
AI NMI57.1
8
Outlier Detectionletter (1600, 32) (full)
Recall33
7
Binary Classificationletter LIBSVM (test)
Average AUC0.811
7
ClassificationLETTER (10-fold cross-val)
Test F1 Score65.9
7
Off-Policy EvaluationLetter (UCI)
MSE0.0018
6
ClassificationLETTER
Empirical Risk0.25
6
Active Learning ClassificationLetter
Total Regret2,571
6
Missing Data ImputationLetter
RMSE0.1198
6
LTL-guided Reinforcement LearningLetter Infinite-horizon (test)
µAcc7.13
5
ClusteringLetter
Time0.33
5
Pairwise Learningletter (test)
Average AUC81.1
5
Multiclass Classificationletter
Accuracy97.4
2
LTL-guided Reinforcement LearningLetter Infinite-horizon v1 (test)
µacc-
0
LTL Instruction FollowingLetter Infinite-horizon
µAcc-
0
LTL Instruction FollowingLetter Finite-horizon v1 (test)
Success Rate-
0
Symbolic ReasoningLetter 4
Accuracy-
0
Showing 25 of 25 rows